scholarly journals The robustness of two phylogenetic methods: four-taxon simulations reveal a slight superiority of maximum likelihood over neighbor joining.

2020 ◽  
Vol 8 (1) ◽  
pp. 110-180
Author(s):  
David Goldstein

Abstract The last twenty or so years have witnessed a dramatic increase in the use of computational methods for inferring linguistic phylogenies. Although the results of this research have been controversial, the methods themselves are an undeniable boon for historical and Indo-European linguistics, if for no other reason than that they allow the field to pursue questions that were previously intractable. After a review of the advantages and disadvantages of computational phylogenetic methods, I introduce the following methods of phylogenetic inference in R: maximum parsimony; distance-based methods (UPGMA and neighbor joining); and maximum likelihood estimation. I discuss the strengths and weaknesses of each of these methods and in addition explicate various measures associated with phylogenetic estimation, including homoplasy indices and bootstrapping. Phylogenetic inference is carried out on the Indo-European dataset compiled by Don Ringe and Ann Taylor, which includes phonological, morphological, and lexical characters.


2017 ◽  
Vol 51 (s38) ◽  
Author(s):  
Alexei S. Kassian

AbstractThis paper deals with the problem of linguistic homoplasy (parallel or backward development), how it can be detected, what kinds of linguistic homoplasy can be distinguished and which varieties of the phenomenon are the most deleterious for the reconstruction of language phylogeny. It is proposed that language phylogeny reconstruction should consist of two main stages. Firstly, a strict consensus tree should be built on the basis of high-quality input data elaborated with the help of the main phylogenetic methods (such as Neighbor-joining, Bayesian MCMC, and Maximum parsimony), and ancestral character states, allowing us to reveal a certain number of homoplastic characters. Secondly, after the detected instances of homoplasy are eliminated from the input matrix, the consensus tree is to be compiled again. It is expected that after homoplastic optimization it will be possible to better resolve individual “problem clades”, and generally the homoplasy-optimized phylogeny should be more robust than the tree constructed initially. The proposed procedure is tested on the 110-item Swadesh wordlists of the Lezgian and Tsezic groups. The Lezgian and Tsezic results generally support theoretical expectations. The MLN (minimal lateral network) method, currently implemented in the LingPy software, is a helpful tool for the detection of linguistic homoplasy.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e6984 ◽  
Author(s):  
Justin B. Miller ◽  
Lauren M. McKinnon ◽  
Michael F. Whiting ◽  
Perry G. Ridge

Background Common phylogenomic approaches for recovering phylogenies are often time-consuming and require annotations for orthologous gene relationships that are not always available. In contrast, alignment-free phylogenomic approaches typically use structure and oligomer frequencies to calculate pairwise distances between species. We have developed an approach to quickly calculate distances between species based on codon aversion. Methods Utilizing a novel alignment-free character state, we present CAM, an alignment-free approach to recover phylogenies by comparing differences in codon aversion motifs (i.e., the set of unused codons within each gene) across all genes within a species. Synonymous codon usage is non-random and differs between organisms, between genes, and even within a single gene, and many genes do not use all possible codons. We report a comprehensive analysis of codon aversion within 229,742,339 genes from 23,428 species across all kingdoms of life, and we provide an alignment-free framework for its use in a phylogenetic construct. For each species, we first construct a set of codon aversion motifs spanning all genes within that species. We define the pairwise distance between two species, A and B, as one minus the number of shared codon aversion motifs divided by the total codon aversion motifs of the species, A or B, containing the fewest motifs. This approach allows us to calculate pairwise distances even when substantial differences in the number of genes or a high rate of divergence between species exists. Finally, we use neighbor-joining to recover phylogenies. Results Using the Open Tree of Life and NCBI Taxonomy Database as expected phylogenies, our approach compares well, recovering phylogenies that largely match expected trees and are comparable to trees recovered using maximum likelihood and other alignment-free approaches. Our technique is much faster than maximum likelihood and similar in accuracy to other alignment-free approaches. Therefore, we propose that codon aversion be considered a phylogenetically conserved character that may be used in future phylogenomic studies. Availability CAM, documentation, and test files are freely available on GitHub at https://github.com/ridgelab/cam.


Phytotaxa ◽  
2016 ◽  
Vol 244 (3) ◽  
pp. 263 ◽  
Author(s):  
İsmail Eker ◽  
Buhara Yücesan ◽  
Muhammad Sameeullah ◽  
Walter Welß ◽  
Frieder Müller-Uri ◽  
...  

This study analyses phylogenetic and morphological relationships regarding the taxonomy of Digitalis sect. Globiflorae. Progesterone 5β-reductase (P5βR) is an important enzyme for cardenolide biosynthesis as well as a novel genetic marker inferring phylogenetic and biogeographic relationships in many plant species. Phylogenetic inference was conducted using the neighbor-joining method and maximum likelihood model on cDNA sequences of highly conserved P5βR (97.0–99.6%) isolated from members of sect. Globiflorae, including Digitalis trojana, D. cariensis, D. lamarckii, D. lanata subsp. lanata, D. ferruginea subsp. ferruginea and D. ferruginea subsp. schischkinii. Our phylogenetic and morphological results largely support Davis’ system of the genus Digitalis in the Flora of Turkey. We propose the existence of four species and three subspecies in sect. Globiflorae for Turkey. Digitalis cariensis is a distinct species endemic to Turkey. In spite of high morphological similarities, D. lamarckii was separated from D. trojana and D. lanata. Digitalis trojana is accepted at the infraspecific rank as D. lanata subsp. trojana. In addition, D. ferruginea is ranked under the two subspecies following Werner’s treatment. Our study presents a new understanding of the speciation patterns of endemic and non-endemic eastern Digitalis species distributed in Anatolia, Turkey, underlying molecular, biogeographical and comparative morphological analyses.


2016 ◽  
Vol 7 (1) ◽  
Author(s):  
Sayed Amer ◽  
Salih A. Basaid ◽  
Esmat Ali

A fragment of 772 bp of the chloroplast maturase K gene was amplified and sequenced for <em>Rosa x damascena trigintipetala</em> variety growing in Taif region of Saudi Arabia. The data were aligned with their counterparts of other varieties already found in the Genbank database and were analyzed by maximum-parsimony, neighbor-joining and maximum-likelihood methods and a single rooted tree was executed. <em>R. x damascena trigintipetala</em> was paraphyletic where one sample [A] clustered with all varieties while the second [B] was basal. <em>R. x damascena</em> was sister to <em>R. x chinensis semperflorens</em> with the later being basal. <em>R. x damascena gori</em> was basal for all taxa studied. <em>R. moschata</em> was inside the clade of <em>R. x damascena</em>. Hybridization could be possible among <em>R. damascena, R. chinensis</em> and <em>R. moschata</em>. The genetic distance and tree topology indicated that [A] variety could be originated from <em>R. moshata</em> while [B] could be originated from gori or <em>R. chinensis semperflorens</em>. We, therefore, may consider that <em>R. x damascena gori</em> or <em>R. chinensis</em> could be the origin of all nowadays <em>R. x damascena</em> varieties.


Sign in / Sign up

Export Citation Format

Share Document