scholarly journals A transcriptome-based study on the phylogeny and evolution of the taxonomically controversial subfamily Apioideae (Apiaceae)

2020 ◽  
Vol 125 (6) ◽  
pp. 937-953 ◽  
Author(s):  
Jun Wen ◽  
Yan Yu ◽  
Deng-Feng Xie ◽  
Chang Peng ◽  
Qing Liu ◽  
...  

Abstract Background and Aims A long-standing controversy in the subfamily Apioideae concerns relationships among the major lineages, which has prevented a comprehensive study of their fruits and evolutionary history. Here we use single copy genes (SCGs) generated from transcriptome datasets to generate a reliable species tree and explore the evolutionary history of Apioideae. Methods In total, 3351 SCGs were generated from 27 transcriptome datasets and one genome, and further used for phylogenetic analysis using coalescent-based methods. Fruit morphology and anatomy were studied in combination with the species tree. Eleven SCGs were screened out for dating analysis with two fossils selected for calibration. Key Results A well-supported species tree was generated with a topology [Chamaesieae, (Bupleureae, (Pleurospermeae, (Physospermopsis Clade, (Group C, (Group A, Group B)))))] that differed from previous trees. Daucinae and Torilidinae were not in the tribe Scandiceae and existed as sister groups to the Acronema Clade. Five branches (I–V) of the species tree showed low quartet support but strong local posterior probabilities. Dating analysis suggested that Apioideae originated around 56.64 Mya (95 % highest posterior density interval, 45.18–73.53 Mya). Conclusions This study resolves a controversial phylogenetic relationship in Apioideae based on 3351 SCGs and coalescent-based species tree estimation methods. Gene trees that contributed to the species tree may undergoing rapid evolutionary divergence and incomplete lineage sorting. Fruits of Apioideae might have evolved in two directions, anemochorous and hydrochorous, with epizoochorous as a derived mode. Molecular and morphological evidence suggests that Daucinae and Torilidinae should be restored to the tribe level. Our results provide new insights into the morphological evolution of this subfamily, which may contribute to a better understanding of species diversification in Apioideae. Molecular dating analysis suggests that uplift of the Qinghai–Tibetan Plateau (QTP) and climate changes probably drove rapid speciation and diversification of Apioideae in the QTP region.

2019 ◽  
Author(s):  
Qiqing Tao ◽  
Koichiro Tamura ◽  
Beatriz Mello ◽  
Sudhir Kumar

AbstractConfidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in these analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density (HPD) intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, i.e., the true time is contained within the CIs with a 95% probability. These developments will encourage broader use of computationally-efficient RelTime approach in molecular dating analyses and biological hypothesis testing.


2019 ◽  
Vol 37 (1) ◽  
pp. 280-290 ◽  
Author(s):  
Qiqing Tao ◽  
Koichiro Tamura ◽  
Beatriz Mello ◽  
Sudhir Kumar

Abstract Confidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in dating analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, that is, the actual time is contained within the CIs with a 94% probability. These developments will encourage broader use of computationally efficient RelTime approaches in molecular dating analyses and biological hypothesis testing.


2012 ◽  
Vol 279 (1747) ◽  
pp. 4617-4625 ◽  
Author(s):  
Oliver Haddrath ◽  
Allan J. Baker

The origin and timing of the diversification of modern birds remains controversial, primarily because phylogenetic relationships are incompletely resolved and uncertainty persists in molecular estimates of lineage ages. Here, we present a species tree for the major palaeognath lineages using 27 nuclear genes and 27 archaic retroposon insertions. We show that rheas are sister to the kiwis, emu and cassowaries, and confirm ratite paraphyly because tinamous are sister to moas. Divergence dating using 10 genes with broader taxon sampling, including emu, cassowary, ostrich, five kiwis, two rheas, three tinamous, three extinct moas and 15 neognath lineages, suggests that three vicariant events and possibly two dispersals are required to explain their historical biogeography. The age of crown group birds was estimated at 131 Ma (95% highest posterior density 122–138 Ma), similar to previous molecular estimates. Problems associated with gene tree discordance and incomplete lineage sorting in birds will require much larger gene sets to increase species tree accuracy and improve error in divergence times. The relatively rapid branching within neoaves pre-dates the extinction of dinosaurs, suggesting that the genesis of the radiation within this diverse clade of birds was not in response to the Cretaceous–Paleogene extinction event.


2020 ◽  
Author(s):  
Liming Cai ◽  
Zhenxiang Xi ◽  
Emily Moriarty Lemmon ◽  
Alan R Lemmon ◽  
Austin Mast ◽  
...  

Abstract The genomic revolution offers renewed hope of resolving rapid radiations in the Tree of Life. The development of the multispecies coalescent (MSC) model and improved gene tree estimation methods can better accommodate gene tree heterogeneity caused by incomplete lineage sorting (ILS) and gene tree estimation error stemming from the short internal branches. However, the relative influence of these factors in species tree inference is not well understood. Using anchored hybrid enrichment, we generated a data set including 423 single-copy loci from 64 taxa representing 39 families to infer the species tree of the flowering plant order Malpighiales. This order includes nine of the top ten most unstable nodes in angiosperms, which have been hypothesized to arise from the rapid radiation during the Cretaceous. Here, we show that coalescent-based methods do not resolve the backbone of Malpighiales and concatenation methods yield inconsistent estimations, providing evidence that gene tree heterogeneity is high in this clade. Despite high levels of ILS and gene tree estimation error, our simulations demonstrate that these two factors alone are insufficient to explain the lack of resolution in this order. To explore this further, we examined triplet frequencies among empirical gene trees and discovered some of them deviated significantly from those attributed to ILS and estimation error, suggesting gene flow as an additional and previously unappreciated phenomenon promoting gene tree variation in Malpighiales. Finally, we applied a novel method to quantify the relative contribution of these three primary sources of gene tree heterogeneity and demonstrated that ILS, gene tree estimation error, and gene flow contributed to 10.0%, 34.8%, and 21.4% of the variation, respectively. Together, our results suggest that a perfect storm of factors likely influence this lack of resolution, and further indicate that recalcitrant phylogenetic relationships like the backbone of Malpighiales may be better represented as phylogenetic networks. Thus, reducing such groups solely to existing models that adhere strictly to bifurcating trees greatly oversimplifies reality, and obscures our ability to more clearly discern the process of evolution.


Author(s):  
María B Pisano ◽  
Andrés C A Culasso ◽  
Nancy Altabert ◽  
Maribel G Martínez Wassaf ◽  
Silvia V Nates ◽  
...  

Abstract Background Few studies about the evolutionary history of the hepatitis E virus (HEV) have been conducted. The aim of our work was to investigate and make inferences about the origin and routes of dispersion of HEV-3 in Argentina. Methods Phylogenetic, coalescent and phylogeographic analyses were performed using a 322-bp ORF2 genomic fragment of all HEV-3 sequences with known date and place of isolation published at GenBank until May 2018 (n=926), including 16 Argentinian sequences (isolated from pigs, water and humans). Results Phylogenetic analysis revealed two clades within HEV-3: abchij and efg. All Argentinian samples were grouped intermingled within clade 3abchij. The coalescent analysis showed that the most recent common ancestor for the clade 3abchij would have existed around the year 1967 (95% highest posterior density (HPD): 1963–1970). The estimated substitution rate was 1.01×10−2 (95%HPD: 9.3×10ˉ3–1.09×10ˉ2) substitutions/site/y, comparable with the rate previously described. The phylogeographic approach revealed a correspondence between phylogeny and place of origin for Argentinian samples, suggesting many HEV introductions in the country, probably from Europe and Japan. Conclusions This is the first evolutionary inference of HEV-3 that includes Argentinian strains, showing the circulation of many HEV-3 subtypes, obtained from different sources and places, with recent diversification processes. Accession numbers [KX812460], [KX812461], [KX812462], [KX812465], [KX812466], [KX812467], [KX812468], [KX812469].


2020 ◽  
Author(s):  
Mahim Mahbub ◽  
Zahin Wahab ◽  
Rezwana Reaz ◽  
M. Saifur Rahman ◽  
Md. Shamsuzzoha Bayzid

AbstractMotivationSpecies tree estimation from genes sampled from throughout the whole genome is complicated due to the gene tree-species tree discordance. Incomplete lineage sorting (ILS) is one of the most frequent causes for this discordance, where alleles can coexist in populations for periods that may span several speciation events. Quartet-based summary methods for estimating species trees from a collection of gene trees are becoming popular due to their high accuracy and statistical guarantee under ILS. Generating quartets with appropriate weights, where weights correspond to the relative importance of quartets, and subsequently amalgamating the weighted quartets to infer a single coherent species tree allows for a statistically consistent way of estimating species trees. However, handling weighted quartets is challenging.ResultsWe propose wQFM, a highly accurate method for species tree estimation from multi-locus data, by extending the quartet FM (QFM) algorithm to a weighted setting. wQFM was assessed on a collection of simulated and real biological datasets, including the avian phylogenomic dataset which is one of the largest phylogenomic datasets to date. We compared wQFM with wQMC, which is the best alternate method for weighted quartet amalgamation, and with ASTRAL, which is one of the most accurate and widely used coalescent-based species tree estimation methods. Our results suggest that wQFM matches or improves upon the accuracy of wQMC and ASTRAL.AvailabilitywQFM is available in open source form at https://github.com/Mahim1997/wQFM-2020.


2018 ◽  
Vol 96 (3) ◽  
pp. 443 ◽  
Author(s):  
Andrés Ernesto Ortiz-Rodriguez ◽  
Eydi Yanina Guerrero ◽  
Juan Francisco Ornelas

<p><strong>Background:</strong> The phylogenetic relationships of the <em>Bursera</em>-host specialist <em>Psittacanthus nudus</em>, <em>P. palmeri </em>and <em>P. sonorae</em> (Loranthaceae) remain uncertain. These mistletoe species exhibit morphological and phenological innovations probably related to their dry habitats, so that determining their phylogenetic position is key to the understanding of factors associated with the morphological evolution within <em>Psittacanthus</em>.</p><p><strong>Questions:</strong> (1) Is the evolution of some morphological innovations in the <em>Bursera</em>-host specialists associated with the ecological conditions linked to host diversification? (2) Does time of diversification in both lineages coincide?<strong></strong></p><p><strong>Study species:</strong> Fourteen species of <em>Psittacanthus</em>.</p><p><strong>Methods: </strong>Sequences of nuclear (ITS) and plastid (<em>trnL-trnF</em>) markers are analyzed with Bayesian inference, maximum likelihood and maximum parsimony methods, and molecular dating under a Bayesian approach estimated to elucidate the phylogenetic position and divergence timing of the<em> Bursera</em>-host specialists.</p><p><strong>Results:</strong> The <em>Bursera</em>-host specialists form a strongly supported clade, named here the ‘<em>Bursera</em> group’. The divergence time for the <em>Bursera</em>-host specialists was estimated at 7.89 Ma. Interestingly, phylogenetic relationships between <em>P. nudus</em> and <em>P. palmeri</em>, as currently circumscribed, were not fully resolved, making <em>P. palmeri</em> paraphyletic.</p><p><strong>Conclusions</strong>: Based on these results, the plants collected by type locality of <em>P. nudus</em> in Honduras should be named <em>P. palmeri</em>. The seasonal deciduousness of <em>P. palmeri </em>(including <em>P. nudus</em>) and morphology of <em>P. sonorae</em> (small size, fleshy leaves) are clearly adaptations to dry ecosystems where these species have lived for a long time. In parallel, the evolutionary history of these mistletoes seems to be correlated with the evolutionary history and diversification patterns of <em>Bursera</em>.</p>


2020 ◽  
Author(s):  
Liming Cai ◽  
Zhenxiang Xi ◽  
Emily Moriarty Lemmon ◽  
Alan R. Lemmon ◽  
Austin Mast ◽  
...  

ABSTRACTThe genomic revolution offers renewed hope of resolving rapid radiations in the Tree of Life. The development of the multispecies coalescent (MSC) model and improved gene tree estimation methods can better accommodate gene tree heterogeneity caused by incomplete lineage sorting (ILS) and gene tree estimation error stemming from the short internal branches. However, the relative influence of these factors in species tree inference is not well understood. Using anchored hybrid enrichment, we generated a data set including 423 single-copy loci from 64 taxa representing 39 families to infer the species tree of the flowering plant order Malpighiales. This order alone includes nine of the top ten most unstable nodes in angiosperms, and the recalcitrant relationships along the backbone of the order have been hypothesized to arise from the rapid radiation during the Cretaceous. Here, we show that coalescent-based methods do not resolve the backbone of Malpighiales and concatenation methods yield inconsistent estimations, providing evidence that gene tree heterogeneity is high in this clade. Despite high levels of ILS and gene tree estimation error, our simulations demonstrate that these two factors alone are insufficient to explain the lack of resolution in this order. To explore this further, we examined triplet frequencies among empirical gene trees and discovered some of them deviated significantly from those attributed to ILS and estimation error, suggesting gene flow as an additional and previously unappreciated phenomenon promoting gene tree variation in Malpighiales. Finally, we applied a novel method to quantify the relative contribution of these three primary sources of gene tree heterogeneity and demonstrated that ILS, gene tree estimation error, and gene flow contributed to 15%, 52%, and 32% of the variation, respectively. Together, our results suggest that a perfect storm of factors likely influence this lack of resolution, and further indicate that recalcitrant phylogenetic relationships like the backbone of Malpighiales may be better represented as phylogenetic networks. Thus, reducing such groups solely to existing models that adhere strictly to bifurcating trees greatly oversimplifies reality, and obscures our ability to more clearly discern the process of evolution.


2015 ◽  
Author(s):  
Pranjal Vachaspati ◽  
Tandy Warnow

Background: Incomplete lineage sorting (ILS), modelled by the multi-species coalescent (MSC), is known to create discordance between gene trees and species trees, and lead to inaccurate species tree estimations unless appropriate methods are used to estimate the species tree. While many statistically consistent methods have been developed to estimate the species tree in the presence of ILS, only ASTRAL-2 and NJst have been shown to have good accuracy on large datasets. Yet, NJst is generally slower and less accurate than ASTRAL-2, and cannot run on some datasets. Results: We have redesigned NJst to enable it to run on all datasets, and we have expanded its design space so that it can be used with different distance-based tree estimation methods. The resultant method, ASTRID, is statistically consistent under the MSC model, and has accuracy that is competitive with ASTRAL-2. Furthermore, ASTRID is much faster than ASTRAL-2, completing in minutes on some datasets for which ASTRAL-2 used hours. Conclusions: ASTRID is a new coalescent-based method for species tree estimation that is competitive with the best current method in terms of accuracy, while being much faster. ASTRID is available in open source form on github.


2015 ◽  
Author(s):  
Jed Chou ◽  
Ashu Gupta ◽  
Shashank Yaduvanshi ◽  
Ruth Davidson ◽  
Mike Nute ◽  
...  

Background: Species tree estimation is challenging in the presence of incomplete lineage sorting (ILS), which can make gene trees different from the species tree. Because ILS is expected to occur and the standard concatenation approach can return incorrect trees with high support in the presence of ILS, “coalescent-based” summary methods (which first estimate gene trees and then combine gene trees into a species tree) have been developed that have theoretical guarantees of robustness to arbitrarily high amounts of ILS. Some studies have suggested that summary methods should only be used on “c-genes” (i.e., recombination-free loci) that can be extremely short (sometimes fewer than 100 sites). However, gene trees estimated on short alignments can have high estimation error, and summary methods tend to have high error on short c-genes. To address this problem, Chifman and Kubatko introduced SVDquartets, a new coalescent-based method. SVDquartets takes multi-locus unlinked single-site data, infers the quartet trees for all subsets of four species, and then combines the set of quartet trees into a species tree using a quartet amalgamation heuristic. Yet, the relative accuracy of SVDquartets to leading coalescent-based methods has not been assessed. Results: We compared SVDquartets to two leading coalescent-based methods (ASTRAL-2 and NJst), and to concatenation using maximum likelihood. We used a collection of simulated datasets, varying ILS levels, numbers of taxa, and number of sites per locus. Although SVDquartets was sometimes more accurate than ASTRAL-2 and NJst, most often the best results were obtained using ASTRAL-2, even on the shortest gene sequence alignments we explored (with only 10 sites per locus). Finally, concatenation was the most accurate of all methods under low ILS conditions. Conclusions: ASTRAL-2 generally had the best accuracy under higher ILS conditions, and concatenation had the best accuracy under the lowest ILS conditions. However, SVDquartets was competitive with the best methods under conditions with low ILS and small numbers of sites per locus. The good performance under many conditions of ASTRAL-2 in comparison to SVDquartets is surprising given the known vulnerability of ASTRAL-2 and similar methods to short gene sequences.


Sign in / Sign up

Export Citation Format

Share Document