scholarly journals A posteriori evaluation of molecular divergence dates using empirical estimates of time-heterogeneous fossilization rates

2017 ◽  
Author(s):  
Simon Gunkel ◽  
Jes Rust ◽  
Torsten Wappler ◽  
Christoph Mayer ◽  
Oliver Niehuis ◽  
...  

AbstractThe application of molecular clock concepts in phylogenetics permits estimating the divergence times of clades with an incomplete fossil record. However, the reliability of this approach is disputed, because the resulting estimates are often inconsistent with different sets of fossils and other parameters (clock models and prior settings) in the analyses. Here, we present the λ statistic, a likelihood approach for a posteriori evaluating the reliability of estimated divergence times. The λ statistic is based on empirically derived fossilization rates and evaluates the fit of estimated divergence times to the fossil record. We tested the performance of this measure with simulated data sets. Furthermore, we applied it to the estimated divergence times of (i) Clavigeritae beetles of the family Staphylinidae and (ii) all extant insect orders. The reanalyzed beetle data supports the originally published results, but shows that several fossil calibrations used do not increase the reliability of the divergence time estimates. Analyses of estimated inter-ordinal insect divergences indicate that uniform priors with soft bounds marginally outperform log-normal priors on node ages. Furthermore, a posteriori evaluation of the original published analysis indicates that several inter-ordinal divergence estimates might be too young. The λ statistic allows the comparative evaluation of any clade divergence estimate derived from different calibration approaches. Consequently, the application of different algorithms, software tools, and calibration schemes can be empirically assessed.


2020 ◽  
Vol 37 (6) ◽  
pp. 1819-1831
Author(s):  
Qiqing Tao ◽  
Jose Barba-Montoya ◽  
Louise A Huuki ◽  
Mary Kathleen Durnan ◽  
Sudhir Kumar

Abstract The conventional wisdom in molecular evolution is to apply parameter-rich models of nucleotide and amino acid substitutions for estimating divergence times. However, the actual extent of the difference between time estimates produced by highly complex models compared with those from simple models is yet to be quantified for contemporary data sets that frequently contain sequences from many species and genes. In a reanalysis of many large multispecies alignments from diverse groups of taxa, we found that the use of the simplest models can produce divergence time estimates and credibility intervals similar to those obtained from the complex models applied in the original studies. This result is surprising because the use of simple models underestimates sequence divergence for all the data sets analyzed. We found three fundamental reasons for the observed robustness of time estimates to model complexity in many practical data sets. First, the estimates of branch lengths and node-to-tip distances under the simplest model show an approximately linear relationship with those produced by using the most complex models applied on data sets with many sequences. Second, relaxed clock methods automatically adjust rates on branches that experience considerable underestimation of sequence divergences, resulting in time estimates that are similar to those from complex models. And, third, the inclusion of even a few good calibrations in an analysis can reduce the difference in time estimates from simple and complex models. The robustness of time estimates to model complexity in these empirical data analyses is encouraging, because all phylogenomics studies use statistical models that are oversimplified descriptions of actual evolutionary substitution processes.



Zootaxa ◽  
2009 ◽  
Vol 2107 (1) ◽  
pp. 41-52 ◽  
Author(s):  
CAROLINA M VOLOCH ◽  
PABLO R FREIRE ◽  
CLAUDIA A M RUSSO

Fossil record of penaeids indicates that the family exists since the Triassic period, but extant genera appeared only recently in Tertiary strata. Molecular based divergence time estimates on the matter of penaeid radiation were never properly addressed, due to shortcomings of the global molecular clock assumptions. Here, we studied the diversification patterns of the family, uncovering, more specifically, a correlation between fossil and extant Penaeid fauna. For this, we have used a Bayesian framework that does not assume a global clock. Our results suggest that Penaeid genera originated between 20 million years ago and 43 million years ago, much earlier than expected by previous molecular studies. Altogether, these results promptly discard late Tertiary or even Quaternary hypotheses that presumed a major glaciations influence on the diversification patterns of the family.



2020 ◽  
Vol 36 (Supplement_2) ◽  
pp. i884-i894
Author(s):  
Jose Barba-Montoya ◽  
Qiqing Tao ◽  
Sudhir Kumar

Abstract Motivation As the number and diversity of species and genes grow in contemporary datasets, two common assumptions made in all molecular dating methods, namely the time-reversibility and stationarity of the substitution process, become untenable. No software tools for molecular dating allow researchers to relax these two assumptions in their data analyses. Frequently the same General Time Reversible (GTR) model across lineages along with a gamma (+Γ) distributed rates across sites is used in relaxed clock analyses, which assumes time-reversibility and stationarity of the substitution process. Many reports have quantified the impact of violations of these underlying assumptions on molecular phylogeny, but none have systematically analyzed their impact on divergence time estimates. Results We quantified the bias on time estimates that resulted from using the GTR + Γ model for the analysis of computer-simulated nucleotide sequence alignments that were evolved with non-stationary (NS) and non-reversible (NR) substitution models. We tested Bayesian and RelTime approaches that do not require a molecular clock for estimating divergence times. Divergence times obtained using a GTR + Γ model differed only slightly (∼3% on average) from the expected times for NR datasets, but the difference was larger for NS datasets (∼10% on average). The use of only a few calibrations reduced these biases considerably (∼5%). Confidence and credibility intervals from GTR + Γ analysis usually contained correct times. Therefore, the bias introduced by the use of the GTR + Γ model to analyze datasets, in which the time-reversibility and stationarity assumptions are violated, is likely not large and can be reduced by applying multiple calibrations. Availability and implementation All datasets are deposited in Figshare: https://doi.org/10.6084/m9.figshare.12594638.



2019 ◽  
Author(s):  
Qiqing Tao ◽  
Koichiro Tamura ◽  
Beatriz Mello ◽  
Sudhir Kumar

AbstractConfidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in these analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density (HPD) intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, i.e., the true time is contained within the CIs with a 95% probability. These developments will encourage broader use of computationally-efficient RelTime approach in molecular dating analyses and biological hypothesis testing.



2020 ◽  
Author(s):  
Tom Carruthers ◽  
Robert W Scotland

Abstract Understanding and representing uncertainty is crucial in academic research, because it enables studies to build on the conclusions of previous studies, leading to robust advances in a particular field. Here, we evaluate the nature of uncertainty and the manner by which it is represented in divergence time estimation, a field that is fundamental to many aspects of macroevolutionary research, and where there is evidence that uncertainty has been seriously underestimated. We address this issue in the context of methods used in divergence time estimation, and with respect to the manner by which time-calibrated phylogenies are interpreted. With respect to methods, we discuss how the assumptions underlying different methods may not adequately reflect uncertainty about molecular evolution, the fossil record, or diversification rates. Therefore, divergence time estimates may not adequately reflect uncertainty, and may be directly contradicted by subsequent findings. For the interpretation of time-calibrated phylogenies, we discuss how the use of time-calibrated phylogenies for reconstructing general evolutionary timescales leads to inferences about macroevolution that are highly sensitive to methodological limitations in how uncertainty is accounted for. By contrast, we discuss how the use of time-calibrated phylogenies to test specific hypotheses leads to inferences about macroevolution that are less sensitive to methodological limitations. Given that many biologists wish to use time-calibrated phylogenies to reconstruct general evolutionary timescales, we conclude that the development of methods of divergence time estimation that adequately account for uncertainty is necessary.



2019 ◽  
Vol 99 (1) ◽  
pp. 105-367 ◽  
Author(s):  
Mao-Qiang He ◽  
Rui-Lin Zhao ◽  
Kevin D. Hyde ◽  
Dominik Begerow ◽  
Martin Kemler ◽  
...  

AbstractThe Basidiomycota constitutes a major phylum of the kingdom Fungi and is second in species numbers to the Ascomycota. The present work provides an overview of all validly published, currently used basidiomycete genera to date in a single document. An outline of all genera of Basidiomycota is provided, which includes 1928 currently used genera names, with 1263 synonyms, which are distributed in 241 families, 68 orders, 18 classes and four subphyla. We provide brief notes for each accepted genus including information on classification, number of accepted species, type species, life mode, habitat, distribution, and sequence information. Furthermore, three phylogenetic analyses with combined LSU, SSU, 5.8s, rpb1, rpb2, and ef1 datasets for the subphyla Agaricomycotina, Pucciniomycotina and Ustilaginomycotina are conducted, respectively. Divergence time estimates are provided to the family level with 632 species from 62 orders, 168 families and 605 genera. Our study indicates that the divergence times of the subphyla in Basidiomycota are 406–430 Mya, classes are 211–383 Mya, and orders are 99–323 Mya, which are largely consistent with previous studies. In this study, all phylogenetically supported families were dated, with the families of Agaricomycotina diverging from 27–178 Mya, Pucciniomycotina from 85–222 Mya, and Ustilaginomycotina from 79–177 Mya. Divergence times as additional criterion in ranking provide additional evidence to resolve taxonomic problems in the Basidiomycota taxonomic system, and also provide a better understanding of their phylogeny and evolution.



2019 ◽  
Vol 37 (1) ◽  
pp. 280-290 ◽  
Author(s):  
Qiqing Tao ◽  
Koichiro Tamura ◽  
Beatriz Mello ◽  
Sudhir Kumar

Abstract Confidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in dating analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, that is, the actual time is contained within the CIs with a 94% probability. These developments will encourage broader use of computationally efficient RelTime approaches in molecular dating analyses and biological hypothesis testing.



2017 ◽  
Author(s):  
Caroline Parins-Fukuchi ◽  
Joseph W. Brown

AbstractRecently, approaches that estimate species divergence times using fossil taxa and models of morphological evolution have exploded in popularity. These methods incorporate diverse biological and geological information to inform posterior reconstructions, and have been applied to several high-profile clades to positive effect. However, there are important examples where morphological data are misleading, resulting in unrealistic age estimates. While several studies have demonstrated that these approaches can be robust and internally consistent, the causes and limitations of these patterns remain unclear. In this study, we dissect signal in Bayesian dating analyses of three mammalian clades. For two of the three examples, we find that morphological characters provide little information regarding divergence times as compared to geological range information, with posterior estimates largely recapitulating those recovered under the prior. However, in the cetacean dataset, we find that morphological data do appreciably inform posterior divergence time estimates. We supplement these empirical analyses with a set of simulations designed to explore the efficiency and limitations of binary and 3-state character data in reconstructing node ages. Our results demonstrate areas of both strength and weakness for morphological clock analyses, and help to outline conditions under which they perform best and, conversely, when they should be eschewed in favour of purely geological approaches.



2013 ◽  
Vol 280 (1755) ◽  
pp. 20122686 ◽  
Author(s):  
Sophie Cardinal ◽  
Bryan N. Danforth

Reliable estimates on the ages of the major bee clades are needed to further understand the evolutionary history of bees and their close association with flowering plants. Divergence times have been estimated for a few groups of bees, but no study has yet provided estimates for all major bee lineages. To date the origin of bees and their major clades, we first perform a phylogenetic analysis of bees including representatives from every extant family, subfamily and almost all tribes, using sequence data from seven genes. We then use this phylogeny to place 14 time calibration points based on information from the fossil record for an uncorrelated relaxed clock divergence time analysis taking into account uncertainties in phylogenetic relationships and the fossil record. We explore the effect of placing a hard upper age bound near the root of the tree and the effect of different topologies on our divergence time estimates. We estimate that crown bees originated approximately 123 Ma (million years ago) (113–132 Ma), concurrently with the origin or diversification of the eudicots, a group comprising 75 per cent of angiosperm species. All of the major bee clades are estimated to have originated during the Middle to Late Cretaceous, which is when angiosperms became the dominant group of land plants.



2020 ◽  
Author(s):  
Kenneth De Baets ◽  
Paula Dentzien-Dias ◽  
G. William M. Harrison ◽  
D. Timothy J. Littlewood ◽  
Luke A. Parry

The fossil record of parasitic helminths is often stated to be severely limited. Many studies have therefore used host constraints to constrain molecular divergence time estimates of helminths. Here we review direct fossil evidence for several of these parasitic lineages belong to various phyla (Acanthocephala, Annelida, Arthropoda, Nematoda, Nematomorpha, Pentastomida, Platyhelminthes). Our compilation shows that the fossil record of soft-bodied helminths is patchy, but more diverse than commonly assumed. The fossil record provides evidence that ectoparasitic helminths (e.g., worm-like pentastomid arthropods) have been around since the early Paleozoic, while endoparasitic helminths (cestodes) arose at least during, or possibly even before the late Paleozoic. Nematode lineages parasitizing terrestrial plant and animal hosts have been in existence at least since the Devonian and Triassic, respectively. All major phyla (Acanthocephala, Annelida, Platyhelminthes. Nematoda, Nematomorpha) had evolved endoparasitic lineages at least since the Mesozoic. Interestingly, although parasitism is considered derived within Metazoa, the oldest evidence for Nematoda and Platyhelminthes includes body fossils of parasitic representatives. Furthermore, the oldest fossil evidence of these parasitic lineages often falls within molecular divergence time estimates based on host co-evolution suggesting the fossil record of helminths themselves might be just as good or at least complementary (and less circular in justification) to calibration based on host associations. Data also provide evidence for obvious host switches or extinctions, which cautions against models of pure co-divergence where use of host calibrations to constrain divergence time estimates may be considered.



Sign in / Sign up

Export Citation Format

Share Document