scholarly journals Robustness of RADseq for evolutionary network reconstruction from gene trees

2018 ◽  
Author(s):  
José Luis Blanco-Pastor ◽  
Yann J.K. Bertrand ◽  
Isabel María Liberal ◽  
Yanling Wei ◽  
E.Charles Brummer ◽  
...  

AbstractAlthough hybridization has played an important role in the evolution of many species, phylogenetic reconstructions that include hybridizing lineages have been historically constrained by the available models and data. Recently, the combined development of high-throughput sequencing and evolutionary network models offer new opportunities for phylogenetic inference under complex patterns of hybridization in the context of incomplete lineage sorting. Restriction site associated DNA sequencing (RADseq) has been a popular sequencing technique for evolutionary reconstructions of close relatives in the Next Generation Sequencing (NGS) era. However, the utility of RADseq data for the reconstruction of complex evolutionary networks has not been thoroughly discussed. Here, we used new molecular data collected from diploid perennial Medicago species using single-digest RADseq to reconstruct evolutionary networks from gene trees, an approach that is computationally tractable with datasets that include several species and complex patterns of hybridization. Our analyses revealed that complex network reconstructions from RADseq-derived gene trees were not robust under variations of the assembly parameters and filters. Filters to exclusively select loci with high phylogenetic information created datasets that retrieved the most anomalous topologies. Conversely, alternative clustering thresholds or filters on the number of samples per locus affected the level of missing data but had a lower impact on networks. When most anomalous networks were discarded, all remaining network analyses consistently supported a hybrid origin for M. carstiensis and M. cretacea.

2020 ◽  
Author(s):  
Fernando Lopes ◽  
Larissa R Oliveira ◽  
Amanda Kessler ◽  
Yago Beux ◽  
Enrique Crespo ◽  
...  

Abstract The phylogeny and systematics of fur seals and sea lions (Otariidae) have long been studied with diverse data types, including an increasing amount of molecular data. However, only a few phylogenetic relationships have reached acceptance because of strong gene-tree species tree discordance. Divergence times estimates in the group also vary largely between studies. These uncertainties impeded the understanding of the biogeographical history of the group, such as when and how trans-equatorial dispersal and subsequent speciation events occurred. Here we used high-coverage genome-wide sequencing for 14 of the 15 species of Otariidae to elucidate the phylogeny of the family and its bearing on the taxonomy and biogeographical history. Despite extreme topological discordance among gene trees, we found a fully supported species tree that agrees with the few well-accepted relationships and establishes monophyly of the genus Arctocephalus. Our data support a relatively recent trans-hemispheric dispersal at the base of a southern clade, which rapidly diversified into six major lineages between 3 to 2.5 Ma. Otaria diverged first, followed by Phocarctos and then four major lineages within Arctocephalus. However, we found Zalophus to be non-monophyletic, with California (Z. californianus) and Steller sea lions (Eumetopias jubatus) grouping closer than the Galapagos sea lion (Z. wollebaeki) with evidence for introgression between the two genera. Overall, the high degree of genealogical discordance was best explained by incomplete lineage sorting resulting from quasi-simultaneous speciation within the southern clade with introgresssion playing a subordinate role in explaining the incongruence among and within prior phylogenetic studies of the family.


2021 ◽  
Author(s):  
Simone Cardoni ◽  
Roberta Piredda ◽  
Thomas Denk ◽  
Guido W. Grimm ◽  
Aristotelis C. Papageorgiou ◽  
...  

Standard models of speciation assume strictly dichotomous genealogies in which a species, the ancestor, is replaced by two offspring species. The reality is more complex: plant species can evolve from other species via isolation when genetic drift exceeds gene flow; lineage mixing can give rise to new species (hybrid taxa such as nothospecies and allopolyploids). The multi–copy, potentially multi–locus 5S rDNA is one of few gene regions conserving signal from dichotomous and reticulate evolutionary processes down to the level of intra-genomic recombination. Here, we provide the first high-throughput sequencing (HTS) 5S intergenic spacer (5S – IGS) data for a lineage of wind-pollinated subtropical to temperate trees, the Fagus crenata – F. sylvatica s.l. lineage, and its distant relative F. japonica. The observed 4,963 unique 5S – IGS variants reflect a long history of repeated incomplete lineage sorting and lineage mixing since the early Cenozoic of two or more paralogous-homoeologous 5S rDNA lineages. Extant species of Fagus are genetic mosaics and, at least to some part, of hybrid origin.


2021 ◽  
Author(s):  
Niklas Reichelt ◽  
Jun Wen ◽  
Claudia Paetzold ◽  
Marc Appelhans

Background and aims: Zanthoxylum L. is the only pantropical genus within Rutaceae, with a few species native to temperate eastern Asia and North America. Efforts using Sanger sequencing failed to resolve the backbone phylogeny of Zanthoxylum. In this study, we employed target enrichment high-throughput sequencing to improve resolution. Gene trees were examined for concordance and sectional classifications of Zanthoxylum were evaluated. Off-target reads were investigated to identify putative single-copy markers for bait refinement, and low-copy markers for evidence of putative hybridization events. Methods: We developed a custom bait set for target enrichment of 745 exons in Zanthoxylum and applied it to 45 Zanthoxylum species and one Tetradium species as the outgroup. Illumina reads were processed via the HybPhyloMaker pipeline. Phylogenetic inferences were conducted using coalescent and concatenated methods. Concordance was assessed using quartet sampling. Off-target reads were assembled and putative single- and low-copy genes were extracted. Additional phylogenetic analyses were performed based on these alignments. Key results: Four major clades are supported within Zanthoxylum: the African clade, the Z. asiaticum clade, the Asian-Pacific-Australian clade, and the American-eastern Asian clade. While overall support has improved, regions of conflict are similar to those previously observed. Gene tree discordances indicate a hybridization event in the ancestor of the Hawaiian lineage, and incomplete lineage sorting for the American backbone. Off-target putative single-copy genes largely confirm on-target results, and putative low-copy genes provide additional evidence for hybridization in the Hawaiian lineage. Only two of the five sections of Zanthoxylum are resolved as monophyletic. Conclusion: Target enrichment is suitable to assess phylogenetic relationships in Zanthoxylum. Our phylogenetic analyses reveal that current sectional classifications need revision. Quartet tree concordance indicates several instances of reticulate evolution. Off-target reads are proven useful to identify additional phylogenetically informative regions for bait refinement or gene tree based approaches.


2020 ◽  
Author(s):  
Fernando Lopes ◽  
Larissa R. Oliveira ◽  
Amanda Kessler ◽  
Yago Beux ◽  
Enrique Crespo ◽  
...  

AbstractThe phylogeny and systematics of fur seals and sea lions (Otariidae) have long been studied with diverse data types, including an increasing amount of molecular data. However, only a few phylogenetic relationships have reached acceptance because of strong gene-tree species tree discordance. Divergence times estimates in the group also vary largely between studies. These uncertainties impeded the understanding of the biogeographical history of the group, such as when and how trans-equatorial dispersal and subsequent speciation events occurred. Here we used high-coverage genome-wide sequencing for 14 of the 15 species of Otariidae to elucidate the phylogeny of the family and its bearing on the taxonomy and biogeographical history. Despite extreme topological discordance among gene trees, we found a fully supported species tree that agrees with the few well-accepted relationships and establishes monophyly of the genusArctocephalus. Our data support a relatively recent trans-hemispheric dispersal at the base of a southern clade, which rapidly diversified into six major lineages between 3 to 2.5 Ma.Otariadiverged first, followed byPhocarctosand then four major lineages withinArctocephalus. However, we foundZalophusto be non-monophyletic, with California(Z. californianus)and Steller sea lions(Eumetopias jubatus)grouping closer than the Galapagos sea lion (Z. wollebaeki)with evidence for introgression between the two genera. Overall, the high degree of genealogical discordance was best explained by incomplete lineage sorting resulting from quasi-simultaneous speciation within the southern clade with introgresssion playing a subordinate role in explaining the incongruence among and within prior phylogenetic studies of the family.


2018 ◽  
Author(s):  
Julie Marin ◽  
Guillaume Achaz ◽  
Anton Crombach ◽  
Amaury Lambert

AbstractEvolutionary relationships between species are traditionally represented in the form of a tree, the species tree. Its reconstruction from molecular data is hindered by frequent conflicts between gene genealogies. Usually, these disagreements are explained by incomplete lineage sorting (ILS) due to random coalescences of gene lineages inside the edges of the species tree. This paradigm, the multi-species coalescent (MSC), is constantly violated by the ubiquitous presence of gene flow, leading to incongruences between gene trees that cannot be explained by ILS alone. Here we argue instead in favor of a vision acknowledging the importance of gene flow and where gene histories shape the species tree rather than the opposite. We propose a new framework for modeling the joint evolution of gene and species lineages relaxing the hierarchy between the species tree and gene trees. We implement this framework in two mathematical models called the gene-based diversification models (GBD): 1) GBD-forward following all evolving genomes and 2) GBD-backward based on coalescent theory. They feature four parameters tuning colonization, gene flow, genetic drift and genetic differentiation. We propose a quick inference method based on differences between gene trees. Applied to two empirical data-sets prone to gene flow, we find a better support for the GBD model than for the MSC model. Along with the increasing awareness of the extent of gene flow, this work shows the importance of considering the richer signal contained in genomic histories, rather than in the mere species tree, to better apprehend the complex evolutionary history of species.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Shahed Uddin Ahmed Shazib ◽  
Peter Vďačný ◽  
Marek Slovák ◽  
Eleni Gentekaki ◽  
Mann Kyoon Shin

Abstract The ciliate genus Spirostomum comprises eight morphospecies, inhabiting diverse aquatic environments worldwide, where they can be used as water quality indicators. Although Spirostomum species are relatively easily identified using morphological methods, the previous nuclear rDNA-based phylogenies indicated several conflicts in morphospecies delineation. Moreover, the single locus phylogenies and previous analytical approaches could not unambiguously resolve phylogenetic relationships among Spirostomum morphospecies. Here, we attempt to investigate species boundaries and evolutionary history of Spirostomum taxa, using 166 new sequences from multiple populations employing one mitochondrial locus (CO1 gene) and two nuclear loci (rRNA operon and alpha-tubulin gene). In accordance with previous studies, relationships among the eight Spirostomum morphospecies were poorly supported statistically in individual gene trees. To overcome this problem, we utilised for the first time in ciliates the Bayesian coalescent approach, which accounts for ancestral polymorphisms, incomplete lineage sorting, and recombination. This strategy enabled us to robustly resolve deep relationships between Spirostomum species and to support the hypothesis that taxa with compact macronucleus and taxa with moniliform macronucleus each form a distinct lineage. Bayesian coalescent-based delimitation analyses strongly statistically supported the traditional morphospecies concept but also indicated that there are two S. minus-like cryptic species and S. teres is non-monophyletic. Spirostomum teres was very likely defined by a set of ancestral features of lineages that also gave rise to S. yagiui and S. dharwarensis. However, molecular data from type populations of the morphospecies S. minus and S. teres are required to unambiguously resolve the taxonomic problems.


2019 ◽  
Author(s):  
Stella Huynh ◽  
Thomas Marcussen ◽  
François Felber ◽  
Christian Parisod

SummaryEvolutionary relationships among theAegilops-Triticumrelatives of cultivated wheats have been difficult to resolve owing to incomplete lineage sorting and reticulate evolution. Recent studies have suggested that the wheat D-genome lineage (progenitor ofAe. tauschii) originated through homoploid hybridization between the A-genome lineage (progenitor ofTriticums.str.) and the B-genome lineage (progenitor ofAe. speltoides). Scenarios of reticulation have been debated, calling for adequate phylogenetic analyses based on comprehensive sampling. To reconstruct the evolution ofAegilops-Triticumdiploids, we here combined high-throughput sequencing of 38 nuclear low-copy loci of multiple accessions of all 13 species with inferences of the species phylogeny using the full-parameterized MCMC_SEQ method. Phylogenies recovered a monophyleticAegilops-Triticumlineage that began diversifying ~6.5 Ma ago and gave rise to four sublineages, i.e. the A- (2 species), B- (1 species), D- (9 species) and T- (Ae. mutica) genome lineage. Full-parameterized phylogenies as well as patterns of tree dilation and tree compression supported a hybrid origin of the D-genome lineage from A and B ~4.1 Ma ago, and did not indicate additional hybridization events. This comprehensive and dated phylogeny of wheat relatives indicates that the origin of the hybrid D-genome was followed by intense diversification into almost all diploid as well as allopolyploid wild wheats.


2022 ◽  
Author(s):  
XiaoXu Pang ◽  
Da-Yong Zhang

The species studied in any evolutionary investigation generally constitute a very small proportion of all the species currently existing or that have gone extinct. It is therefore likely that introgression, which is widespread across the tree of life, involves "ghosts," i.e., unsampled, unknown, or extinct lineages. However, the impact of ghost introgression on estimations of species trees has been rarely studied and is thus poorly understood. In this study, we use mathematical analysis and simulations to examine the robustness of species tree methods based on a multispecies coalescent model under gene flow sourcing from an extant or ghost lineage. We found that very low levels of extant or ghost introgression can result in anomalous gene trees (AGTs) on three-taxon rooted trees if accompanied by strong incomplete lineage sorting (ILS). In contrast, even massive introgression, with more than half of the recipient genome descending from the donor lineage, may not necessarily lead to AGTs. In cases involving an ingroup lineage (defined as one that diverged no earlier than the most basal species under investigation) acting as the donor of introgression, the time of root divergence among the investigated species was either underestimated or remained unaffected, but for the cases of outgroup ghost lineages acting as donors, the divergence time was generally overestimated. Under many conditions of ingroup introgression, the stronger the ILS was, the higher was the accuracy of estimating the time of root divergence, although the topology of the species tree is more prone to be biased by the effect of introgression.


2022 ◽  
Vol 12 ◽  
Author(s):  
Martha Kandziora ◽  
Petr Sklenář ◽  
Filip Kolář ◽  
Roswitha Schmickl

A major challenge in phylogenetics and -genomics is to resolve young rapidly radiating groups. The fast succession of species increases the probability of incomplete lineage sorting (ILS), and different topologies of the gene trees are expected, leading to gene tree discordance, i.e., not all gene trees represent the species tree. Phylogenetic discordance is common in phylogenomic datasets, and apart from ILS, additional sources include hybridization, whole-genome duplication, and methodological artifacts. Despite a high degree of gene tree discordance, species trees are often well supported and the sources of discordance are not further addressed in phylogenomic studies, which can eventually lead to incorrect phylogenetic hypotheses, especially in rapidly radiating groups. We chose the high-Andean Asteraceae genus Loricaria to shed light on the potential sources of phylogenetic discordance and generated a phylogenetic hypothesis. By accounting for paralogy during gene tree inference, we generated a species tree based on hundreds of nuclear loci, using Hyb-Seq, and a plastome phylogeny obtained from off-target reads during target enrichment. We observed a high degree of gene tree discordance, which we found implausible at first sight, because the genus did not show evidence of hybridization in previous studies. We used various phylogenomic analyses (trees and networks) as well as the D-statistics to test for ILS and hybridization, which we developed into a workflow on how to tackle phylogenetic discordance in recent radiations. We found strong evidence for ILS and hybridization within the genus Loricaria. Low genetic differentiation was evident between species located in different Andean cordilleras, which could be indicative of substantial introgression between populations, promoted during Pleistocene glaciations, when alpine habitats shifted creating opportunities for secondary contact and hybridization.


Sign in / Sign up

Export Citation Format

Share Document