structured coalescent
Recently Published Documents


TOTAL DOCUMENTS

45
(FIVE YEARS 8)

H-INDEX

11
(FIVE YEARS 1)

2021 ◽  
Author(s):  
Himani Sachdeva

This paper considers how local adaptation and reproductive isolation between hybridizing populations is influenced by linkage disequilibria (LD) between multiple divergently selected loci in scenarios where both gene flow and genetic drift degrade local adaptation. It shows that the combined effects of multi-locus LD and genetic drift on allele frequencies at selected loci and on heterozygosity at neutral loci are predicted accurately by incorporating (deterministic) effective migration rates into the diffusion approximation (for selected loci) and into the structured coalescent (for neutral loci). Theoretical approximations are tested against individual-based simulations and used to investigate the conditions for the maintenance of local adaptation on an island subject to one-way migration from a differently adapted mainland, and in an infinite-island population with two different habitats under divergent selection. The analysis clarifies the conditions under which LD between sets of locally deleterious alleles allows these to be collectively eliminated despite drift, causing sharper and (under certain conditions) shifted migration thresholds for loss of adaptation. Local adaptation also has counter-intuitive effects on neutral (relative) divergence: FST is highest for a pair of subpopulations belonging to the same (rare) habitat, despite the lack of reproductive isolation between them.


2021 ◽  
Vol 18 (179) ◽  
pp. 20210041
Author(s):  
Daniel Dawson ◽  
David Rasmussen ◽  
Xinxia Peng ◽  
Cristina Lanzas

Indirect (environmental) and direct (host–host) transmission pathways cannot easily be distinguished when they co-occur in epidemics, particularly when they occur on similar time scales. Phylodynamic reconstruction is a potential approach to this problem that combines epidemiological information (temporal, spatial information) with pathogen whole-genome sequencing data to infer transmission trees of epidemics. However, factors such as differences in mutation and transmission rates between host and non-host environments may obscure phylogenetic inference from these methods. In this study, we used a network-based transmission model that explicitly models pathogen evolution to simulate epidemics with both direct and indirect transmission. Epidemics were simulated according to factorial combinations of direct/indirect transmission proportions, host mutation rates and conditions of environmental pathogen growth. Transmission trees were then reconstructed using the phylodynamic approach SCOTTI (structured coalescent transmission tree inference) and evaluated. We found that although insufficient diversity sets a lower bound on when accurate phylodynamic inferences can be made, transmission routes and assumed pathogen lifestyle affected pathogen population structure and subsequently influenced both reconstruction success and the likelihood of direct versus indirect pathways being reconstructed. We conclude that prior knowledge of the likely ecology and population structure of pathogens in host and non-host environments is critical to fully using phylodynamic techniques.


2021 ◽  
Author(s):  
Ugnė Stolz ◽  
Nicola Felix Müller ◽  
Tanja Stadler ◽  
Timothy Glenn Vaughan

The structured coalescent allows inferring migration patterns between viral sub-populations from genetic sequence data. However, these analyses typically assume that no genetic recombination process impacted the sequence evolution of pathogens. For segmented viruses, such as influenza, that can undergo reassortment this assumption is broken. Reassortment reshuffles the segments of different parent lineages upon a coinfection event, which means that the shared history of viruses has to be represented by a network instead of a tree. Therefore, full genome analyses of such viruses is complex or even impossible. While this problem has been addressed for unstructured populations, it is still impossible to account for population structure, such as induced by different host populations, while also accounting for reassortment% at the same time. We address this by extending the structured coalescent to account for reassortment and present a framework for investigating possible ties between reassortment and migration (host jump) events. This method can accurately estimate sub-population dependent effective populations sizes, reassortment and migration rates from simulated data. Additionally, we apply the new model to avian influenza A/H5N1 sequences, sampled from two avian host types, Anseriformes and Galliformes. We contrast our results with a structured coalescent without reassortment inference, which assumes independently evolving segments. This reveals that taking into account segment reassortment and using sequencing data from several viral segments for joint phylodynamic inference leads to different estimates for effective population sizes, migration and clock rates. This new model is implemented as the Structured Coalescent with Reassortment (SCoRe) package for BEAST 2.5 and is available at https://github.com/jugne/SCORE.


Heredity ◽  
2021 ◽  
Vol 126 (4) ◽  
pp. 706-706
Author(s):  
Willy Rodríguez ◽  
Olivier Mazet ◽  
Simona Grusea ◽  
Armando Arredondo ◽  
Josué M. Corujo ◽  
...  

2020 ◽  
Author(s):  
Sophie Seidel ◽  
Tanja Stadler ◽  
Timothy G. Vaughan

AbstractUnderstanding how disease transmission occurs between subpopulations is critically important for guiding disease control efforts irrespective of whether the subpopulations represent geographically separated people, age or risk groups. The structured coalescent (SC) and the multitype birth-death (MBD) model can both be used to infer migration rates between subpopulations from phylogenies reconstructed from pathogen genetic sequences. However, the two classes of phylodynamic methods rely on different assumptions.Here, we report on a simulation study which compares inferences made using these models for a variety of migration rates in both endemic diseases and epidemic outbreaks. For the epidemic outbreak, we found that the MBD recovers the true migration rates better than the SC regardless of migration rate. We hypothesize that the inaccurate SC estimates stem from the its assumption of a constant population size. For the endemic scenario, our analysis shows that both models obtain a similar coverage of the migration rates, while the SC provides slightly narrower posterior intervals. Irrespective of the scenario, both models estimate the root location with similar coverage.Our study provides concrete modelling advice for infectious disease analysts. For endemic disease either model can be used, while for epidemic outbreaks the MBD should be the model of choice. Additionally, our study reveals the need to develop the SC further such that varying population sizes can easily be taken into account.Author summaryControlling an infectious disease requires us to quantify and understand how it spreads through pools of susceptible individuals, defined by their belonging to different geographical regions, age or risk groups. Rates of pathogen movement between these pools can be inferred from pathogen phylogenies which are themselves reconstructed from pathogen genetic sequences collected from infected individuals. Two popular foundations for such models are the multitype birth-death model and the structured coalescent.Although these models fulfill the same purpose, they differ in their assumptions and can, hence, produce contrasting results. To assess the appropriateness of the models in different situations, we performed a simulation study. We find that, for endemic diseases, both models are able to estimate the migration parameters reliably. For epidemic outbreaks, however, the multitype birth-death model obtains better estimates of the migration rates. We hypothesize that the structured coalescent’s inaccurate estimates for the epidemic scenario arise because it assumes a constant number of infected individuals through time.


2020 ◽  
Author(s):  
Jianan Wang ◽  
Michael D. Coffey ◽  
Nicola De Maio ◽  
Erica M. Goss

AbstractThe genetic structure and diversity of plant pathogen populations are the outcomes of evolutionary interactions with hosts and local environments, and migration at different scales, including human-enabled long-distance dispersal events. As a result, patterns of genetic variation in present populations may elucidate the history of pathogens. Phytophthora palmivora is a devastating oomycete that causes disease in a broad range of plant hosts in the tropics and subtropics worldwide. The center of diversity of P. palmivora is in Southeast Asia, but it is a destructive pathogen of hosts native to South America. Our objective was to use multilocus sequence analysis to resolve the origin and historical migration pathways of P. palmivora. Our analysis supports Southeast Asia as a center of diversity of P. palmivora and indicates that a single colonization event was responsible for the global pandemic of black pod disease of cacao. Analysis using the structured coalescent indicated that P. palmivora emerged on cacao and that cacao has been the major source of migrants to populations in Asia, Africa, and Pacific Islands. To explain these results, we hypothesize widespread introgression between the pandemic cacao lineage and populations native to Asia and the Pacific Islands. The complex evolutionary history of P. palmivora is a consequence of geographic isolation followed by long-distance movement and host jumps that allowed global expansion with cacao, coconut and other hosts.


2019 ◽  
Vol 5 (2) ◽  
Author(s):  
Nicola F Müller ◽  
Gytis Dudas ◽  
Tanja Stadler

Abstract Population dynamics can be inferred from genetic sequence data by using phylodynamic methods. These methods typically quantify the dynamics in unstructured populations or assume migration rates and effective population sizes to be constant through time in structured populations. When considering rates to vary through time in structured populations, the number of parameters to infer increases rapidly and the available data might not be sufficient to inform these. Additionally, it is often of interest to know what predicts these parameters rather than knowing the parameters themselves. Here, we introduce a method to  infer the predictors for time-varying migration rates and effective population sizes by using a generalized linear model (GLM) approach under the marginal approximation of the structured coalescent. Using simulations, we show that our approach is able to reliably infer the model parameters and its predictors from phylogenetic trees. Furthermore, when simulating trees under the structured coalescent, we show that our new approach outperforms the discrete trait GLM model. We then apply our framework to a previously described Ebola virus dataset, where we infer the parameters and its predictors from genome sequences while accounting for phylogenetic uncertainty. We infer weekly cases to be the strongest predictor for effective population size and geographic distance the strongest predictor for migration. This approach is implemented as part of the BEAST2 package MASCOT, which allows us to jointly infer population dynamics, i.e. the parameters and predictors, within structured populations, the phylogenetic tree, and evolutionary parameters.


2019 ◽  
Author(s):  
Samuel J. Bloomfield ◽  
Timothy G. Vaughan ◽  
Jackie Benschop ◽  
Jonathan C. Marshall ◽  
David T. S. Hayman ◽  
...  

AbstractAncestral state reconstruction models use genetic data to characterize a group of organisms’ common ancestor. These models have been applied to salmonellosis outbreaks to estimate the number of transmissions between different animal species that share similar geographical locations, with animal host as the state. However, as far as we are aware, no studies have validated these models for outbreak analysis. In this study, salmonellosis outbreaks were simulated using a stochastic Susceptible-Infected-Recovered model, and the host population and transmission parameters of these simulated outbreaks were estimated using Bayesian ancestral state reconstruction models (discrete trait analysis (DTA) and structured coalescent (SC)). These models were unable to accurately estimate the number of transmissions between the host populations or the amount of time spent in each host population. The DTA model was inaccurate because it assumed the number of isolates sampled from each host population was proportional to the number of individuals infected within each host population. The SC model was inaccurate possibly because it assumed that each host population’s effective population size was constant over the course of the simulated outbreaks. This study highlights the need for phylodynamic models that can take into consideration factors that influence the characteristics and behavior of outbreaks, e.g. changing effective population sizes, variation in infectious periods, intra-population transmissions, and disproportionate sampling of infected individuals.


Heredity ◽  
2018 ◽  
Vol 121 (6) ◽  
pp. 663-678 ◽  
Author(s):  
Willy Rodríguez ◽  
Olivier Mazet ◽  
Simona Grusea ◽  
Armando Arredondo ◽  
Josué M. Corujo ◽  
...  

2018 ◽  
Author(s):  
Willy Rodríguez ◽  
Olivier Mazet ◽  
Simona Grusea ◽  
Simon Boitard ◽  
Lounès Chikhi

AbstractIn the last years, a wide range of methods allowing to reconstruct past population size changes from genome-wide data have been developed. At the same time, there has been an increasing recognition that population structure can generate genetic data similar to those produced under models of population size change. Recently, Mazet et al. (2016) showed that, for any model of population structure, it is always possible to find a panmictic model with a particular function of population size changes, having exactly the same distribution of T2 (the coalescence time for a sample of size two) to that of the structured model. They called this function IICR (Inverse Instantaneous Coalescence Rate) and showed that it does not necessarily correspond to population size changes under non panmictic models. Besides, most of the methods used to analyse data under models of population structure tend to arbitrarily fix that structure and to minimise or neglect population size changes. Here we extend the seminal work of Herbots (1994) on the structured coalescent and propose a new framework, the Non-Stationary Structured Coalescent (NSSC) that incorporates demographic events (changes in gene flow and/or deme sizes) to models of nearly any complexity. We show how to compute the IICR under a wide family of stationary and non-stationary models. As an example we address the question of human and Neanderthal evolution and discuss how the NSSC framework allows to interpret genomic data under this new perspective.Author summaryGenomic data are becoming available for a rapidly increasing number of species, and contain information about their recent evolutionary history. If we wish to understand how they expanded, contracted or admixed as a consequence of recent and ancient environmental changes, we need to develop general inferential methods. Currently, demographic inference is either done assuming that a species is a single panmictic population or using arbitrary structured models. We use the concept of IICR (Inverse of the Instantaneous Coalescence Rate) together with Markov chains theory to develop a general inferential framework which we call the Non-Stationary Structured Coalescent and apply it to explain human and Neanderthal genomic data in a single structured model.


Sign in / Sign up

Export Citation Format

Share Document