scholarly journals Dynamic sampling bias and overdispersion induced by skewed offspring distributions

2021 ◽  
Author(s):  
Takashi Okada ◽  
Oskar Hallatschek

Natural populations often show enhanced genetic drift consistent with a strong skew in their offspring number distribution. The skew arises because the variability of family sizes is either inherently strong or amplified by population expansions, leading to so-called ‘jackpot’ events. The resulting allele frequency fluctuations are large and, therefore, challenge standard models of population genetics, which assume sufficiently narrow offspring distributions. While the neutral dynamics backward in time can be readily analyzed using coalescent approaches, we still know little about the effect of broad offspring distributions on the dynamics forward in time, especially with selection. Here, we employ an exact asymptotic analysis combined with a scaling hypothesis to demonstrate that over-dispersed frequency trajectories emerge from the competition of conventional forces, such as selection or mutations, with an emerging time-dependent sampling bias against the minor allele. The sampling bias arises from the characteristic time-dependence of the largest sampled family size within each allelic type. Using this insight, we establish simple scaling relations for allele frequency fluctuations, fixation probabilities, extinction times, and the site frequency spectra that arise when offspring numbers are distributed according to a power law n−(1+α). To demonstrate that this coarse-grained model captures a wide variety of non-equilibrium dynamics, we validate our results in traveling waves, where the phenomenon of ‘gene surfing’ can produce any exponent 1 < α < 2. We argue that the concept of a dynamic sampling bias is useful generally to develop both intuition and statistical tests for the unusual dynamics of populations with skewed offspring distributions, which can confound commonly used tests for selection or demographic history.

Genetics ◽  
2021 ◽  
Author(s):  
Takashi Okada ◽  
Oskar Hallatschek

Abstract Natural populations often show enhanced genetic drift consistent with a strong skew in their offspring number distribution. The skew arises because the variability of family sizes is either inherently strong or amplified by population expansions. The resulting allele-frequency fluctuations are large and, therefore, challenge standard models of population genetics, which assume sufficiently narrow offspring distributions. While the neutral dynamics backward in time can be readily analyzed using coalescent approaches, we still know little about the effect of broad offspring distributions on the forward-in-time dynamics, especially with selection. Here, we employ an asymptotic analysis combined with a scaling hypothesis to demonstrate that over-dispersed frequency trajectories emerge from the competition of conventional forces, such as selection or mutations, with an emerging time-dependent sampling bias against the minor allele. The sampling bias arises from the characteristic time-dependence of the largest sampled family size within each allelic type. Using this insight, we establish simple scaling relations for allele-frequency fluctuations, fixation probabilities, extinction times, and the site frequency spectra that arise when offspring numbers are distributed according to a power law n−(1+α). To demonstrate that this coarse-grained model captures a wide variety of evolutionary dynamics, we validate our results in traveling waves, where the phenomenon of ’gene surfing’ can produce any exponent 1 &lt; α &lt; 2. We argue that the concept of a dynamic sampling bias is useful to develop both intuition and statistical tests for the unusual dynamics of populations with skewed offspring distributions, which can confound commonly used tests for selection or demographic history.


Genetics ◽  
2000 ◽  
Vol 156 (1) ◽  
pp. 457-467 ◽  
Author(s):  
Z W Luo ◽  
S H Tao ◽  
Z-B Zeng

Abstract Three approaches are proposed in this study for detecting or estimating linkage disequilibrium between a polymorphic marker locus and a locus affecting quantitative genetic variation using the sample from random mating populations. It is shown that the disequilibrium over a wide range of circumstances may be detected with a power of 80% by using phenotypic records and marker genotypes of a few hundred individuals. Comparison of ANOVA and regression methods in this article to the transmission disequilibrium test (TDT) shows that, given the genetic variance explained by the trait locus, the power of TDT depends on the trait allele frequency, whereas the power of ANOVA and regression analyses is relatively independent from the allelic frequency. The TDT method is more powerful when the trait allele frequency is low, but much less powerful when it is high. The likelihood analysis provides reliable estimation of the model parameters when the QTL variance is at least 10% of the phenotypic variance and the sample size of a few hundred is used. Potential use of these estimates in mapping the trait locus is also discussed.


2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Xiaoming Liu ◽  
Yun-Xin Fu

An amendment to this paper has been published and can be accessed via the original article.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Nandika Perera ◽  
Gayani Galhena ◽  
Gaya Ranawaka

AbstractA new 16 X-short tandem repeat (STR) multiplex PCR system has recently been developed for Sr Lankans, though its applicability in evolutionary genetics and forensic investigations has not been thoroughly assessed. In this study, 838 unrelated individuals covering all four major ethnic groups (Sinhalese, Sri Lankan Tamils, Indian Tamils and Moors) in Sri Lanka were successfully genotyped using this new multiplex system. The results indicated a high forensic efficiency for the tested loci in all four ethnicities confirming its suitability for forensic applications of Sri Lankans. Allele frequency distribution of Indian Tamils showed subtle but statistically significant differences from those of Sinhalese and Moors, in contrast to frequency distributions previously reported for autosomal STR alleles. This suggest a sex biased demographic history among Sri Lankans requiring a separate X-STR allele frequency database for Indian Tamils. Substantial differences observed in the patterns of LD among the four groups demand the use of a separate haplotype frequency databases for each individual ethnicity. When analysed together with other 14 world populations, all Sri Lankan ethnicities except Indian Tamils clustered closely with populations from Indian Bhil tribe, Bangladesh and Europe reflecting their shared Indo-Aryan ancestry.


Genetics ◽  
1986 ◽  
Vol 114 (4) ◽  
pp. 1165-1190
Author(s):  
Charles F Aquadro ◽  
Susan F Desse ◽  
Molly M Bland ◽  
Charles H Langley ◽  
Cathy C Laurie-Ahlberg

ABSTRACT Variation in the DNA restriction map of a 13-kb region of chromosome ll including the alcohol dehydrogenase structural gene (Adh) was examined in Drosophila melanogaster from natural populations. Detailed analysis of 48 D. melanogaster lines representing four eastern United States populations revealed extensive DNA sequence variation due to base substitutions, insertions and deletions. Cloning of this region from several lines allowed characterization of length variation as due to unique sequence insertions or deletions [nine sizes; 21-200 base pairs (bp)] or transposable element insertions (several sizes, 340 bp to 10.2 kb, representing four different elements). Despite this extensive variation in sequences flanking the Adh gene, only one length polymorphism is clearly associated with altered Adh expression (a copia element approximately 250 bp 5′ to the distal transcript start site). Nonetheless, the frequency spectra of transposable elements within and between Drosophila species suggests they are slightly deleterious. Strong nonrandom associations are observed among Adh region sequence variants, ADH allozyme (Fast vs. Slow), ADH enzyme activity and the chromosome inversion ln(2L)t. Phylogenetic analysis of restriction map haplotypes suggest that the major twofold component of ADH activity variation (high vs. low, typical of Fast and Slow allozymes, respectively) is due to sequence variation tightly linked to and possibly distinct from that underlying the allozyme difference. The patterns of nucleotide and haplotype variation for Fast and Slow allozyme lines are consistent with the recent increase in frequency and spread of the Fast haplotype associated with high ADH activity. These data emphasize the important role of evolutionary history and strong nonrandom associations among tightly linked sequence variation as determinants of the patterns of variation observed in natural populations.


1994 ◽  
Vol 24 (8) ◽  
pp. 1726-1733 ◽  
Author(s):  
J. Beaulieu ◽  
J.-P. Simon

The level of genetic diversity of natural populations of eastern white pine (Pinusstrobus L.) from Quebec was estimated from allozyme variants of 18 loci coding 12 enzyme systems. On average, a white pine population was polymorphic at 50.6% of loci, had 1.96 alleles and 1.22 effective alleles per locus, and observed and expected heterozygosities of 0.176 and 0.180, respectively. The level of genetic diversity was lower in the populations of the St. Lawrence lowlands than in those of western Quebec. This observation will help in guiding the selection program of the eastern white pine improvement program under way in Quebec. Genetic differentiation among sampled populations was weak and accounted for only 2% of the total diversity. The estimate of gene flow was very high, resulting in low values for genetic distances among populations. Only one locus showed a heterogeneity of allelic frequencies among populations after the Bonferroni procedure was applied for simultaneous statistical tests. A cluster analysis based on genetic distances among populations revealed that the Anticosti and Abitibi populations, located at the limit of the natural range of white pine, were similar to populations from regions that were geographically the most distant.


Genetics ◽  
2020 ◽  
Vol 214 (4) ◽  
pp. 1019-1030 ◽  
Author(s):  
Raul Torres ◽  
Markus G. Stetter ◽  
Ryan D. Hernandez ◽  
Jeffrey Ross-Ibarra

Neutral genetic diversity across the genome is determined by the complex interplay of mutation, demographic history, and natural selection. While the direct action of natural selection is limited to functional loci across the genome, its impact can have effects on nearby neutral loci due to genetic linkage. These effects of selection at linked sites, referred to as genetic hitchhiking and background selection (BGS), are pervasive across natural populations. However, only recently has there been a focus on the joint consequences of demography and selection at linked sites, and some empirical studies have come to apparently contradictory conclusions as to their combined effects. To understand the relationship between demography and selection at linked sites, we conducted an extensive forward simulation study of BGS under a range of demographic models. We found that the relative levels of diversity in BGS and neutral regions vary over time and that the initial dynamics after a population size change are often in the opposite direction of the long-term expected trajectory. Our detailed observations of the temporal dynamics of neutral diversity in the context of selection at linked sites in nonequilibrium populations provide new intuition about why patterns of diversity under BGS vary through time in natural populations and help reconcile previously contradictory observations. Most notably, our results highlight that classical models of BGS are poorly suited for predicting diversity in nonequilibrium populations.


2019 ◽  
Vol 79 (4) ◽  
pp. 971-984 ◽  
Author(s):  
Shifen Xu ◽  
Liyun Jiang ◽  
Gexia Qiao ◽  
Jing Chen

AbstractAphids live in symbiosis with a variety of bacteria, including the obligate symbiont Buchnera aphidicola and diverse facultative symbionts. The symbiotic associations for one aphid species, especially for polyphagous species, often differ across populations. In the present study, by using high-throughput 16S rRNA sequencing, we surveyed in detail the microbiota in natural populations of the cotton aphid Aphis gossypii in China and assessed differences in bacterial diversity with respect to host plant and geography. The microbial community of A. gossypii was dominated by a few heritable symbionts. Arsenophonus was the most dominant secondary symbiont, and Spiroplasma was detected for the first time. Statistical tests and ordination analyses showed that host plants rather than geography seemed to have shaped the associated symbiont composition. Special symbiont communities inhabited the Cucurbitaceae-feeding populations, which supported the ecological specialization of A. gossypii on cucurbits from the viewpoint of symbiotic bacteria. Correlation analysis suggested antagonistic interactions between Buchnera and coexisting secondary symbionts and more complicated interactions between different secondary symbionts. Our findings lend further support to an important role of the host plant in structuring symbiont communities of polyphagous aphids and will improve our understanding of the interactions among phytophagous insects, symbionts, and environments.


1986 ◽  
Vol 48 (3) ◽  
pp. 161-166 ◽  
Author(s):  
Catherine Montchamp-Moreau ◽  
Mariano Katz

SummaryWe analyse the progression of linkage disequilibrium produced by random genetic drift in populations subject to cyclical fluctuations in size. Our model is applied to natural populations of Drosophila which show an annual demographic cycle of bottleneck (finite size) and demographic burst (size supposed to be infinite). In these populations, linkage disequilibrium stabilizes in such a way that, at equilibrium, the expected square of the correlation of gene frequencies E(r2) shows a stable cycle from year to year. If two loci are tightly linked, E(r2) barely varies during the annual cycle. Its values remain close to the value expected in a population of the same but constant effective size. If two loci are loosely linked, fluctuations in E(r2) are large. The maximum value, reached at the end of the bottleneck, is 10 to 100 times greater than the value obtained at the end of the burst. Our results show that the interpretation of observed linkage disequilibrium, by means of statistical tests, requires an accurate knowledge of population demography.


2011 ◽  
Vol 79 (4) ◽  
pp. 203-219 ◽  
Author(s):  
Sergio Lukić ◽  
Jody Hey ◽  
Kevin Chen

Sign in / Sign up

Export Citation Format

Share Document