A Coalescent Estimator of the Population Recombination Rate

Population genetic models often use a population recombination parameter 4Nc, where N is the effective population size and c is the recombination rate per generation. In many ways 4Nc is comparable to 4Nu, the population mutation rate. Both combine genome level and population level processes, and together they describe the rate of production of genetic variation in a population. However, 4Nc is more difficult to estimate. For a population sample of DNA sequences, historical recombination can only be detected if polymorphisms exist, and even then most recombination events are not detectable. This paper describes an estimator of 4Nc, hereafter designated γ (gamma), that was developed using a coalescent model for a sample of four DNA sequences with recombination. The reliability of γ was assessed using multiple coalescent simulations. In general γ has low to moderate bias, and the reliability of γ is comparable, though less, than that for a widely used estimator of 4Nu. If there exists an independent estimate of the recombination rate (per generation, per base pair), γ can be used to estimate the effective population size or the neutral mutation rate.

Download Full-text

A phylogenetic estimator of effective population size or mutation rate.

Genetics ◽

10.1093/genetics/136.2.685 ◽

1994 ◽

Vol 136 (2) ◽

pp. 685-692 ◽

Cited By ~ 1

Author(s):

Y X Fu

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Dna Sequences ◽

High Efficiency ◽

Minimum Variance ◽

Population Subdivision ◽

Effective Population ◽

Fisher Model ◽

Mitochondrial Sequences

Abstract A new estimator of the essential parameter theta = 4Ne mu from DNA polymorphism data is developed under the neutral Wright-Fisher model without recombination and population subdivision, where Ne is the effective population size and mu is the mutation rate per locus per generation. The new estimator has a variance only slightly larger than the minimum variance of all possible unbiased estimators of the parameter and is substantially smaller than that of any existing estimator. The high efficiency of the new estimator is achieved by making full use of phylogenetic information in a sample of DNA sequences from a population. An example of estimating theta by the new method is presented using the mitochondrial sequences from an American Indian population.

Download Full-text

Coalescent Theory for a Partially Selfing Population

Genetics ◽

10.1093/genetics/146.4.1489 ◽

1997 ◽

Vol 146 (4) ◽

pp. 1489-1499 ◽

Cited By ~ 1

Author(s):

Yun-Xin Fu

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Dna Sequences ◽

Coalescent Theory ◽

Selfing Rate ◽

Effective Population ◽

Diploid Population ◽

Approximate Formulas ◽

Segregating Sites

A coalescent theory for a sample of DNA sequences from a partially selfing diploid population and an algorithm for simulating such samples are developed in this article. Approximate formulas are given for the expectation and the variance of the number of segregating sites in a sample of k sequences from n individuals. Several new estimators of the important parameters θ = 4Nμ and the selfing rate s, where N and μ are, respectively, the effective population size and the mutation rate per sequence per generation, are proposed and their sampling properties are studied.

Download Full-text

The Nonadaptive Forces of Evolution

10.1093/oso/9780198830870.003.0004 ◽

2018 ◽

Author(s):

Bruce Walsh ◽

Michael Lynch

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Recombination Rate ◽

Effective Population ◽

Evolutionary Forces ◽

Wide Range ◽

Population Sizes ◽

Genomic Results

This chapter examines the relative strengths of the nonadaptive evolutionary forces (drift, mutation, recombination) acting on genomes. It reviews estimators for effective population size, mutation rate, and recombination rate, and summarizes the known genomic results over a wide range of taxa. The mutation rate tends to be lower in organisms with larger effective population sizes, consistent with the drift-barrier hypothesis wherein selection is ineffective when it is less than the reciprocal of the effective population size.

Download Full-text

Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling.

Genetics ◽

10.1093/genetics/140.4.1421 ◽

1995 ◽

Vol 140 (4) ◽

pp. 1421-1430 ◽

Cited By ~ 4

Author(s):

M K Kuhner ◽

J Yamato ◽

J Felsenstein

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Prior Probability ◽

Sequence Data ◽

Population Sample ◽

Small Samples ◽

Effective Population ◽

And Migration ◽

Varying Population

Abstract We present a new way to make a maximum likelihood estimate of the parameter 4N mu (effective population size times mutation rate per site, or theta) based on a population sample of molecular sequences. We use a Metropolis-Hastings Markov chain Monte Carlo method to sample genealogies in proportion to the product of their likelihood with respect to the data and their prior probability with respect to a coalescent distribution. A specific value of theta must be chosen to generate the coalescent distribution, but the resulting trees can be used to evaluate the likelihood at other values of theta, generating a likelihood curve. This procedure concentrates sampling on those genealogies that contribute most of the likelihood, allowing estimation of meaningful likelihood curves based on relatively small samples. The method can potentially be extended to cases involving varying population size, recombination, and migration.

Download Full-text

Estimating Effective Population Size or Mutation Rate With Microsatellites

Genetics ◽

10.1534/genetics.166.1.555 ◽

2004 ◽

Vol 166 (1) ◽

pp. 555-563 ◽

Cited By ~ 42

Author(s):

Hongyan Xu ◽

Yun-Xin Fu

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Effective Population

Download Full-text

Mobile elements reveal small population size in the ancient ancestors of Homo sapiens

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.0909000107 ◽

2010 ◽

Vol 107 (5) ◽

pp. 2147-2152 ◽

Cited By ~ 27

Author(s):

Chad D. Huff ◽

Jinchuan Xing ◽

Alan R. Rogers ◽

David Witherspoon ◽

Lynn B. Jorde

Keyword(s):

Population Size ◽

Effective Population Size ◽

Dna Sequences ◽

Mobile Element ◽

Small Population ◽

Mobile Elements ◽

Population History ◽

Recent Common Ancestor ◽

Demographic Model ◽

Effective Population

The genealogies of different genetic loci vary in depth. The deeper the genealogy, the greater the chance that it will include a rare event, such as the insertion of a mobile element. Therefore, the genealogy of a region that contains a mobile element is on average older than that of the rest of the genome. In a simple demographic model, the expected time to most recent common ancestor (TMRCA) is doubled if a rare insertion is present. We test this expectation by examining single nucleotide polymorphisms around polymorphic Alu insertions from two completely sequenced human genomes. The estimated TMRCA for regions containing a polymorphic insertion is two times larger than the genomic average (P < <10−30), as predicted. Because genealogies that contain polymorphic mobile elements are old, they are shaped largely by the forces of ancient population history and are insensitive to recent demographic events, such as bottlenecks and expansions. Remarkably, the information in just two human DNA sequences provides substantial information about ancient human population size. By comparing the likelihood of various demographic models, we estimate that the effective population size of human ancestors living before 1.2 million years ago was 18,500, and we can reject all models where the ancient effective population size was larger than 26,000. This result implies an unusually small population for a species spread across the entire Old World, particularly in light of the effective population sizes of chimpanzees (21,000) and gorillas (25,000), which each inhabit only one part of a single continent.

Download Full-text

Methods for Estimating Demography and Detecting Between-Locus Differences in the Effective Population Size and Mutation Rate

Molecular Biology and Evolution ◽

10.1093/molbev/msy212 ◽

2018 ◽

Vol 36 (2) ◽

pp. 423-433 ◽

Cited By ~ 4

Author(s):

Kai Zeng ◽

Benjamin C Jackson ◽

Henry J Barton

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Effective Population

Download Full-text

Description and validation of a method for simultaneous estimation of effective population size and mutation rate from human population data.

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.86.23.9407 ◽

1989 ◽

Vol 86 (23) ◽

pp. 9407-9411 ◽

Cited By ~ 18

Author(s):

R. Chakraborty ◽

J. V. Neel

Keyword(s):

Population Size ◽

Effective Population Size ◽

Mutation Rate ◽

Human Population ◽

Population Data ◽

Simultaneous Estimation ◽

Effective Population

Download Full-text

Robust estimation of recent effective population size from number of independent origins in soft sweeps

10.1101/472266 ◽

2018 ◽

Author(s):

Bhavin S. Khatri ◽

Austin Burt

Keyword(s):

Population Size ◽

Effective Population Size ◽

Demographic History ◽

Population Sample ◽

Natural Populations ◽

Population History ◽

Effective Population ◽

Current Frequency ◽

Population Size Estimate ◽

Recurrent Mutations

Estimating recent effective population size is of great importance in characterising and predicting the evolution of natural populations. Methods based on nucleotide diversity may underestimate current day effective population sizes due to historical bottlenecks, whilst methods that reconstruct demographic history typically only detect long-term variations. However, soft selective sweeps, which leave a fingerprint of mutational history by recurrent mutations on independent haplotype backgrounds, holds promise of an estimate more representative of recent population history. Here we present a simple and robust method of estimation based only on knowledge of the number of independent recurrent origins and the current frequency of the beneficial allele in a population sample, independent of the strength of selection and age of the mutation. Using a forward time theoretical framework, we show the mean number of origins is a function of θ = 2Nμ and current allele frequency, through a simple equation, and the distribution is approximately Poisson. This estimate is robust to whether mutants pre-existed before selection arose, and is equally accurate for diploid populations with incomplete dominance. For fast (e.g., seasonal) demographic changes compared to time scale for fixation of the mutant allele, and for moderate peak-to-trough ratios, we show our constant population size estimate can be used to bound the maximum and minimum population size. Applied to the Vgsc gene of Anopheles gambiae, we estimate an effective population size of roughly 6 × 107, and including seasonal demographic oscillations, a minimum effective population size greater than 6 × 106 and a maximum less than 3 × 109.

Download Full-text

Natural selection does not affect the estimates of effective population size based on linkage disequilibrium

10.1101/2021.08.16.456457 ◽

2021 ◽

Author(s):

Irene Novo ◽

Armando Caballero ◽

Enrique Santiago

Keyword(s):

Linkage Disequilibrium ◽

Population Size ◽

Effective Population Size ◽

Recombination Rate ◽

Nucleotide Diversity ◽

Loss Of Function ◽

Effective Population ◽

Genomic Regions ◽

The Relationship ◽

Key Parameter

The effective population size ( N e ) is a key parameter to quantify the magnitude of genetic drift and inbreeding, with important implications in human evolution. The increasing availability of high-density genetic markers allows the estimation of historical changes in N e across time using measures of genome diversity or linkage disequilibrium between markers. Selection is expected to reduce diversity and N e , and this reduction is modulated by the heterogeneity of the genome in terms of recombination rate. Here we investigate by computer simulations the consequences of selection (both positive and negative) and of recombination rate heterogeneity in the estimation of historical N e . We also investigate the relationship between diversity parameters and N e across the different regions of the genome using human marker data. We show that the estimates of historical N e obtained from linkage disequilibrium between markers ( N e LD ) are virtually unaffected by selection. In contrast, those estimates obtained by coalescence mutation-recombination-based methods can be strongly affected by it, what could have important consequences for the estimation of human demography. The simulation results are supported by the analysis of human data. The estimates of N e LD obtained for particular genomic regions do not correlate with recombination rate, nucleotide diversity, polymorphism, background selection statistic, minor allele frequency of SNPs, loss of function and missense variants and gene density. This suggests that N e LD measures are merely indicative of demographic changes in population size across generations.

Download Full-text