scholarly journals Weak Correlation between Nucleotide Variation and Recombination Rate across the House Mouse Genome

2020 ◽  
Vol 12 (4) ◽  
pp. 293-299
Author(s):  
Michael E Kartje ◽  
Peicheng Jing ◽  
Bret A Payseur

Abstract Positive selection and purifying selection reduce levels of variation at linked neutral loci. One consequence of these processes is that the amount of neutral diversity and the meiotic recombination rate are predicted to be positively correlated across the genome—a prediction met in some species but not others. To better document the prevalence of selection at linked sites, we used new and published whole-genome sequences to survey nucleotide variation in population samples of the western European house mouse (Mus musculus domesticus) from Germany, France, and Gough Island, a remote volcanic island in the south Atlantic. Correlations between sequence variation and recombination rates estimated independently from dense linkage maps were consistently very weak (ρ ≤ 0.06), though they exceeded conventional significance thresholds. This pattern persisted in comparisons between genomic regions with the highest and lowest recombination rates, as well as in models incorporating the density of transcribed sites, the density of CpG dinucleotides, and divergence between mouse and rat as covariates. We conclude that natural selection affects linked neutral variation in a restricted manner in the western European house mouse.

Genetics ◽  
1997 ◽  
Vol 147 (3) ◽  
pp. 1303-1316
Author(s):  
Michael W Nachman

Introns of four X-linked genes (Hprt, Plp, Glra2, and Amg) were sequenced to provide an estimate of nucleotide diversity at nuclear genes within the house mouse and to test the neutral prediction that the ratio of intraspecific polymorphism to interspecific divergence is the same for different loci. Hprt and Plp lie in a region of the X chromosome that experiences relatively low recombination rates, while Glra2 and Amg lie near the telomere of the X chromosome, a region that experiences higher recombination rates. A total of 6022 bases were sequenced in each of 10 Mus domesticus and one M. caroli. Average nucleotide diversity (π) for introns within M. domesticus was quite low (π = 0.078%). However, there was substantial variation in the level of heterozygosity among loci. The two telomeric loci, Glra2 and Amg, had higher ratios of polymorphism to divergence than the two loci experiencing lower recombination rates. These results are consistent with the hypothesis that heterozygosity is reduced in regions with lower rates of recombination, although sampling of additional genes is needed to establish whether there is a general correlation between heterozygosity and recombination rate as in Drosophila melanogaster.


Genetics ◽  
2003 ◽  
Vol 165 (4) ◽  
pp. 2213-2233 ◽  
Author(s):  
Na Li ◽  
Matthew Stephens

AbstractWe introduce a new statistical model for patterns of linkage disequilibrium (LD) among multiple SNPs in a population sample. The model overcomes limitations of existing approaches to understanding, summarizing, and interpreting LD by (i) relating patterns of LD directly to the underlying recombination process; (ii) considering all loci simultaneously, rather than pairwise; (iii) avoiding the assumption that LD necessarily has a “block-like” structure; and (iv) being computationally tractable for huge genomic regions (up to complete chromosomes). We examine in detail one natural application of the model: estimation of underlying recombination rates from population data. Using simulation, we show that in the case where recombination is assumed constant across the region of interest, recombination rate estimates based on our model are competitive with the very best of current available methods. More importantly, we demonstrate, on real and simulated data, the potential of the model to help identify and quantify fine-scale variation in recombination rate from population data. We also outline how the model could be useful in other contexts, such as in the development of more efficient haplotype-based methods for LD mapping.


eLife ◽  
2018 ◽  
Vol 7 ◽  
Author(s):  
Fanny Pouyet ◽  
Simon Aeschbacher ◽  
Alexandre Thiéry ◽  
Laurent Excoffier

Disentangling the effect on genomic diversity of natural selection from that of demography is notoriously difficult, but necessary to properly reconstruct the history of species. Here, we use high-quality human genomic data to show that purifying selection at linked sites (i.e. background selection, BGS) and GC-biased gene conversion (gBGC) together affect as much as 95% of the variants of our genome. We find that the magnitude and relative importance of BGS and gBGC are largely determined by variation in recombination rate and base composition. Importantly, synonymous sites and non-transcribed regions are also affected, albeit to different degrees. Their use for demographic inference can lead to strong biases. However, by conditioning on genomic regions with recombination rates above 1.5 cM/Mb and mutation types (C↔G, A↔T), we identify a set of SNPs that is mostly unaffected by BGS or gBGC, and that avoids these biases in the reconstruction of human history.


2019 ◽  
Author(s):  
Fanny Pouyet

Disentangling the effect on genomic diversity of natural selection from that of demography is notoriously difficult, but necessary to properly reconstruct the history of species. Here, we use high-quality human genomic data to show that purifying selection at linked sites (i.e. background selection, BGS) and GC-biased gene conversion (gBGC) together affect as much as 95% of the variants of our genome. We find that the magnitude and relative importance of BGS and gBGC are largely determined by variation in recombination rate and base composition. Importantly, synonymous sites and non-transcribed regions are also affected, albeit to different degrees. Their use for demographic inference can lead to strong biases. However, by conditioning on genomic regions with recombination rates above 1.5 cM/Mb and mutation types (C↔G, A↔T), we identify a set of SNPs that is mostly unaffected by BGS or gBGC, and that avoids these biases in the reconstruction of human history.


2020 ◽  
Author(s):  
William R. Rice

Previous work found that the centromeric repeats of the Western European house mouse (Mus musculus domesticus) are composed predominantly of a 120 bp monomer that is shared by the X and autosomes. Polymorphism in length and sequence was also reported. Here I quantified the length and sequence polymorphism of the centromeric repeats found on the X and autosomes. The levels of local and global sequence variation were also compared. I found three length variants: a 64mer, 112mer and 120mer with relative frequencies of 2.4%, 8.6%, and 89%, respectively. There was substantial sequence variation within all three length variants with a rank-order of: 64mer < 120mer < 112mer. The 64mer was never found alone on long Sanger traces, and was arranged predominantly as a 176 bp higher-order repeat composed of a 64/112mer dimer. Reanalysis of archived ChIP-seq reads found that all three length variants were enriched with the foundational centromere protein CENP-A, but the enrichment was far higher for the 120mer. This pattern indicates that only the 120mer contributes substantially to the functional centromeres, i.e., to the kinetochore-binding, centric cores of the centromeric repeat arrays. Despite only moderate sequence divergence among random pairs of 120mers (averaging 5.9%), other measures of sequence diversity were exceptionally high: i) variant richness (numerical diversity) –on average, one new sequence variant was observed every 4th additional monomer randomly sampled (in N = 7.2 × 103 monomers), and ii) variant evenness –all of the nearly 2 × 103 observed sequence variants were at low frequency, with the most common variant having a frequency of only 5.7%. I next used long Sanger trace data from the Mouse Genome Project to assess the pattern of monomer diversity among neighboring 120mers. Unexpectedly, side-by-side monomers were rarely identical in sequence, and sequence divergence between these neighbors was nearly as high as that between random pairs taken from the genome-wide pool of all 120mers. I also used long Sanger traces to determine sequence variation among neighborhoods of 5 contiguous 120 bp monomers. Sequence diversity within these small regions typically spanned most of the entire range of that found genome-wide. Despite high sequence variation within these neighborhoods, the density of monomers with functional binding motifs for CENP-B (i.e., b-boxes with sequence NTTCGNNNNANNCGGGN) was strongly conserved at about 50%. The overarching pattern of monomer structure at the centromeric repeats of this subspecies is: i) high homogeneity in the density CENP-B binding sites, and ii) high heterogeneity in monomer sequence at both local and global levels.


Author(s):  
William Rice

The companion paper (Rice 2020) found that the centromeric repeats of the Western European house mouse (Mus musculus domesticus) have unusual structure: i) despite moderate pairwise sequence divergence (average = 5.9%), no monomer sequence was common and many hundreds of monomer sequences were observed, ii) local sequence divergence among neighboring monomers was nearly as high as genome-wide divergence, and iii) matching sequences were rare between side-by-side monomers. Here I integrate information from many published studies to formulate a hypothesis for the evolution of this structure. Non-matching sequences of neighboring centromeric monomers is hypothesized to be selectively favored in the context of molecular drive because it reduces the rate of monomer deletion during repair of double strand breaks (DSBs) via the Single Strand Annealing (SSA) pathway. The foundation for the hypothesis is the observation that centromeres of most populations of M. m. domestics reside close to the telomere, i.e., all their chromosomes are telocentrics. This proximity influences repair of centromeric DSBs because it places at least part of the centromere within the Telomere-Affected Repair Region (TARR; a location with increased concentrations of the shelterin-complex proteins that bind telomeres, especially TRF2). Shelterin proteins increase the level of 5&rsquo;&rarr;3&rsquo; end resection at DSBs and thereby: i) decrease the frequency of repair via the c- NHEJ pathway, and ii) increase the frequency of homology-directed repair (HD-repair) &ndash;including the SSA repair pathway. It is hypothesized that certain &lsquo;trigger&rsquo; events (e.g., sub-telomeric deletions) occur in local populations that increase the influence of TARR on the centromere. This increase elevates the occurrence of SSA repair of centromeric DSBs to a level that causes centromeres to begin to gradually shrink. Chronic shrinkage leads to coevolution between centromere size and karyotype number. Once centromeres shrink to a size below a critical minimum (that causes substantially reduced kinetochore size), fusions between non- homologous telocentrics with undersized centromeres produces metacentrics with an expanded centromere size (and a corresponding &lsquo;quantum-jump&rsquo; in kinetochore size). These metacentrics: i) accumulate to fixation because they are favored by centromere drive, and ii) are released from the influence of TARR and thereby gradually recover larger centromere size. Fission of metacentrics with enlarged centromeres can next plausibly regenerate pairs of telocentrics with sufficiently large centromeres (which recruit normal-sized kinetochores) to be favored by centromere drive and accumulate to fixation. This fixation completes a cycle of coevolution within genomes that oscillate between two extremes: i) high karyotype number (2N = 40; all telocentrics) with larger centromeres, and ii) low karyotype number (2N &lt;&lt; 40; mainly metacentrics) with initially small centromeres that gradually increase in size.


2021 ◽  
Author(s):  
Tom R Booker ◽  
Bret A Payseur ◽  
Anna Tigano

Background selection (BGS), the effect that purifying selection exerts on sites linked to deleterious alleles, is expected to be ubiquitous across eukaryotic genomes. The effects of BGS reflect the interplay of the rates and fitness effects of deleterious mutations with recombination. A fundamental assumption of BGS models is that recombination rates are invariant over time. However, in some lineages recombination rates evolve rapidly, violating this central assumption. Here, we investigate how recombination rate evolution affects genetic variation under BGS. We show that recombination rate evolution modifies the effects of BGS in a manner similar to a localised change in the effective population size, potentially leading to an underestimation of the genome-wide effects of selection. Furthermore, we find evidence that recombination rate evolution in the ancestors of modern house mice may have impacted inferences of the genome-wide effects of selection in that species.


2014 ◽  
Vol 281 (1786) ◽  
pp. 20140297 ◽  
Author(s):  
Laia Capilla ◽  
Nuria Medarde ◽  
Alexandra Alemany-Schmidt ◽  
Maria Oliver-Bonet ◽  
Jacint Ventura ◽  
...  

Despite the existence of formal models to explain how chromosomal rearrangements can be fixed in a population in the presence of gene flow, few empirical data are available regarding the mechanisms by which genome shuffling contributes to speciation, especially in mammals. In order to shed light on this intriguing evolutionary process, here we present a detailed empirical study that shows how Robertsonian (Rb) fusions alter the chromosomal distribution of recombination events during the formation of the germline in a Rb system of the western house mouse ( Mus musculus domesticus ). Our results indicate that both the total number of meiotic crossovers and the chromosomal distribution of recombination events are reduced in mice with Rb fusions and that this can be related to alterations in epigenetic signatures for heterochromatinization. Furthermore, we detected novel house mouse Prdm9 allelic variants in the Rb system. Remarkably, mean recombination rates were positively correlated with a decrease in the number of ZnF domains in the Prdm9 gene. The suggestion that recombination can be modulated by both chromosomal reorganizations and genetic determinants that control the formation of double-stranded breaks during meiosis opens new avenues for understanding the role of recombination in chromosomal speciation.


Genetics ◽  
1998 ◽  
Vol 148 (1) ◽  
pp. 423-434
Author(s):  
Jan Dvorřák ◽  
Ming-Cheng Luo ◽  
Zu-Li Yang

Abstract RFLP was investigated at 52 single-copy gene loci among six species of Aegilops, including both cross-fertilizing and self-fertilizing species. Average gene diversity (H) was found to correlate with the level of outcrossing. No relationship was found between H and the phylogenetic status of a species. In all six species, the level of RFLP at a locus was a function of the position of the locus on the chromosome and the recombination rate in the neighborhood of the locus. Loci in the proximal chromosome regions, which show greatly reduced recombination rates relative to the distal regions, were significantly less variable than loci in the distal chromosome regions in all six species. Variation in recombination rates was also reflected in the haplotype divergence between closely related species; loci in the chromosome regions with low recombination rates were found to be diverged less than those in the chromosome regions with high recombination rates. This relationship was not found among the more distantly related species.


Genetics ◽  
2000 ◽  
Vol 156 (3) ◽  
pp. 1285-1298 ◽  
Author(s):  
Bret A Payseur ◽  
Michael W Nachman

Abstract Background (purifying) selection on deleterious mutations is expected to remove linked neutral mutations from a population, resulting in a positive correlation between recombination rate and levels of neutral genetic variation, even for markers with high mutation rates. We tested this prediction of the background selection model by comparing recombination rate and levels of microsatellite polymorphism in humans. Published data for 28 unrelated Europeans were used to estimate microsatellite polymorphism (number of alleles, heterozygosity, and variance in allele size) for loci throughout the genome. Recombination rates were estimated from comparisons of genetic and physical maps. First, we analyzed 61 loci from chromosome 22, using the complete sequence of this chromosome to provide exact physical locations. These 61 microsatellites showed no correlation between levels of variation and recombination rate. We then used radiation-hybrid and cytogenetic maps to calculate recombination rates throughout the genome. Recombination rates varied by more than one order of magnitude, and most chromosomes showed significant suppression of recombination near the centromere. Genome-wide analyses provided no evidence for a strong positive correlation between recombination rate and polymorphism, although analyses of loci with at least 20 repeats suggested a weak positive correlation. Comparisons of microsatellites in lowest-recombination and highest-recombination regions also revealed no difference in levels of polymorphism. Together, these results indicate that background selection is not a major determinant of microsatellite variation in humans.


Sign in / Sign up

Export Citation Format

Share Document