neutral mutations
Recently Published Documents


TOTAL DOCUMENTS

166
(FIVE YEARS 38)

H-INDEX

25
(FIVE YEARS 4)

2021 ◽  
Author(s):  
Brian Charlesworth

The effects of selection on variability at linked sites have an important influence on levels and patterns of within-population variation across the genome. Most theoretical models of these effects have assumed that selection is sufficiently strong that allele frequency changes at the loci concerned are largely deterministic. These models have led to the conclusion that directional selection for new selectively favorable mutations, or against recurrent deleterious mutations, reduces nucleotide site diversity at linked neutral sites. Recent work has shown, however, that fixations of weakly selected mutations, accompanied by significant stochastic changes in allele frequencies, can sometimes cause higher diversity at linked sites when compared with the effects of fixations of neutral mutations. The present paper extends this work by deriving approximate expressions for the mean times to loss and fixation of mutations subject to selection, and analysing the conditions under which selection increases rather than reduces these times. Simulations are used to examine the relations between diversity at a neutral site and the fixation and loss times of mutations at a linked site subject to selection. It is shown that the long-term level of neutral diversity can be increased over the equilibrium expectation in the absence of selection by recurrent fixations and losses of linked, weakly selected dominant or partially dominant favorable mutations, and by linked recessive or partially recessive deleterious mutations. The results are used to examine the conditions under which associative overdominance, as opposed to background selection, is likely to operate.


2021 ◽  
Author(s):  
David Ding ◽  
Anna G. Green ◽  
Boyuan Wang ◽  
Thuy-Lan Vo Lite ◽  
Eli N. Weinstein ◽  
...  

Proteins often accumulate neutral mutations that do not affect current functions1 but can profoundly influence future mutational possibilities and functions2-4. Understanding such hidden potential has major implications for protein design and evolutionary forecasting5-7, but has been limited by a lack of systematic efforts to identify potentiating mutations8,9. Here, through the comprehensive analysis of a bacterial toxin-antitoxin system, we identified all possible single substitutions in the toxin that enable it to tolerate otherwise interface-disrupting mutations in its antitoxin. Strikingly, the majority of enabling mutations in the toxin do not contact, and promote tolerance non-specifically to, many different antitoxin mutations, despite covariation in homologs occurring primarily between specific pairs of contacting residues across the interface. In addition, the enabling mutations we identified expand future mutational paths that both maintain old toxin-antitoxin interactions and form new ones. These non-specific mutations are missed by widely used covariation and machine learning methods10,11. Identifying such enabling mutations will be critical for ensuring continued binding of therapeutically relevant proteins, such as antibodies, aimed at evolving targets12-14.


2021 ◽  
Author(s):  
Narod Kebabci ◽  
Ahmet Can Timucin ◽  
Emel Timucin

AbstractProtein stability datasets contain neutral mutations that are highly concentrated in a much narrower ΔΔG range than destabilizing and stabilizing mutations. Notwith-standing their high density, often studies analyzing stability datasets and/or predictors ignore the neutral mutations and use a binary classification scheme labeling only destabilizing and stabilizing mutations. Recognizing that highly concentrated neutral mutations would affect the quality of stability datasets, we have explored three protein stability datasets; S2648, PON-tstab and the symmetric Ssym that differ in size and quality. A characteristic leptokurtic shape in the ΔΔG distributions of all three datasets including the curated and symmetric ones were reported due to concentrated neutral mutations. To further investigate the impact of neutral mutations on ΔΔG predictions, we have comprehensively assessed the performance of eleven predictors on the PON-tstab dataset. Correlation and error analyses showed that all of the predictors performed the best on the neutral mutations while their performance became gradually worse as the ΔΔG of the mutations departed further from the neutral zone regardless of the direction, implying a bias towards dense mutations. To this end, after unraveling the role of concentrated neutral mutations in biases of stability datasets, we described a systematic under-sampling approach to balance the ΔΔG distributions. Before under-sampling, mutations were clustered based on their biochemical and/or structural features and then three mutations were systematically selected from every 2 kcal/mol of each cluster. Upon implementation of this approach by distinct clustering schemes, we generated five subsets varying in size and ΔΔG distributions. All subsets notably showed amelioration of not only the shape of ΔΔG distributions but also other pre-existing imbalances in the frequency distributions. We also reported differences in the performance of the predictors between the parent and under-sampled subsets due to the enrichment of previously under-represented mutations in the subsets. Altogether, this study not only elaborated the pivotal role of concentrated mutations in the dataset biases but also contemplated and realized a rational strategy to tackle this and other forms of biases. Under-sampling code is available on GitHub (https://github.com/narodkebabci/gRoR).


Author(s):  
Robert Horvath ◽  
Emily B Josephs ◽  
Edouard Pesquet ◽  
John R Stinchcombe ◽  
Stephen I Wright ◽  
...  

Abstract Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large noncoding fraction, functional noncoding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional noncoding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favorable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on noncoding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.


2021 ◽  
Author(s):  
Sebastian Cuadros-Espinoza ◽  
Guillaume Laval ◽  
Lluís Quintana-Murci ◽  
Etienne Patin

Admixture has been a pervasive phenomenon in human history, shaping extensively the patterns of population genetic diversity. There is increasing evidence to suggest that admixture can also facilitate genetic adaptation to local environments, i.e., admixed populations acquire beneficial mutations from source populations, a process that we refer to as adaptive admixture. However, the role of adaptive admixture in human evolution and the power to detect it are poorly characterized. Here, we use extensive computer simulations to evaluate the power of several neutrality statistics to detect natural selection in the admixed population, accounting for background selection and assuming different admixture scenarios. We show that two statistics based on admixture proportions, F_adm and LAD, show high power to detect mutations that are beneficial in the admixed population, whereas iHS and F_ST falsely detect neutral mutations that have been selected in the source populations only. By combining F_adm and LAD into a single statistic, we scanned the genomes of 15 worldwide, admixed populations for signatures of adaptive admixture. We confirm that lactase persistence and resistance to malaria have been under adaptive admixture in West Africa and in Madagascar, North Africa and South Asia, respectively. Our approach also uncovers new cases of adaptive admixture, including the APOL1 / MYH9 locus in the Fulani nomads and PKN2 in East Indonesians, involved in resistance to infection and metabolism, respectively. Collectively, our study provides new evidence that adaptive admixture has occurred in multiple human populations, whose genetic history is characterized by periods of isolation and spatial expansions resulting in increased gene flow.


2021 ◽  
pp. 1-23
Author(s):  
Léo Françoso Dal Piccol Sotto ◽  
Franz Rothlauf ◽  
Vinçcius Veloso de Melo ◽  
Márcio P. Basgalupp

Abstract Linear Genetic Programming (LGP) represents programs as sequences of instructions and has a Directed Acyclic Graph (DAG) dataflow. The results of instructions are stored in registers that can be used as arguments by other instructions. Instructions that are disconnected from the main part of the program are called non-effective instructions, or structural introns. They also appear in other DAG-based GP approaches like Cartesian Genetic Programming (CGP). This paper studies four hypotheses on the role of structural introns: non-effective instructions (1) serve as evolutionary memory, where evolved information is stored and later used in search, (2) preserve population diversity, (3) allow neutral search, where structural introns increase the number of neutral mutations and improve performance, and (4) serve as genetic material to enable program growth. We study different variants of LGP controlling the influence of introns for symbolic regression, classification, and digital circuits problems. We find that there is (1) evolved information in the non-effective instructions that can be reactivated and that (2) structural introns can promote programs with higher effective diversity. However, both effects have no influence on LGP search performance. On the other hand, allowing mutations to not only be applied to effective but also to noneffective instructions (3) increases the rate of neutral mutations and (4) contributes to program growth by making use of the genetic material available as structural introns. This comes along with a significant increase of LGP performance, which makes structural introns important for LGP.


2021 ◽  
Author(s):  
Maxwell A Sherman ◽  
Adam Yaari ◽  
Oliver Priebe ◽  
Felix Dietlein ◽  
Po-Ru Loh ◽  
...  

An ongoing challenge to better understand and treat cancer is to distinguish neutral mutations, which do not affect tumor fitness, from those that provide a proliferative advantage. However, the variability of mutation rates has limited our ability to model patterns of neutral mutations and therefore identify cancer driver mutations. Here, we predict cancer-specific mutation rates genome-wide by leveraging deep neural networks to learn mutation rates within kilobase-scale regions and then refining these estimates to test for evidence of selection on combinations of mutations by comparing observed to expected mutation counts. We mapped mutation rates for 37 cancer types and used these maps to identify new putative drivers in understudied regions of the genome including cryptic alternative-splice sites, 5 prime untranslated regions and infrequently mutated genes. These results, available for exploration via web interface, indicate the potential for high-resolution neutral mutation models to empower further driver discovery as cancer sequencing cohorts grow.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Xiao-Lin Chu ◽  
Quan-Guo Zhang

Abstract Background Mutation accumulation (MA) has profound ecological and evolutionary consequences. One example is that accumulation of conditionally neutral mutations leads to fitness trade-offs among heterogenous habitats which cause population divergence. Here we suggest that temperature, which controls the rates of all biochemical and biophysical processes, should play a crucial role for determining mutational effects. Particularly, warmer temperatures may mitigate the effects of some, not all, deleterious mutations and cause stronger environmental dependence in MA effects. Results We experimentally tested the above hypothesis by measuring the growth performance of ten Escherichia coli genotypes on six carbon resources across ten temperatures, where the ten genotypes were derived from a single ancestral strain and accumulated spontaneous mutations. We analyzed resource dependence of MA consequences for growth yields. The MA genotypes typically showed reduced growth yields relative to the ancestral type; and the magnitude of reduction was smaller at intermediate temperatures. Stronger resource dependence in MA consequences for growth performance was observed at higher temperatures. Specifically, the MA genotypes were more likely to show impaired growth performance on all the six carbon resources when grown at lower temperatures; but suffered growth performance loss only on some, not all the six, carbon substrates at higher temperatures. Conclusions Higher temperatures increase the chance that MA causes conditionally neutral fitness effects while MA is more likely to cause fitness loss regardless of available resources at lower temperatures. This finding has implications for understanding how geographic patterns in population divergence may emerge, and how conservation practices, particularly protection of diverse microhabitats, may mitigate the impacts of global warming.


2021 ◽  
Author(s):  
Neil A Robertson ◽  
Eric Latorre-Crespo ◽  
Maria Terrada-Terradas ◽  
Alison C Purcell ◽  
Benjamin J Livesey ◽  
...  

The prevalence of clonal haematopoiesis of indeterminate potential (CHIP) in healthy individuals increases rapidly from age 60 onwards and has been associated with increased risk for malignancy, heart disease and ischemic stroke. CHIP is driven by somatic mutations in stem cells that are also drivers of myeloid malignancies. Since mutations in stem cells often drive leukaemia, we hypothesised that stem cell fitness substantially contributes to transformation from CHIP to leukaemia. Stem cell fitness is defined as the proliferative advantage over cells carrying no or only neutral mutations. We set out to quantify the fitness effects of CHIP drivers over a 15 year timespan in older age, using longitudinal error-corrected sequencing data. It is currently unknown whether mutations in different CHIP genes lead to distinct fitness advantages that could form the basis for patient stratification. We developed a new method based on drift-induced fluctuation (DIF) filtering to extract fitness effects from longitudinal data, and thus quantify the growth potential of variants within each individual. Our approach discriminates naturally drifting populations of cells and faster growing clones, while taking into account individual mutational context. We show that gene-specific fitness differences can outweigh inter-individual variation and therefore could form the basis for personalised clinical management.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Mariëlle J. F. M. van Kooten ◽  
Clio A. Scheidegger ◽  
Matthias Christen ◽  
Beat Christen

AbstractSequence rewriting enables low-cost genome synthesis and the design of biological systems with orthogonal genetic codes. The error-free, robust rewriting of nucleotide sequences can be achieved with a complete annotation of gene regulatory elements. Here, we compare transcription in Caulobacter crescentus to transcription from plasmid-borne segments of the synthesized genome of C. ethensis 2.0. This rewritten derivative contains an extensive amount of supposedly neutral mutations, including 123’562 synonymous codon changes. The transcriptional landscape refines 60 promoter annotations, exposes 18 termination elements and links extensive transcription throughout the synthesized genome to the unintentional introduction of sigma factor binding motifs. We reveal translational regulation for 20 CDS and uncover an essential translational regulatory element for the expression of ribosomal protein RplS. The annotation of gene regulatory elements allowed us to formulate design principles that improve design schemes for synthesized DNA, en route to a bright future of iteration-free programming of biological systems.


Sign in / Sign up

Export Citation Format

Share Document