scholarly journals The role of mutation bias in adaptive molecular evolution: insights from convergent changes in protein function

2019 ◽  
Author(s):  
Jay F. Storz ◽  
Chandrasekhar Natarajan ◽  
Anthony V. Signore ◽  
Christopher C. Witt ◽  
David M. McCandlish ◽  
...  

AbstractAn underexplored question in evolutionary genetics concerns the extent to which mutational bias in the production of genetic variation influences outcomes and pathways of adaptive molecular evolution. In the genomes of at least some vertebrate taxa, an important form of mutation bias involves changes at CpG dinucleotides: If the DNA nucleotide cytosine (C) is immediately 5’ to guanine (G) on the same coding strand, then – depending on methylation status – point mutations at both sites occur at an elevated rate relative to mutations at non-CpG sites. Here we examine experimental data from case studies in which it has been possible to identify the causative substitutions that are responsible for adaptive changes in the functional properties of vertebrate hemoglobin (Hb). Specifically, we examine the molecular basis of convergent increases in Hb-O2affinity in high-altitude birds. Using a data set of experimentally verified, affinity-enhancing mutations in the Hbs of highland avian taxa, we tested whether causative changes are enriched for mutations at CpG dinucleotides relative to the frequency of CpG mutations among all possible missense mutations. The tests revealed that a disproportionate number of causative amino acid replacements were attributable to CpG mutations, suggesting that mutation bias can influence outcomes of molecular adaptation.


2019 ◽  
Vol 374 (1777) ◽  
pp. 20180238 ◽  
Author(s):  
Jay F. Storz ◽  
Chandrasekhar Natarajan ◽  
Anthony V. Signore ◽  
Christopher C. Witt ◽  
David M. McCandlish ◽  
...  

An underexplored question in evolutionary genetics concerns the extent to which mutational bias in the production of genetic variation influences outcomes and pathways of adaptive molecular evolution. In the genomes of at least some vertebrate taxa, an important form of mutation bias involves changes at CpG dinucleotides: if the DNA nucleotide cytosine (C) is immediately 5′ to guanine (G) on the same coding strand, then—depending on methylation status—point mutations at both sites occur at an elevated rate relative to mutations at non-CpG sites. Here, we examine experimental data from case studies in which it has been possible to identify the causative substitutions that are responsible for adaptive changes in the functional properties of vertebrate haemoglobin (Hb). Specifically, we examine the molecular basis of convergent increases in Hb–O 2 affinity in high-altitude birds. Using a dataset of experimentally verified, affinity-enhancing mutations in the Hbs of highland avian taxa, we tested whether causative changes are enriched for mutations at CpG dinucleotides relative to the frequency of CpG mutations among all possible missense mutations. The tests revealed that a disproportionate number of causative amino acid replacements were attributable to CpG mutations, suggesting that mutation bias can influence outcomes of molecular adaptation. This article is part of the theme issue ‘Convergent evolution in the genomics era: new insights and directions’.



2021 ◽  
Author(s):  
Kuan Pern Tan ◽  
Tejashree Rajaram Kanitkar ◽  
Kwoh Chee Keong ◽  
M.S. Madhusudhan

1.AbstractPredicting the functional consequences of single point mutations has relevance to protein function annotation and to clinical analysis/diagnosis. We developed and tested Packpred that makes use of a multi-body clique statistical potential in combination with a depth dependent amino acid substitution matrix (FADHM) and positional Shannon Entropy to predict the functional consequences of point mutations in proteins. Parameters were trained over a saturation mutagenesis data set of T4-lysozyme (1966 mutations). The method was tested over another saturation mutagenesis data set (CcdB; 1534 mutations) and the Missense3D data set (4099 mutations). The performance of Packpred was compared against those of six other contemporary methods. With MCC values of 0.42, 0.47 and 0.36 on the training and testing data sets respectively, Packpred outperforms all method in all data sets, with the exception of marginally underperforming to FADHM in the CcdB data set. On analyzing the results, we could build meta servers that chose best performing methods of wild type amino acids and for wild type-mutant amino acid pairs. This lead to an increase of MCC value of 0.40 and 0.51 for the two meta predictors respectively on the Missense3D data set. We conjecture that it is possible to improve accuracy with better meta predictors as among the 7 methods compared, at the least one method or another is able to correctly predict ∼99% of the data.



2021 ◽  
Vol 8 ◽  
Author(s):  
Kuan Pern Tan ◽  
Tejashree Rajaram Kanitkar ◽  
Chee Keong Kwoh ◽  
Mallur Srivatsan Madhusudhan

Predicting the functional consequences of single point mutations has relevance to protein function annotation and to clinical analysis/diagnosis. We developed and tested Packpred that makes use of a multi-body clique statistical potential in combination with a depth-dependent amino acid substitution matrix (FADHM) and positional Shannon entropy to predict the functional consequences of point mutations in proteins. Parameters were trained over a saturation mutagenesis data set of T4-lysozyme (1,966 mutations). The method was tested over another saturation mutagenesis data set (CcdB; 1,534 mutations) and the Missense3D data set (4,099 mutations). The performance of Packpred was compared against those of six other contemporary methods. With MCC values of 0.42, 0.47, and 0.36 on the training and testing data sets, respectively, Packpred outperforms all methods in all data sets, with the exception of marginally underperforming in comparison to FADHM in the CcdB data set. A meta server analysis was performed that chose best performing methods of wild-type amino acids and for wild-type mutant amino acid pairs. This led to an increase in the MCC value of 0.40 and 0.51 for the two meta predictors, respectively, on the Missense3D data set. We conjecture that it is possible to improve accuracy with better meta predictors as among the seven methods compared, at least one method or another is able to correctly predict ∼99% of the data.



1996 ◽  
Vol 76 (02) ◽  
pp. 253-257 ◽  
Author(s):  
Takeshi Hagiwara ◽  
Hiroshi Inaba ◽  
Shinichi Yoshida ◽  
Keiko Nagaizumi ◽  
Morio Arai ◽  
...  

SummaryGenetic materials from 16 unrelated Japanese patients with von Willebrand disease (vWD) were analyzed for mutations. Exon 28 of the von Willebrand factor (vWF) gene, where point mutations have been found most frequent, was screened by various restriction-enzyme analyses. Six patients were observed to have abnormal restriction patterns. By sequence analyses of the polymerase chain-reaction products, we identified a homozygous R1308C missense mutation in a patient with type 2B vWD; R1597W, R1597Q, G1609R and G1672R missense mutations in five patients with type 2A; and a G1659ter nonsense mutation in a patient with type 3 vWD. The G1672R was a novel missense mutation of the carboxyl-terminal end of the A2 domain. In addition, we detected an A/C polymorphism at nucleotide 4915 with HaeIII. There was no particular linkage disequilibrium of the A/C polymorphism, either with the G/A polymorphism at nucleotide 4391 detected with Hphl or with the C/T at 4891 detected with BstEll.



Genetics ◽  
2001 ◽  
Vol 159 (2) ◽  
pp. 839-852 ◽  
Author(s):  
Peter P Calabrese ◽  
Richard T Durrett ◽  
Charles F Aquadro

Abstract Recently Kruglyak, Durrett, Schug, and Aquadro showed that microsatellite equilibrium distributions can result from a balance between polymerase slippage and point mutations. Here, we introduce an elaboration of their model that keeps track of all parts of a perfect repeat and a simplification that ignores point mutations. We develop a detailed mathematical theory for these models that exhibits properties of microsatellite distributions, such as positive skewness of allele lengths, that are consistent with data but are inconsistent with the predictions of the stepwise mutation model. We use our theoretical results to analyze the successes and failures of the genetic distances (δμ)2 and DSW when used to date four divergences: African vs. non-African human populations, humans vs. chimpanzees, Drosophila melanogaster vs. D. simulans, and sheep vs. cattle. The influence of point mutations explains some of the problems with the last two examples, as does the fact that these genetic distances have large stochastic variance. However, we find that these two features are not enough to explain the problems of dating the human-chimpanzee split. One possible explanation of this phenomenon is that long microsatellites have a mutational bias that favors contractions over expansions.



2020 ◽  
Vol 19 ◽  
pp. 153303382098379
Author(s):  
Xiying Yu ◽  
Ying Teng ◽  
Xingran Jiang ◽  
Hui Yuan ◽  
Wei Jiang

Background: Cancer stem cells (CSCs) are considered the main cause of cancer recurrence and metastasis, and DNA methylation is involved in the maintenance of CSCs. However, the methylation profile of esophageal CSCs remains unknown. Methods: Side population (SP) cells were isolated from esophageal squamous cell carcinoma (ESCC) cell lines KYSE150 and EC109. Sphere-forming cells were collected from human primary esophageal cancer cells. SP cells and sphere-forming cells were used as substitutes for cancer stem-like cells. We investigated the genome-wide DNA methylation profile in esophageal cancer stem-like cells using reduced representation bisulfite sequencing (RRBS). Results: Methylated cytosine (mC) was found mostly in CpG dinucleotides, located mostly in the intronic, intergenic, and exonic regions. Forty intersected differentially methylated regions (DMRs) were identified in these 3 groups of samples. Thirteen differentially methylated genes with the same alteration trend were detected; these included OTX1, SPACA1, CD163L1, ST8SIA2, TECR, CADM3, GRM1, LRRK1, CHSY1, PROKR2, LINC00658, LOC100506688, and NKD2. DMRs covering ST8SIA2 and GRM1 were located in exons. These differentially methylated genes were involved in 10 categories of biological processes and 3 cell signaling pathways. Conclusions: When compared to non-CSCs, cancer stem-like cells have a differential methylation status, which provides an important biological base for understanding esophageal CSCs and developing therapeutic targets for esophageal cancer.







Neurology ◽  
2018 ◽  
Vol 91 (23) ◽  
pp. e2170-e2181 ◽  
Author(s):  
Oswaldo Lorenzo-Betancor ◽  
Patrick R. Blackburn ◽  
Emily Edwards ◽  
Rocío Vázquez-do-Campo ◽  
Eric W. Klee ◽  
...  

ObjectiveTo identify novel genes involved in the etiology of intracranial aneurysms (IAs) or subarachnoid hemorrhages (SAHs) using whole-exome sequencing.MethodsWe performed whole-exome sequencing in 13 individuals from 3 families with an autosomal dominant IA/SAH inheritance pattern to look for candidate genes for disease. In addition, we sequenced PCNT exon 38 in a further 161 idiopathic patients with IA/SAH to find additional carriers of potential pathogenic variants.ResultsWe identified 2 different variants in exon 38 from the PCNT gene shared between affected members from 2 different families with either IA or SAH (p.R2728C and p.V2811L). One hundred sixty-four samples with either SAH or IA were Sanger sequenced for the PCNT exon 38. Five additional missense mutations were identified. We also found a second p.V2811L carrier in a family with a history of neurovascular diseases.ConclusionThe PCNT gene encodes a protein that is involved in the process of microtubule nucleation and organization in interphase and mitosis. Biallelic loss-of-function mutations in PCNT cause a form of primordial dwarfism (microcephalic osteodysplastic primordial dwarfism type II), and ≈50% of these patients will develop neurovascular abnormalities, including IAs and SAHs. In addition, a complete Pcnt knockout mouse model (Pcnt−/−) published previously showed general vascular abnormalities, including intracranial hemorrhage. The variants in our families lie in the highly conserved PCNT protein-protein interaction domain, making PCNT a highly plausible candidate gene in cerebrovascular disease.



Sign in / Sign up

Export Citation Format

Share Document