A Phenotype–Genotype Codon Model for Detecting Adaptive Evolution

2019 ◽  
Vol 69 (4) ◽  
pp. 722-738 ◽  
Author(s):  
Christopher T Jones ◽  
Noor Youssef ◽  
Edward Susko ◽  
Joseph P Bielawski

Abstract A central objective in biology is to link adaptive evolution in a gene to structural and/or functional phenotypic novelties. Yet most analytic methods make inferences mainly from either phenotypic data or genetic data alone. A small number of models have been developed to infer correlations between the rate of molecular evolution and changes in a discrete or continuous life history trait. But such correlations are not necessarily evidence of adaptation. Here, we present a novel approach called the phenotype–genotype branch-site model (PG-BSM) designed to detect evidence of adaptive codon evolution associated with discrete-state phenotype evolution. An episode of adaptation is inferred under standard codon substitution models when there is evidence of positive selection in the form of an elevation in the nonsynonymous-to-synonymous rate ratio $\omega$ to a value $\omega > 1$. As it is becoming increasingly clear that $\omega > 1$ can occur without adaptation, the PG-BSM was formulated to infer an instance of adaptive evolution without appealing to evidence of positive selection. The null model makes use of a covarion-like component to account for general heterotachy (i.e., random changes in the evolutionary rate at a site over time). The alternative model employs samples of the phenotypic evolutionary history to test for phenomenological patterns of heterotachy consistent with specific mechanisms of molecular adaptation. These include 1) a persistent increase/decrease in $\omega$ at a site following a change in phenotype (the pattern) consistent with an increase/decrease in the functional importance of the site (the mechanism); and 2) a transient increase in $\omega$ at a site along a branch over which the phenotype changed (the pattern) consistent with a change in the site’s optimal amino acid (the mechanism). 
Rejection of the null is followed by post hoc analyses to identify sites with strongest evidence for adaptation in association with changes in the phenotype as well as the most likely evolutionary history of the phenotype. Simulation studies based on a novel method for generating mechanistically realistic signatures of molecular adaptation show that the PG-BSM has good statistical properties. Analyses of real alignments show that site patterns identified post hoc are consistent with the specific mechanisms of adaptation included in the alternate model. Further simulation studies show that the covarion-like component of the PG-BSM plays a crucial role in mitigating recently discovered statistical pathologies associated with confounding by accounting for heterotachy-by-any-cause. [Adaptive evolution; branch-site model; confounding; mutation-selection; phenotype–genotype.]
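
For reference, the rate ratio that the abstract above relies on can be written out as follows. The symbols $d_N$ and $d_S$ (nonsynonymous and synonymous substitution rates) are conventional notation assumed here, not taken from the abstract, and the three regimes are the standard textbook reading rather than the PG-BSM's own criteria.

```latex
\omega = \frac{d_N}{d_S}, \qquad
\begin{cases}
\omega < 1 & \text{purifying (negative) selection} \\
\omega = 1 & \text{neutral evolution} \\
\omega > 1 & \text{positive selection, as inferred under standard codon models}
\end{cases}
```

As the abstract notes, $\omega > 1$ can arise without adaptation, which is why the PG-BSM tests for phenotype-associated patterns of heterotachy instead of relying on this threshold.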

2021 ◽  
Vol 12 ◽  
Author(s):  
Vladimir M. Jovanovic ◽  
Melanie Sarfert ◽  
Carlos S. Reyna-Blanco ◽  
Henrike Indrischek ◽  
Dulce I. Valdivia ◽  
...  

Gene regulatory factors (GRFs), such as transcription factors, co-factors and histone-modifying enzymes, play many important roles in modifying gene expression in biological processes. They have also been proposed to underlie speciation and adaptation. To investigate potential contributions of GRFs to primate evolution, we analyzed GRF genes in 27 publicly available primate genomes. Genes coding for zinc finger (ZNF) proteins, especially ZNFs with a Krüppel-associated box (KRAB) domain were the most abundant TFs in all genomes. Gene numbers per TF family differed between all species. To detect signs of positive selection in GRF genes we investigated more than 3,000 human GRFs with their more than 70,000 orthologs in 26 non-human primates. We implemented two independent tests for positive selection, the branch-site-model of the PAML suite and aBSREL of the HyPhy suite, focusing on the human and great ape branch. Our workflow included rigorous procedures to reduce the number of false positives: excluding distantly similar orthologs, manual corrections of alignments, and considering only genes and sites detected by both tests for positive selection. Furthermore, we verified the candidate sites for selection by investigating their variation within human and non-human great ape population data. In order to approximately assign a date to positively selected sites in the human lineage, we analyzed archaic human genomes. Our work revealed with high confidence five GRFs that have been positively selected on the human lineage and one GRF that has been positively selected on the great ape lineage. These GRFs are scattered on different chromosomes and have been previously linked to diverse functions. For some of them a role in speciation and/or adaptation can be proposed based on the expression pattern or association with human diseases, but it seems that they all contributed independently to human evolution. 
Four of the positively selected GRFs are KRAB-ZNF proteins, which act by inducing changes in target-gene co-expression and/or through an arms race with transposable elements. Since each positively selected GRF contains several sites with evidence for positive selection, we suggest that these GRFs contributed pleiotropically to phenotypic adaptations in humans.
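
The false-positive filter described in this abstract (keeping only genes and sites detected by both tests) might be sketched as a simple set intersection. This is an illustrative sketch only: the function name, the dictionary layout, and the placeholder gene labels are assumptions, not the authors' pipeline or the output format of PAML or HyPhy.

```python
from typing import Dict, Set


def consensus_sites(
    test_a: Dict[str, Set[int]],
    test_b: Dict[str, Set[int]],
) -> Dict[str, Set[int]]:
    """Keep genes flagged by both tests, and within them only the shared sites.

    Each input maps a gene label to the set of codon positions that one test
    reported as positively selected (hypothetical, pre-parsed results).
    """
    consensus: Dict[str, Set[int]] = {}
    for gene in test_a.keys() & test_b.keys():
        shared = test_a[gene] & test_b[gene]
        if shared:  # drop genes where the two tests agree on no site
            consensus[gene] = shared
    return consensus


# Placeholder inputs; the labels and positions are invented for illustration.
branch_site_hits = {"GENE_A": {10, 42}, "GENE_B": {7}, "GENE_C": {3}}
second_test_hits = {"GENE_A": {42, 99}, "GENE_B": {8}}
kept = consensus_sites(branch_site_hits, second_test_hits)
```

Requiring agreement at the site level as well as the gene level is the stricter of the two possible readings of the abstract; a gene-level intersection alone would retain more candidates.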


2015 ◽  
Author(s):  
Stephane Guindon

In a recent study, Murrell et al. (2015) compared the performance of several branch-site models of codon evolution. Their interpretation of results published by Lu & Guindon (2014) suggests that the stochastic branch-site model implemented in the software fitmodel is anti-conservative altogether, i.e., positive selection is detected more often than expected when analyzing sequences evolving under a mixture of neutrality and negative selection. I argue here that this presentation of the performance of fitmodel is misleading and should not deter evolutionary biologists from using this approach in exploratory analyses of selection patterns at the molecular level.


2017 ◽  
Author(s):  
Aarti Venkat ◽  
Matthew W. Hahn ◽  
Joseph W. Thornton

Abstract Phylogenetic tests of adaptive evolution, which infer positive selection from an excess of nonsynonymous changes, assume that nucleotide substitutions occur singly and independently. But recent research has shown that multiple errors at adjacent sites often occur in single events during DNA replication. These multinucleotide mutations (MNMs) are overwhelmingly likely to be nonsynonymous. We therefore evaluated whether phylogenetic tests of adaptive evolution, such as the widely used branch-site test, might misinterpret sequence patterns produced by MNMs as false support for positive selection. We explored two genome-wide datasets comprising thousands of coding alignments – one from mammals and one from flies – and found that codons with multiple differences (CMDs) account for virtually all the support for lineage-specific positive selection inferred by the branch-site test. Simulations under genome-wide, empirically derived conditions without positive selection show that realistic rates of MNMs cause a strong and systematic bias in the branch-site and related tests; the bias is sufficient to produce false positive inferences approximately as often as the branch-site test infers positive selection from the empirical data. Our analysis indicates that genes may often be inferred to be under positive selection simply because they stochastically accumulated one or a few MNMs. Because these tests do not reliably distinguish sequence patterns produced by authentic positive selection from those caused by neutral fixation of MNMs, many published inferences of adaptive evolution using these techniques may therefore be artifacts of model violation caused by unincorporated neutral mutational processes. We develop an alternative model that incorporates MNMs and may be helpful in reducing this bias.
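
To make the notion of a codon with multiple differences (CMD) concrete, the following minimal sketch counts codons in which two or more nucleotide positions differ between two aligned coding sequences. It is a pairwise simplification under stated assumptions: the abstract does not say how CMDs were scored, the function names and example sequences are invented, and the inputs are assumed to be gap-free, in frame, and of equal length.

```python
from typing import List


def codon_differences(seq_a: str, seq_b: str) -> List[int]:
    """Return, for each codon, how many of its three positions differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if len(seq_a) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    diffs = []
    for start in range(0, len(seq_a), 3):
        codon_a = seq_a[start:start + 3]
        codon_b = seq_b[start:start + 3]
        diffs.append(sum(x != y for x, y in zip(codon_a, codon_b)))
    return diffs


def count_cmds(seq_a: str, seq_b: str) -> int:
    """Count codons with multiple (two or three) nucleotide differences."""
    return sum(1 for d in codon_differences(seq_a, seq_b) if d >= 2)


# Invented example: the second codon differs at two positions, the third at one.
ancestral = "ATGGCTAAA"
derived = "ATGTTTAAG"
per_codon = codon_differences(ancestral, derived)
n_cmds = count_cmds(ancestral, derived)
```

A count like this only flags candidate codons; it cannot by itself tell a single multinucleotide mutation apart from two independent single-nucleotide substitutions in the same codon.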


Author(s):  
Joshua H T Potter ◽  
Kalina T J Davies ◽  
Laurel R Yohe ◽  
Miluska K R Sanchez ◽  
Edgardo M Rengifo ◽  
...  

Abstract Dietary adaptation is a major feature of phenotypic and ecological diversification, yet the genetic basis of dietary shifts is poorly understood. Among mammals, Neotropical leaf-nosed bats (family Phyllostomidae) show unmatched diversity in diet; from a putative insectivorous ancestor, phyllostomids have radiated to specialize on diverse food sources, including blood, nectar, and fruit. To assess whether dietary diversification in this group was accompanied by molecular adaptations for changing metabolic demands, we sequenced 89 transcriptomes across 58 species, and combined these with published data to compare ∼13,000 protein coding genes across 66 species. We tested for positive selection on focal lineages, including those inferred to have undergone dietary shifts. Unexpectedly, we found a broad signature of positive selection in the ancestral phyllostomid branch, spanning genes implicated in the metabolism of all major macronutrients, yet few positively selected genes at the inferred switch to plantivory. Branches corresponding to blood- and nectar-based diets showed selection in loci underpinning nitrogenous waste excretion and glycolysis, respectively. Intriguingly, patterns of selection in metabolism genes were mirrored by those in loci implicated in craniofacial remodelling, a trait previously linked to phyllostomid dietary specialisation. Finally, using simulations, we show that the widely-used branch-site model is likely to be misspecified, with the implication that it is too conservative and probably under-reports true cases of positive selection. Our findings point to a complex picture of adaptive radiation, in which the evolution of new dietary specialisations has been facilitated by early adaptations combined with the generation of new genetic variation.


Insects ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 656
Author(s):  
Xiao-Dong Xu ◽  
Jia-Yin Guan ◽  
Zi-Yi Zhang ◽  
Yu-Rou Cao ◽  
Yin-Yin Cai ◽  
...  

We determined 15 complete and two nearly complete mitogenomes of Heptageniidae belonging to three subfamilies (Heptageniinae, Rhithrogeninae, and Ecdyonurinae) and six genera (Afronurus, Epeorus, Leucrocuta, Maccaffertium, Stenacron, and Stenonema). Species of Rhithrogeninae and Ecdyonurinae had the same gene rearrangement of CR-I-M-Q-M-ND2, whereas a novel gene rearrangement of CR-I-M-Q-NCR-ND2 was found in Heptageniinae. Non-coding regions (NCRs) of 25–47 bp located between trnA and trnR were observed in all mayflies of Heptageniidae, which may be a synapomorphy for Heptageniidae. Both the BI and ML phylogenetic analyses supported the monophyly of Heptageniidae and its subfamilies (Heptageniinae, Rhithrogeninae, and Ecdyonurinae). The phylogenetic results combined with gene rearrangements and NCR locations confirmed the relationship of the subfamilies as (Heptageniinae + (Rhithrogeninae + Ecdyonurinae)). To assess the effects of low-temperature stress on Heptageniidae species from Ottawa, Canada, we applied the branch-site model and found 27 positively selected sites in eight protein-coding genes (PCGs). The selection pressure analyses suggested that mitochondrial PCGs underwent positive selection to meet the energy requirements under low-temperature stress.


Genetics ◽  
2000 ◽  
Vol 154 (3) ◽  
pp. 1231-1238 ◽  
Author(s):  
David J Begun ◽  
Penn Whitley

Abstract NF-κB and IκB proteins have central roles in regulation of inflammation and innate immunity in mammals. Homologues of these proteins also play an important role in regulation of the Drosophila immune response. Here we present a molecular population genetic analysis of Relish, a Drosophila NF-κB/IκB protein, in Drosophila simulans and D. melanogaster. We find strong evidence for adaptive protein evolution in D. simulans, but not in D. melanogaster. The adaptive evolution appears to be restricted to the IκB domain. A possible explanation for these results is that Relish is a site of evolutionary conflict between flies and their microbial pathogens.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Fan Li ◽  
Yunyun Lv ◽  
Zhengyong Wen ◽  
Chao Bian ◽  
Xinhui Zhang ◽  
...  

Abstract Background Although almost all extant spider species live in terrestrial environments, a few species live fully submerged in freshwater or seawater. The intertidal spiders (genus Desis), which build silk nests within coral crevices, can survive submerged during high tides. The diving bell spider, Argyroneta aquatica, resides in a similarly dynamic environment but exclusively in freshwater. Given the pivotal role played by mitochondria in supplying most energy for physiological activity via oxidative phosphorylation and the environment, herein we sequenced the complete mitogenome of Desis jiaxiangi to investigate the adaptive evolution of the aquatic spider mitogenomes and the evolution of spiders. Results We assembled a complete mitogenome of the intertidal spider Desis jiaxiangi and performed comparative mitochondrial analyses of a data set comprising Desis jiaxiangi and 45 other previously published spider mitogenome sequences, including that of Argyroneta aquatica. We found a unique transposition of trnL2 and trnN genes in Desis jiaxiangi. Our robust phylogenetic topology clearly deciphered the evolutionary relationships between Desis jiaxiangi and Argyroneta aquatica as well as other spiders. We dated the divergence of Desis jiaxiangi and Argyroneta aquatica to the late Cretaceous at ~ 98 Ma. Our selection analyses detected a positive selection signal in the nd4 gene of the aquatic branch comprising both Desis jiaxiangi and Argyroneta aquatica. Surprisingly, Pirata subpiraticus, Hypochilus thorelli, and Argyroneta aquatica each had a higher Ka/Ks value in the 13 PCGs dataset among 46 taxa with complete mitogenomes, and these three species also showed a positive selection signal in the nd6 gene. Conclusions Our finding of the unique transposition of trnL2 and trnN genes indicates that these genes may have experienced rearrangements in the history of intertidal spider evolution. 
The positive selection signals in the nd4 and nd6 genes might enable a better understanding of the spider metabolic adaptations in relation to different environments. Our construction of a novel mitogenome for the intertidal spider thus sheds light on the evolutionary history of spiders and their mitogenomes.


1989 ◽  
Vol 9 (2) ◽  
pp. 726-738
Author(s):  
M L Peterson ◽  
R P Perry

The relative abundance of the mRNAs encoding the membrane (mu m) and secreted (mu s) forms of immunoglobulin mu heavy chain is regulated during B-cell maturation by a change in the mode of RNA processing. Current models to explain this regulation involve either competition between cleavage-polyadenylation at the proximal (mu s) poly(A) site and cleavage-polyadenylation at the distal (mu m) poly(A) site [poly(A) site model] or competition between cleavage-polyadenylation at the mu s poly(A) site and splicing of the C mu 4 and M1 exons, which eliminates the mu s site (mu s site-splice model). To test certain predictions of these models and to determine whether there is a unique structural feature of the mu s poly(A) site that is essential for regulation, we constructed modified mu genes in which the mu s or mu m poly(A) site was replaced by other poly(A) sites and then studied the transient expression of these genes in cells representative of both early- and late-stage lymphocytes. Substitutions at the mu s site dramatically altered the relative usage of this site and caused corresponding reciprocal changes in the usage of the mu m site. Despite these changes, use of the proximal site was still usually higher in plasmacytomas than in pre-B cells, indicating that regulation does not depend on a unique feature of the mu s poly(A) site. Replacement of the distal (mu m) site had no detectable effect on the usage of the mu s site in either plasmacytomas or pre-B cells. These findings are inconsistent with the poly(A) site model. In addition, we noted that in a wide variety of organisms, the sequence at the 5' splice junction of the C mu 4-to-M1 intron is significantly different from the consensus 5' splice junction sequence and is therefore suboptimal with respect to its complementary base pairing with U1 small nuclear RNA. 
When we mutated this suboptimal sequence into the consensus sequence, the mu mRNA production in plasmacytoma cells was shifted from predominantly mu s to exclusively mu m. This result unequivocally demonstrated that splicing of the C mu 4-to-M1 exon is in competition with usage of the mu s poly(A) site. A key feature of this regulatory phenomenon appears to be the appropriately balanced efficiencies of these two processing reactions. Consistent with predictions of the mu s site-splice model, B cells were found to contain mu m precursor RNA that had undergone the C mu 4-to-M1 splice but had not yet been polyadenylated at the mu m site.

