scholarly journals A comparison of one-rate and two-rate inference frameworks for site-specific dN/dS estimation

2015 ◽  
Author(s):  
Stephanie J. Spielman ◽  
Suyang Wan ◽  
Claus O. Wilke

AbstractTwo broad paradigms exist for inferring dN/dS, the ratio of nonsynonymous to synonymous substitution rates, from coding sequences: i) a one-rate approach, where dN/dS is represented with a single parameter, or ii) a two-rate approach, where dN and dS are estimated separately. The performances of these two approaches have been well-studied in the specific context of proper model specification, i.e. when the inference model matches the simulation model. By contrast, the relative performances of one-rate vs. two-rate parameterizations when applied to data generated according to a different mechanism remains unclear. Here, we compare the relative merits of one-rate and two-rate approaches in the specific context of model misspecification by simulating alignments with mutation-selection models rather than with dN/dS-based models. We find that one-rate frameworks generally infer more accurate dN/dS point estimates, even when dS varies among sites. In other words, modeling dS variation may substantially reduce accuracy of dN/dS point estimates. These results appear to depend on the selective constraint operating at a given site. In particular, for sites under strong purifying selection (dN/dS<~0.3), one-rate and two-rate models show comparable performances. However, one-rate models significantly outperform two-rate models for sites under moderate-to-weak purifying selection. We attribute this distinction to the fact that, for these more quickly evolving sites, a given substitution is more likely to be nonsynonymous than synonymous. The data will therefore be relatively enriched for nonsynonymous changes, and modeling dS contributes excessive noise to dN/dS estimates. We additionally find that high levels of divergence among sequences, rather than the number of sequences in the alignment, are more critical for obtaining precise point estimates.

2014 ◽  
Vol 2014 ◽  
pp. 1-9 ◽  
Author(s):  
Fei Liu ◽  
Xiao-Long Wei ◽  
Hao Li ◽  
Ji-Fu Wei ◽  
Yong-Qing Wang ◽  
...  

FK506 binding proteins (FKBPs) belong to immunophilins with peptidyl-prolyl isomerases (PPIases) activity. FKBP25 (also known as FKBP3) is one of the nuclear DNA-binding proteins in the FKBPs family, which plays an important role in regulating transcription and chromatin structure. The calculation of nonsynonymous and synonymous substitution rates suggested that FKBP25 undergoes purifying selection throughout the whole vertebrate evolution. Moreover, the result of site-specific tests showed that no sites were detected under positive selection. Only one PPIase domain was detected by searching FKBP25 sequences at Pfam and SMART domain databases. Mammalian FKBP25 possess exon-intron conservation, although conservation in the whole vertebrate lineage is incomplete. The result of this study suggests that the purifying selection triggers FKBP25 evolutionary history, which allows us to discover the complete role of the PPIase domain in the interaction between FKBP25 and nuclear proteins. Moreover, intron alterations during FKBP25 evolution that regulate gene splicing may be involved in the purifying selection.


2011 ◽  
Vol 101 (4) ◽  
pp. 445-450 ◽  
Author(s):  
Jinbo Wang ◽  
Abhineet M. Sharma ◽  
Siobain Duffy ◽  
Rodrigo P. P. Almeida

Grapevine leafroll-associated virus 3 (GLRaV-3; Ampelovirus, Closteroviridae), associated with grapevine leafroll disease, is an important pathogen found across all major grape-growing regions of the world. The genetic diversity of GLRaV-3 in Napa Valley, CA, was studied by sequencing 4.7 kb in the 3′ terminal region of 50 isolates obtained from Vitis vinifera ‘Merlot’. GLRaV-3 isolates were subdivided into four distinct phylogenetic clades. No evidence of positive selection was observed in the data set, although neutral selection (ratio of nonsynonymous to synonymous substitution rates = 1.1) was observed in one open reading frame (ORF 11, p4). Additionally, the four clades had variable degrees of overall nucleotide diversity. Moreover, no geographical structure among isolates was observed, and isolates belonging to different phylogenetic clades were found in distinct vineyards, with one exception. Considered with the evidence of purifying selection (i.e., against deleterious mutations), these data indicate that the population of GLRaV-3 in Napa Valley is not expanding and its effective population size is not increasing. Furthermore, research on the biological characterization of GLRaV-3 strains might provide valuable insights on the biology of this species that may have epidemiological relevance.


2016 ◽  
Author(s):  
Kousuke Hanada ◽  
Ayumi Tezuka ◽  
Masafumi Nozawa ◽  
Yutaka Suzuki ◽  
Sumio Sugano ◽  
...  

AbstractLineage-specifically duplicated genes likely contribute to the phenotypic divergence in closely related species. However, neither the frequency of duplication events nor the degree of selective pressures immediately after gene duplication is clear in the speciation process. Plants have substantially higher gene duplication rates than most other eukaryotes. Here, using Illumina short reads from Arabidopsis halleri, which has highly qualified plant genomes in close species (Brassica rapa, A. thaliana and A. lyrata), we succeeded in generating orthologous gene groups among B. rapa, A. thaliana, A. lyrata and A. halleri. The frequency of duplication events in the Arabidopsis lineage was approximately 10 times higher than the frequency inferred by comparative genomics of Arabidopsis, poplar, rice and moss. Of the currently retained genes in A. halleri, 11–24% had undergone gene duplication in the Arabidopsis lineage. To examine the degree of selective pressure for duplicated genes, we calculated the ratios of nonsynonymous to synonymous substitution rates (KA/KS) in the A. halleri-lyrata and A. halleri lineages. Using a maximum-likelihood framework, we examined positive (KA/KS > 1) and purifying selection (KA/KS < 1) at a significant level (P < 0.01). Duplicate genes tended to have a higher proportion of positive selection compared with non-duplicated genes. More interestingly, we found that functional divergence of duplicated genes was accelerated several million years after gene duplication at a higher proportion than immediately after gene duplication.


Genetics ◽  
1992 ◽  
Vol 130 (4) ◽  
pp. 855-864
Author(s):  
E N Moriyama ◽  
T Gojobori

Abstract We compared the rates of synonymous (silent) substitution among various genes in a number of species of Drosophila. First, we found that even for a particular gene, the rate of synonymous substitution varied considerably with Drosophila lineages. Second, we showed a large variation in synonymous substitution rates among nuclear genes in Drosophila. These rates of synonymous substitution were correlated negatively with C content and positively with A content at the third codon positions. Nucleotide sequences were also compared between pseudogenes and their functional homologs. The C content of the pseudogenes was lower than that of the functional genes and the A content of the former was higher than that of the latter. Because the synonymous substitution for functional genes and the nucleotide substitution for pseudogenes are exempted from any selective constraint at the protein level, these observations could be explained by a biased pattern of mutation in the Drosophila nuclear genome. Such a bias in the mutation pattern may affect the molecular clock (local clock) of each nuclear gene of each species. Finally, we obtained the average rates of synonymous substitution for three gene groups in Drosophila; 11.0 x 10(-9), 17.5 x 10(-9) and 27.1 x 10(-9)/site/year.


2021 ◽  
Vol 25 (06) ◽  
pp. 1331-1338
Author(s):  
Nadya Farah

Secreted fungal effector proteins and their host targets are good examples to understand the mechanism of host-pathogen co-evolution with genes involved in the interaction undergoing positive selection. SIX genes (secreted in xylem) are obtained via horizontal transfer and can be found within the formae speciales of Fusarium oxysporum. SIX6 and SIX9 of F. oxysporum f. spp. cubense (Foc) are predicted to play a role as effectors. However, their involvement in the pathogenicity of Foc in banana plants has not been determined yet. In the susceptible banana cultivar, we found that the SIX6 and SIX9 genes of Foc TR4 were highly expressed in roots, but not in corms or leaves. The host, however, expressed the pathogenesis-related (PR) genes, PR-1 and PR-3, in corms earlier than in the roots. Phylogenetic analysis on SIX6 and SIX9 genes of F. oxysporum has revealed the separation of SIX6 and SIX9 of Foc from other formae speciales. This leads to detecting genes under positive selection using the ratio nonsynonymous to synonymous substitution rates (Ka/Ks). SIX6 of Foc showed an increase in diversity, but insufficient to drive positive selection. Conversely, SIX9 of Foc showed no divergence in the dN/dS ratio distribution, indicating purifying selection. © 2021 Friends Science Publishers


Author(s):  
John Shaheen ◽  
Austin B Mudd ◽  
Thomas G H Diekwisch ◽  
John Abramyan

Abstract Extant anurans (frogs and toads) exhibit reduced dentition, ranging from a lack of mandibular teeth to complete edentulation, as observed in the true toads of the family Bufonidae. The evolutionary timeline of these reductions remains vague due to a poor fossil record. Previous studies have demonstrated an association between the lack of teeth in edentulous vertebrates and the pseudogenization of the major tooth enamel gene amelogenin (AMEL) through accumulation of deleterious mutations and the disruption of its coding sequence. In the present study we have harnessed the pseudogenization of AMEL as a molecular dating tool to correlate loss of dentition with genomic mutation patterns during the rise of the family Bufonidae. Specifically, we have utilized AMEL pseudogenes in three members of the family as a tool to estimate the putative date of edentulation in true toads. Comparison of AMEL sequences from Rhinella marina, Bufo gargarizans and Bufo bufo, with nine extant, dentulous frogs, revealed mutations confirming AMEL inactivation in Bufonidae. AMEL pseudogenes in modern bufonids also exhibited remarkably high 86–93% sequence identity among each other, with only a slight increase in substitution rate and relaxation of selective pressure, in comparison to functional copies in other anurans. Moreover, using selection intensity estimates and synonymous substitution rates, analysis of functional and pseudogenized AMEL resulted in an estimated inactivation window of 46-60 MYA in the lineage leading to modern true toads, a timeline that coincides with the rise of the family Bufonidae.


Genetics ◽  
1997 ◽  
Vol 146 (1) ◽  
pp. 393-399 ◽  
Author(s):  
Spencer V Muse ◽  
Brandon S Gaut

Even when several genetic loci are used in molecular evolutionary studies, each locus is typically analyzed independently of the others. This type of approach makes it difficult to study mechanisms and processes that affect multiple genes. In this work we develop a statistical approach for the joint analysis of two or more loci. The tests we propose examine whether or not nucleotide substitution rates across evolutionary lineages have the same relative proportions at two loci. Theses procedures are applied to 33 genes from the chloroplast genomes of rice, tobacco, pine, and liverwort. With the exception of five clearly distinct loci, we find that synonymous substitution rates tend to change proportionally across genes. We interpret these results to be consistent with a “lineage effect” acting on the entire chloroplast genome. In contrast, nonsynonymous rates do not change proportionally across genes, suggesting that locus-specific evolutionary effects dominate patterns of nonsynonymous substitution.


Author(s):  
Solomon T C Chak ◽  
Juan Antonio Baeza ◽  
Phillip Barden

Abstract Eusociality is a highly conspicuous and ecologically impactful behavioral syndrome that has evolved independently across multiple animal lineages. So far, comparative genomic analyses of advanced sociality have been mostly limited to insects. Here, we study the only clade of animals known to exhibit eusociality in the marine realm—lineages of socially diverse snapping shrimps in the genus Synalpheus. To investigate the molecular impact of sociality, we assembled the mitochondrial genomes of eight Synalpheus species that represent three independent origins of eusociality and analyzed patterns of molecular evolution in protein-coding genes. Synonymous substitution rates are lower and potential signals of relaxed purifying selection are higher in eusocial relative to noneusocial taxa. Our results suggest that mitochondrial genome evolution was shaped by eusociality-linked traits—extended generation times and reduced effective population sizes that are hallmarks of advanced animal societies. This is the first direct evidence of eusociality impacting genome evolution in marine taxa. Our results also strongly support the idea that eusociality can shape genome evolution through profound changes in life history and demography.


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Eugene J. Gardner ◽  
Elena Prigmore ◽  
Giuseppe Gallone ◽  
Petr Danecek ◽  
Kaitlin E. Samocha ◽  
...  

Abstract Mobile genetic Elements (MEs) are segments of DNA which can copy themselves and other transcribed sequences through the process of retrotransposition (RT). In humans several disorders have been attributed to RT, but the role of RT in severe developmental disorders (DD) has not yet been explored. Here we identify RT-derived events in 9738 exome sequenced trios with DD-affected probands. We ascertain 9 de novo MEs, 4 of which are likely causative of the patient’s symptoms (0.04%), as well as 2 de novo gene retroduplications. Beyond identifying likely diagnostic RT events, we estimate genome-wide germline ME mutation rate and selective constraint and demonstrate that coding RT events have signatures of purifying selection equivalent to those of truncating mutations. Overall, our analysis represents a comprehensive interrogation of the impact of retrotransposition on protein coding genes and a framework for future evolutionary and disease studies.


Entropy ◽  
2020 ◽  
Vol 22 (11) ◽  
pp. 1294
Author(s):  
Lijuan Huo ◽  
Jin Seo Cho

This study examined the extreme learning machine (ELM) applied to the Wald test statistic for the model specification of the conditional mean, which we call the WELM testing procedure. The omnibus test statistics available in the literature weakly converge to a Gaussian stochastic process under the null that the model is correct, and this makes their application inconvenient. By contrast, the WELM testing procedure is straightforwardly applicable when detecting model misspecification. We applied the WELM testing procedure to the sequential testing procedure formed by a set of polynomial models and estimate an approximate conditional expectation. We then conducted extensive Monte Carlo experiments to evaluate the performance of the sequential WELM testing procedure and verify that it consistently estimates the most parsimonious conditional mean when the set of polynomial models contains a correctly specified model. Otherwise, it consistently rejects all the models in the set.


Sign in / Sign up

Export Citation Format

Share Document