scholarly journals Evolutionary forces affecting synonymous variations in plant genomes

2016 ◽  
Author(s):  
Yves Clément ◽  
Gautier Sarah ◽  
Yan Holtz ◽  
Felix Homa ◽  
Stéphanie Pointet ◽  
...  

AbstractBase composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow to jointly estimating the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic datasets based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones.

PeerJ ◽  
2015 ◽  
Vol 3 ◽  
pp. e1520 ◽  
Author(s):  
Marco Gerdol ◽  
Gianluca De Moro ◽  
Paola Venier ◽  
Alberto Pallavicini

Synonymous codon usage bias (CUB) is a defined as the non-random usage of codons encoding the same amino acid across different genomes. This phenomenon is common to all organisms and the real weight of the many factors involved in its shaping still remains to be fully determined. So far, relatively little attention has been put in the analysis of CUB in bivalve mollusks due to the limited genomic data available. Taking advantage of the massive sequence data generated from next generation sequencing projects, we explored codon preferences in 64 different species pertaining to the six major evolutionary lineages in Bivalvia. We detected remarkable differences across species, which are only partially dependent on phylogeny. While the intensity of CUB is mild in most organisms, a heterogeneous group of species (including Arcida and Mytilida, among the others) display higher bias and a strong preference for AT-ending codons. We show that the relative strength and direction of mutational bias, selection for translational efficiency and for translational accuracy contribute to the establishment of synonymous codon usage in bivalves. Although many aspects underlying bivalve CUB still remain obscure, we provide for the first time an overview of this phenomenon in this large, commercially and environmentally important, class of marine invertebrates.


2012 ◽  
Vol 60 (5) ◽  
pp. 461 ◽  
Author(s):  
Yuerong Zhang ◽  
Xiaojun Nie ◽  
Xiaoou Jia ◽  
Cunzhen Zhao ◽  
Siddanagouda S. Biradar ◽  
...  

Codon usage patterns of 23 Poaceae chloroplast genomes were analysed in this study. Neutrality analysis indicated that the codon usage patterns have significant correlations with GC12 and GC3 and also showed strong bias towards a high representation of NNA and NNT codons. The Nc-plot showed that although a large proportion of points follow the parabolic line of trajectory, several genes with low ENc values lie below the expected curve, suggesting that mutational bias played a major role in the codon biology of the Poaceae chloroplast genome. Parity Rule 2 plot analysis showed that T was used more frequently than A in all the genomes. Correspondence analysis of relative synonymous codon usage indicated that the first axis explained only a partial amount of variation of codon usage. Furthermore, the gene length and expression level were also found to drive codon usage variation. These findings revealed that besides natural selection, other factors might also exert some influences in shaping the codon usage bias in Poaceae chloroplast genomes. The optimal codons of these 23 genomes were also identified in this study.


Microbiology ◽  
2003 ◽  
Vol 149 (9) ◽  
pp. 2585-2596 ◽  
Author(s):  
Joshua T. Herbeck ◽  
Dennis P. Wall ◽  
Jennifer J. Wernegreen

Wigglesworthia glossinidia brevipalpis, the obligate bacterial endosymbiont of the tsetse fly Glossina brevipalpis, is characterized by extreme genome reduction and AT nucleotide composition bias. Here, multivariate statistical analyses are used to test the hypothesis that mutational bias and genetic drift shape synonymous codon usage and amino acid usage of Wigglesworthia. The results show that synonymous codon usage patterns vary little across the genome and do not distinguish genes of putative high and low expression levels, thus indicating a lack of translational selection. Extreme AT composition bias across the genome also drives relative amino acid usage, but predicted high-expression genes (ribosomal proteins and chaperonins) use GC-rich amino acids more frequently than do low-expression genes. The levels and configuration of amino acid differences between Wigglesworthia and Escherichia coli were compared to test the hypothesis that the relatively GC-rich amino acid profiles of high-expression genes reflect greater amino acid conservation at these loci. This hypothesis is supported by reduced levels of protein divergence at predicted high-expression Wigglesworthia genes and similar configurations of amino acid changes across expression categories. Combined, the results suggest that codon and amino acid usage in the Wigglesworthia genome reflect a strong AT mutational bias and elevated levels of genetic drift, consistent with expected effects of an endosymbiotic lifestyle and repeated population bottlenecks. However, these impacts of mutation and drift are apparently attenuated by selection on amino acid composition at high-expression genes.


2020 ◽  
Vol 21 (11) ◽  
Author(s):  
Redi Aditama ◽  
Zulfikar Achmad Tanjung ◽  
Widyartini Made Sudania ◽  
Yogo Adhi Nugroho ◽  
Condro Utomo ◽  
...  

Abstract. Aditama R, Tanjung ZA, Sudania WM, Nugroho YA, Utomo C, Liwang T. 2020. Analysis of codon usage bias reveals optimal codons in Elaeis guineensis. Biodiversitas 21: 5331-5337. Codon usage bias of oil palm genome was reported employing several indices, including GC content, relative synonymous codon usage (RSCU), the effective number of codons (ENC), and codon adaptation index (CAI). Unimodal distribution of GC content was observed and matched with non-grass monocots characteristics. Correspondence analysis (COA) on synonymous codon usage bias showed that the main axis was strongly driven by GC content. The ENC and neutrality plot of oil palm genes indicating that natural selection played more vital role compared to mutational bias on shaping codon usage bias. A positive correlation between calculated CAI and experimental data of oil palm gene expression was detected indicating good ability of this index. Finally, eighteen codons were defined as “optimal codons” that may provide a useful reference for heterogeneous expression and genome editing studies.


PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3037 ◽  
Author(s):  
Wei Qing Kong ◽  
Jin Hong Yang

Trees in the Morus genera belong to the Moraceae family. To better understand the species status of genus Morus and to provide information for studies on evolutionary biology within the genus, the complete chloroplast (cp) genomes of M. cathayana and M. multicaulis were sequenced. The plastomes of the two species are 159,265 bp and 159,103 bp, respectively, with corresponding 83 and 82 simple sequence repeats (SSRs). Similar to the SSRs of M. mongolica and M. indica cp genomes, more than 70% are mononucleotides, ten are in coding regions, and one exhibits nucleotide content polymorphism. Results for codon usage and relative synonymous codon usage show a strong bias towards NNA and NNT codons in the two cp genomes. Analysis of a plot of the effective number of codons (ENc) for five Morus spp. cp genomes showed that most genes follow the standard curve, but several genes have ENc values below the expected curve. The results indicate that both natural selection and mutational bias have contributed to the codon bias. Ten highly variable regions were identified among the five Morus spp. cp genomes, and 154 single-nucleotide polymorphism mutation events were accurately located in the gene coding region.


2009 ◽  
Vol 2 (3) ◽  
pp. 133-141
Author(s):  
Tangjie Zhang ◽  
Hong Chang ◽  
Yuzhi Liu ◽  
Huifang Li ◽  
Kuanwei Chen

Codon usage in mitochondrial genes of 11 Gallus gallus and two Anatidae species was analysed to determine the general patterns in codon choice of Callus gallus species. C3 contents were higher in Gallus gallus than in mammalian mitochondrial genomes that encode protein codon positions. The high C3 contents of Callus gallus might be the result of relatively strong mutational bias that occurred in the lineage of the Callus gallus species. A and C ending codons were detected as the “preferred 77 codons in Callus gallus and Anatidae. The NNR codon families are dominated by the A-ending codons, the NNY codon families are dominated by the C-ending codons and the NNN codon families are dominated by the A-ending or the C-ending codons. A comparison of the relative synonymous codon usage (RSCU) and synonymous codon families (SCF) of tRNA and proteins was made, and two groups can be classified by SCF. The codon usage in Callus gallus species indicates that codons containing A or C at the third position are used preferentially, regardless of whether corresponding tRNAs are encoded in the mtDNA. In both Callus gallus and Anatidae species mtDNA, codon usage biases are highly related to CC-ending binucleotide condons.


2019 ◽  
Author(s):  
Abigail L. Labella ◽  
Dana A. Opulente ◽  
Jacob L. Steenwyk ◽  
Chris Todd Hittinger ◽  
Antonis Rokas

AbstractVariation in synonymous codon usage is abundant across multiple levels of organization: between codons of an amino acid, between genes in a genome, and between genomes of different species. It is now well understood that variation in synonymous codon usage is influenced by mutational bias coupled with both natural selection for translational efficiency and genetic drift, but how these processes shape patterns of codon usage bias across entire lineages remains unexplored. To address this question, we used a rich genomic data set of 327 species that covers nearly one third of the known biodiversity of the budding yeast subphylum Saccharomycotina. We found that, while genome-wide relative synonymous codon usage (RSCU) for all codons was highly correlated with the GC content of the third codon position (GC3), the usage of codons for the amino acids proline, arginine, and glycine was inconsistent with the neutral expectation where mutational bias coupled with genetic drift drive codon usage. Examination between genes’ effective numbers of codons and their GC3 contents in individual genomes revealed that nearly a quarter of genes (381,174/1,683,203; 23%), as well as most genomes (308/327; 94%), significantly deviate from the neutral expectation. Finally, by evaluating the imprint of translational selection on codon usage, measured as the degree to which genes’ adaptiveness to the tRNA pool were correlated with selective pressure, we show that translational selection is widespread in budding yeast genomes (264/327; 81%). These results suggest that the contribution of translational selection and drift to patterns of synonymous codon usage across budding yeasts varies across codons, genes, and genomes; whereas drift is the primary driver of global codon usage across the subphylum, the codon bias of large numbers of genes in the majority of genomes is influenced by translational selection.Lay Summary / Significance statementSynonymous mutations in genes have no effect on the encoded proteins and were once thought to be evolutionarily neutral. By examining codon usage bias across codons, genes, and genomes of 327 species in the budding yeast subphylum, we show that synonymous codon usage is shaped by both neutral processes and selection for translational efficiency. Specifically, whereas codon usage bias for most codons appears to be strongly associated with mutational bias and largely driven by genetic drift across the entire subphylum, patterns of codon usage bias in a few codons, as well as in many genes in nearly all genomes of budding yeasts, deviate from neutral expectations. Rather, the synonymous codons used within genes in most budding yeast genomes are adapted to the tRNAs present within each genome, a result most likely due to translational selection that optimizes codons to match the tRNAs. Our results suggest that patterns of codon usage bias in budding yeasts, and perhaps more broadly in fungi and other microbial eukaryotes, are shaped by both neutral and selective processes.


Viruses ◽  
2021 ◽  
Vol 13 (9) ◽  
pp. 1800
Author(s):  
Josquin Daron ◽  
Ignacio Bravo

The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.


2010 ◽  
Vol 2010 ◽  
pp. 1-9 ◽  
Author(s):  
Yi-Yuong Hsiao ◽  
Chorng-Horng Lin ◽  
Jong-Kang Liu ◽  
Tit-Yee Wong ◽  
Jimmy Kuo

We have analyzed synonymous codon usage in the genome ofA. tamarenseCCMP 1598 for protein-coding sequences from 10865 expressed sequence tags (ESTs). We reconstructed a total of 4284 unigenes, including 74 ribosomal protein and 40 plastid-related genes, from ESTs using FrameDP, an open reading frame (ORF) prediction program. Correspondence analysis ofA. tamarensegenes based on codon usage showed that the GC content at the third base of synonymous codons (GC3s) was strongly correlated with the first axis (r=0.93withP<.001). On the other hand, the second axis discriminated between presumed highly and low expressed genes, with expression levels being confirmed by the analysis of EST frequencies (r=−0.89withP<.001). Our results suggest that mutational bias is the major factor in shaping codon usage inA. tamarensegenome, but other factors, namely, translational selection, hydropathy, and aromaticity, also appear to influence the selection of codon usage in this species.


Sign in / Sign up

Export Citation Format

Share Document