The Mutational Robustness of the Genetic Code and Codon Usage in Environmental Context: A Non-Extremophilic Preference?

The genetic code was evolved, to some extent, to minimize the effects of mutations. The effects of mutations depend on the amino acid repertoire, the structure of the genetic code and frequencies of amino acids in proteomes. The amino acid compositions of proteins and corresponding codon usages are still under selection, which allows us to ask what kind of environment the standard genetic code is adapted to. Using simple computational models and comprehensive datasets comprising genomic and environmental data from all three domains of Life, we estimate the expected severity of non-synonymous genomic mutations in proteins, measured by the change in amino acid physicochemical properties. We show that the fidelity in these physicochemical properties is expected to deteriorate with extremophilic codon usages, especially in thermophiles. These findings suggest that the genetic code performs better under non-extremophilic conditions, which not only explains the low substitution rates encountered in halophiles and thermophiles but the revealed relationship between the genetic code and habitat allows us to ponder on earlier phases in the history of Life.

Download Full-text

Did Amino Acid Side Chain Reactivity Dictate the Composition and Timing of Aminoacyl-tRNA Synthetase Evolution?

Genes ◽

10.3390/genes12030409 ◽

2021 ◽

Vol 12 (3) ◽

pp. 409

Author(s):

Tamara L. Hendrickson ◽

Whitney N. Wood ◽

Udumbara M. Rathnayake

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Chemical Reactivity ◽

Trna Synthetase ◽

Amino Acid Side Chain ◽

Standard Genetic Code ◽

Last Universal Common Ancestor ◽

Trna Synthetases ◽

Universal Common Ancestor

The twenty amino acids in the standard genetic code were fixed prior to the last universal common ancestor (LUCA). Factors that guided this selection included establishment of pathways for their metabolic synthesis and the concomitant fixation of substrate specificities in the emerging aminoacyl-tRNA synthetases (aaRSs). In this conceptual paper, we propose that the chemical reactivity of some amino acid side chains (e.g., lysine, cysteine, homocysteine, ornithine, homoserine, and selenocysteine) delayed or prohibited the emergence of the corresponding aaRSs and helped define the amino acids in the standard genetic code. We also consider the possibility that amino acid chemistry delayed the emergence of the glutaminyl- and asparaginyl-tRNA synthetases, neither of which are ubiquitous in extant organisms. We argue that fundamental chemical principles played critical roles in fixation of some aspects of the genetic code pre- and post-LUCA.

Download Full-text

Phylogenetic analysis of mutational robustness based on codon usage supports that the standard genetic code does not prefer extreme environments

Scientific Reports ◽

10.1038/s41598-021-90440-y ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Ádám Radványi ◽

Ádám Kun

Keyword(s):

Physicochemical Properties ◽

Codon Usage ◽

Genetic Code ◽

Biological Diversity ◽

Phylogenetic Analyses ◽

Extreme Environments ◽

Gc Content ◽

Standard Genetic Code ◽

Mutational Robustness ◽

Synonymous Mutations

AbstractThe mutational robustness of the genetic code is rarely discussed in the context of biological diversity, such as codon usage and related factors, often considered as independent of the actual organism’s proteome. Here we put the living beings back to picture and use distortion as a metric of mutational robustness. Distortion estimates the expected severities of non-synonymous mutations measuring it by amino acid physicochemical properties and weighting for codon usage. Using the biological variance of codon frequencies, we interpret the mutational robustness of the standard genetic code with regards to their corresponding environments and genomic compositions (GC-content). Employing phylogenetic analyses, we show that coding fidelity in physicochemical properties can deteriorate with codon usages adapted to extreme environments and these putative effects are not the artefacts of phylogenetic bias. High temperature environments select for codon usages with decreased mutational robustness of hydrophobic, volumetric, and isoelectric properties. Selection at high saline concentrations also leads to reduced fidelity in polar and isoelectric patterns. These show that the genetic code performs best with mesophilic codon usages, strengthening the view that LUCA or its ancestors preferred lower temperature environments. Taxonomic implications, such as rooting the tree of life, are also discussed.

Download Full-text

The Combinatorial Fusion Cascade to Generate the Standard Genetic Code

Life ◽

10.3390/life11090975 ◽

2021 ◽

Vol 11 (9) ◽

pp. 975

Author(s):

Alexander Nesterov-Mueller ◽

Roman Popov

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Temporal Order ◽

Prebiotic Chemistry ◽

Standard Genetic Code ◽

The Road ◽

Novel Technology ◽

Three Stages ◽

Combinatorial Fusion

Combinatorial fusion cascade was proposed as a transition stage between prebiotic chemistry and early forms of life. The combinatorial fusion cascade consists of three stages: eight initial complimentary pairs of amino acids, four protocodes, and the standard genetic code. The initial complimentary pairs and the protocodes are divided into dominant and recessive entities. The transitions between these stages obey the same combinatorial fusion rules for all amino acids. The combinatorial fusion cascade mathematically describes the codon assignments in the standard genetic code. It explains the availability of amino acids with the even and odd numbers of codons, the appearance of stop codons, inclusion of novel canonical amino acids, exceptional high numbers of codons for amino acids arginine, leucine, and serine, and the temporal order of amino acid inclusion into the genetic code. The temporal order of amino acids within the cascade is congruent with the consensus temporal order previously derived from the similarities between the available hypotheses. The control over the combinatorial fusion cascades would open the road for a novel technology to develop artificial microorganisms.

Download Full-text

Genetic code: Chemical Distinctions of Protein Amino Acids

10.31219/osf.io/86rjt ◽

2017 ◽

Author(s):

Miloje M. Rakocevic

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Standard Genetic Code ◽

Atom Number ◽

Canonical Bases ◽

Ordinal Numbers ◽

Intelligent Structures ◽

Protein Amino Acids

In the work it is shown that 20 protein amino acids ("the canonical amino acids" within the genetic code) appear to be a whole and very symmetrical system, in many ways, all based on strict chemical distinctions from the aspect of their similarity, complexity, stereochemical and diversity types. By this, all distinctions are accompanied by specific arithmetical and algebraic regularities, including the existence of amino acid ordinal numbers from 1 to 20. The classification of amino acids into two decades (1-10 and 11-20) appears to be in a strict correspondence with the atom number balances. From the presented "ideal" and "intelligent" structures and arrangements follow the conclusions that the genetic code was complete even in prebiotic conditions (as a set of 20 canonical amino acids and the set of 2+2 pyrimidine / purine canonical bases, respectively); and the notion "evolution" of the genetic code can only mean the degree of freedom of standard genetic code, i.e. the possible exceptions and deviations from the standard genetic code. [This is the second version with minimal interventions in the text. In addition, one passage was added in front of the second star, with quoting of T. Jukes. Added is Remark 4 and a more adequate shading in the Table inside Box 2.]

Download Full-text

On the Importance of Asymmetry in the Phenotypic Expression of the Genetic Code upon the Molecular Evolution of Proteins

Symmetry ◽

10.3390/sym12060997 ◽

2020 ◽

Vol 12 (6) ◽

pp. 997

Author(s):

Marco V. José ◽

Gabriel S. Zamudio

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Phenotypic Expression ◽

Standard Genetic Code ◽

Specific Amino Acid ◽

Trna Synthetases ◽

Probability Of Occurrence ◽

Genetic Codes ◽

Evolution Of Proteins

The standard genetic code (SGC) is a mapping between the 64 possible arrangements of the four RNA nucleotides (C, A, U, G) into triplets or codons, where 61 codons are assigned to a specific amino acid and the other three are stop codons for terminating protein synthesis. Aminoacyl-tRNA synthetases (aaRSs) are responsible for implementing the SGC by specifically amino-acylating only its cognate transfer RNA (tRNA), thereby linking an amino acid with its corresponding anticodon triplets. tRNAs molecules bind each codon with its anticodon. To understand the meaning of symmetrical/asymmetrical properties of the SGC, we designed synthetic genetic codes with known symmetries and with the same degeneracy of the SGC. We determined their impact on the substitution rates for each amino acid under a neutral model of protein evolution. We prove that the phenotypic graphs of the SGC for codons and anticodons for all the possible arrangements of nucleotides are asymmetric and the amino acids do not form orbits. In the symmetrical synthetic codes, the amino acids are grouped according to their codonicity, this is the number of triplets encoding a given amino acid. Both the SGC and symmetrical synthetic codes exhibit a probability of occurrence of the amino acids proportional to their degeneracy. Unlike the SGC, the synthetic codes display a constant probability of occurrence of the amino acid according to their codonicity. The asymmetry of the phenotypic graphs of codons and anticodons of the SGC, has important implications on the evolutionary processes of proteins.

Download Full-text

Genetic code: Chemical Distinctions of Protein Amino Acids

10.31227/osf.io/qb2p5 ◽

2017 ◽

Author(s):

Miloje M. Rakočević

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Standard Genetic Code ◽

Atom Number ◽

Canonical Bases ◽

Ordinal Numbers ◽

Intelligent Structures ◽

Protein Amino Acids

In the work it is shown that 20 protein amino acids ("the canonical amino acids" within the genetic code) appear to be a whole and very symmetrical system, in many ways, all based on strict chemical distinctions from the aspect of their similarity, complexity, stereochemical and diversity types. By this, all distinctions are accompanied by specific arithmetical and algebraic regularities, including the existence of amino acid ordinal numbers from 1 to 20. The classification of amino acids into two decades (1-10 and 11-20) appears to be in a strict correspondence with the atom number balances. From the presented "ideal" and "intelligent" structures and arrangements follow the conclusions that the genetic code was complete even in prebiotic conditions (as a set of 20 canonical amino acids and the set of 2+2 pyrimidine / purine canonical bases, respectively); and the "evolution" of the genetic code can only mean the degree of freedom of standard genetic code, i.e. the possible exceptions and deviations from the standard genetic code.

Download Full-text

Optimization of Expanded Genetic Codes via Genetic Algorithms

10.5753/eniac.2018.4440 ◽

2018 ◽

Author(s):

Maísa de Carvalho Silva ◽

Lariza Laura De Oliveira ◽

Renato Tinós

Keyword(s):

Amino Acids ◽

Genetic Algorithms ◽

Amino Acid ◽

Genetic Code ◽

Unnatural Amino Acids ◽

Genetically Modified Organisms ◽

Fitness Function ◽

Unnatural Amino Acid ◽

Standard Genetic Code ◽

Genetic Codes

In the last decades, researchers have proposed the use of genetically modified organisms that utilize unnatural amino acids, i.e., amino acids other than the 20 amino acids encoded in the standard genetic code. Unnatural amino acids have been incorporated into genetically engineered organisms for the development of new drugs, fuels and chemicals. When new amino acids are incorporated, it is necessary to modify the standard genetic code. Expanded genetic codes have been created without considering the robustness of the code. The objective of this work is the use of genetic algorithms (GAs) for the optimization of expanded genetic codes. The GA indicates which codons of the standard genetic code should be used to encode a new unnatural amino acid. The fitness function has two terms; one for robustness of the new code and another that takes into account the frequency of use of amino acids. Experiments show that, by controlling the weighting between the two terms, it is possible to obtain more or less amino acid substitutions at the same time that the robustness is minimized.

Download Full-text

Computational design of genes encoding completely overlapping protein domains: Influence of genetic code and taxonomic rank

10.1101/2020.09.25.312959 ◽

2020 ◽

Author(s):

Stefan Wichmann ◽

Siegfried Scherer ◽

Zachary Ardern

Keyword(s):

Amino Acid ◽

Genetic Code ◽

Prokaryotic Genome ◽

Computational Design ◽

Protein Domain ◽

Separate Analysis ◽

Overlapping Genes ◽

Standard Genetic Code ◽

Mutational Robustness ◽

Eukaryotic Genes

AbstractOverlapping genes (OLGs) with long protein-coding overlapping sequences are often excluded by genome annotation programs, with the exception of virus genomes. A recent study used a novel algorithm to construct OLGs from arbitrary protein domain pairs and concluded that virus genes are best suited for creating OLGs, a result which fitted with common assumptions. However, improving sequence evaluation using Hidden Markov Models shows that the previous result is an artifact originating from dataset-database biases. When parameters for OLG design and evaluation are optimized we find that 94.5% of the constructed OLG pairs score at least as highly as naturally occurring sequences, while 9.6% of the artificial OLGs cannot be distinguished from typical sequences in their protein family. Constructed OLG sequences are also indistinguishable from natural sequences in terms of amino acid identity and secondary structure, while the minimum nucleotide change required for overprinting an overlapping sequence can be as low as 1.8% of the sequence. Separate analysis of datasets containing only sequences from either archaea, bacteria, eukaryotes or viruses showed that, surprisingly, virus genes are much less suitable for designing OLGs than bacterial or eukaryotic genes. An important factor influencing OLG design is the structure of the standard genetic code. Success rates in different reading frames strongly correlate with their code-determined respective amino acid constraints. There is a tendency indicating that the structure of the standard genetic code could be optimized in its ability to create OLGs while conserving mutational robustness. The findings reported here add to the growing evidence that OLGs should no longer be excluded in prokaryotic genome annotations. Determining the factors facilitating the computational design of artificial overlapping genes may improve our understanding of the origin of these remarkable genetic constructs and may also open up exciting possibilities for synthetic biology.

Download Full-text

Plasma-Free Amino Acid Profiling of Nasal Polyposis Patients

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207322666190920110324 ◽

2020 ◽

Vol 22 (9) ◽

pp. 657-662 ◽

Cited By ~ 1

Author(s):

Mustafa Celik ◽

Alper Şen ◽

İsmail Koyuncu ◽

Ataman Gönel

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Free Amino Acid ◽

Free Amino ◽

Nasal Polyposis ◽

Amino Acid Profile ◽

Healthy Control Group ◽

Control Group ◽

Healthy Control ◽

History Of

Aim and Objective:: To determine the mechanisms present in the etiopathogenesis of nasal polyposis. It is not clear whether amino acids contribute in a causal way to the development of the disease. Therefore, the aim of this study was to determine the plasma-free amino acid profile in patients with nasal polyposis and to compare the results with a healthy control group. Materials and Methods:: This was a prospective controlled study that took place in the Otolaryngology Department at the Harran University Faculty of Medicine between April 2017 and April 2018. Plasmafree amino acid profile levels were studied in serum samples taken from a patient group and a healthy control group. Patients who were diagnosed with bilateral diffuse nasal polyposis and were scheduled for surgical interventions were included in this study. Individuals whose age, gender, and body mass index values were compatible with that of the patient group and who did not have any health problems were included in the control group. All the participants whose levels of plasma-free amino acid were thought to be affected by one or more of the following factors were excluded from the study: smoking and alcohol use, allergic rhinitis presence, the presence of acute or chronic sinusitis, a history of endoscopic sinus surgery, unilateral nasal masses, a history of chronic drug use, systemic or topical steroid use in the last three months for any reason, and liver, kidney, hematological, cardiovascular, metabolic, neurological, or psychiatric disorders or malignancies. Results: In patients with nasal polyposis, 3-methyl histidine (3-MHIS: nasal polyposis group (ng) = 3.22 (1.92 – 6.07); control group (cg) = 1.21 (0.77 – 1.68); p = 0.001); arginine (arg: ng = 98.95 (70.81 – 117.75); cg = 75.10 (54.49 – 79.88); p = 0.005); asparagine (asn: ng = 79.84 (57.50 – 101.44); cg = 60.66 (46.39 – 74.62); p = 0.021); citrulline (cit: ng = 51.83 (43.81 – 59.78); cg = 38.33 (27.81 – 53.73); p = 0.038); cystine (cys: ng = 4.29 (2.43 – 6.66); cg = 2.41 (1.51 – 4.16); p = 0.019); glutamic acid (glu: ng = 234.86 (128.75 – 286.66); cg = 152.37 (122.51 – 188.34); p = 0.045); histidine (his: ng = 94.19 (79.34 – 113.99); cg = 74.80 (62.76 – 98.91); p = 0.018); lysine (lys: ng = 297.22 (206.55 – 371.25); cg = 179.50 (151.58 – 238.02); p = 0.001); ornithine (ng = 160.62 (128.36 – 189.32); cg = 115.91 (97.03 – 159.91); p = 0.019); serine (ser: ng = 195.15 (151.58 – 253.07); cg = 83.07 (67.44 – 92.44); p = 0.001); taurine (tau: ng = 74.69 (47.00 – 112.13); cg = 53.14 (33.57 – 67.31); p = 0.006); tryptophan (trp: ng = 52.31 (33.81 – 80.11); cg = 34.44 (25.94 – 43.07); p = 0.005), homocitrulline (ng = 1.75 (1.27 – 2.59); cg = 0.00 (0.00 – 0.53); p = 0.001); norvaline (ng = 6.90 (5.61 – 9.18); cg = 4.93 (3.74 – 7.13); p = 0.021); argininosuccinic acid (ng = 14.33 (10.06 – 25.65); cg = 12.22 (5.77 – 16.87) p = 0.046); and plasma concentrations were significantly higher than in the healthy control group (p <0.05). However, the gamma-aminobutyric acid (gaba: ng = 0.16 (0.10 – 0.24); cg = 0.21 (0.19 – 0.29); p = 0.010) plasma concentration was significantly lower in the nasal polyposis group than in the healthy control group. Conclusion: In this study, plasma levels of 15 free amino acids were significantly higher in the nasal polyposis group than in the healthy control group. A plasma level of 1 free amino acid was found to be significantly lower in the nasal polyposis group compared to the healthy control group. Therefore, it is important to determine the possibility of using the information obtained to prevent the recurrence of the condition and to develop effective treatment strategies. This study may be a milestone for studies of this subject. However, this study needs to be confirmed by further studies conducted in a larger series.

Download Full-text

Transferability of N-terminal mutations of pyrrolysyl-tRNA synthetase in one species to that in another species on unnatural amino acid incorporation efficiency

Amino Acids ◽

10.1007/s00726-020-02927-z ◽

2020 ◽

Author(s):

Thomas L. Williams ◽

Debra J. Iskandar ◽

Alexander R. Nödling ◽

Yurong Tan ◽

Louis Y. P. Luk ◽

...

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Unnatural Amino Acids ◽

Trna Synthetase ◽

Unnatural Amino Acid ◽

Amino Acid Incorporation ◽

Acid Incorporation ◽

Genetic Code Expansion ◽

Incorporation Efficiency

AbstractGenetic code expansion is a powerful technique for site-specific incorporation of an unnatural amino acid into a protein of interest. This technique relies on an orthogonal aminoacyl-tRNA synthetase/tRNA pair and has enabled incorporation of over 100 different unnatural amino acids into ribosomally synthesized proteins in cells. Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA from Methanosarcina species are arguably the most widely used orthogonal pair. Here, we investigated whether beneficial effect in unnatural amino acid incorporation caused by N-terminal mutations in PylRS of one species is transferable to PylRS of another species. It was shown that conserved mutations on the N-terminal domain of MmPylRS improved the unnatural amino acid incorporation efficiency up to five folds. As MbPylRS shares high sequence identity to MmPylRS, and the two homologs are often used interchangeably, we examined incorporation of five unnatural amino acids by four MbPylRS variants at two temperatures. Our results indicate that the beneficial N-terminal mutations in MmPylRS did not improve unnatural amino acid incorporation efficiency by MbPylRS. Knowledge from this work contributes to our understanding of PylRS homologs which are needed to improve the technique of genetic code expansion in the future.

Download Full-text