codon preference
Recently Published Documents


TOTAL DOCUMENTS

45
(FIVE YEARS 14)

H-INDEX

13
(FIVE YEARS 2)

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Panpan Wang ◽  
Yong Mao ◽  
Yongquan Su ◽  
Jun Wang

Abstract Background Kuruma shrimp, a major commercial shrimp species in the world, has two cryptic or sibling species, Marsupenaeus japonicus and Marsupenaeus pulchricaudatus. Codon usage analysis would contribute to our understanding of the genetic and evolutionary characteristics of the two Marsupenaeus species. In this study, we analyzed codon usage and related indices using coding sequences (CDSs) from RNA-seq data. Results Using CodonW 1.4.2 software, we performed the codon bias analysis of transcriptomes obtained from hepatopancreas tissues, which indicated weak codon bias. Almost all parameters had similar correlations for both species. The gene expression level (FPKM) was negatively correlated with A/T3s. We determined 12 and 14 optimal codons for M. japonicus and M. pulchricaudatus, respectively, and all optimal codons have a C/G-ending. The two Marsupenaeus species had different usage frequencies of codon pairs, which contributed to further analysis of transcriptional differences between them. Orthologous genes that underwent positive selection (ω > 1) had a higher correlation coefficient than that of experienced purifying selection (ω < 1). Parity Rule 2 (PR2) and effective number of codons (ENc) plot analysis showed that the codon usage patterns of both species were influenced by both mutations and selection. Moreover, the average observed ENc value was lower than the expected value for both species, suggesting that factors other than GC may play roles in these phenomena. The results of multispecies clustering based on codon preference were consistent with traditional classification. Conclusions This study provides a relatively comprehensive understanding of the correlations among codon usage bias, gene expression, and selection pressures of CDSs for M. japonicus and M. pulchricaudatus. The genetic evolution was driven by mutations and selection pressure. Moreover, the results point out new insights into the specificities and evolutionary characteristics of the two Marsupenaeus species.


Oncogene ◽  
2021 ◽  
Author(s):  
Sarah L. Gillen ◽  
Joseph A. Waldron ◽  
Martin Bushell

AbstractA key characteristic of cancer cells is their increased proliferative capacity, which requires elevated levels of protein synthesis. The process of protein synthesis involves the translation of codons within the mRNA coding sequence into a string of amino acids to form a polypeptide chain. As most amino acids are encoded by multiple codons, the nucleotide sequence of a coding region can vary dramatically without altering the polypeptide sequence of the encoded protein. Although mutations that do not alter the final amino acid sequence are often thought of as silent/synonymous, these can still have dramatic effects on protein output. Because each codon has a distinct translation elongation rate and can differentially impact mRNA stability, each codon has a different degree of ‘optimality’ for protein synthesis. Recent data demonstrates that the codon preference of a transcriptome matches the abundance of tRNAs within the cell and that this supply and demand between tRNAs and mRNAs varies between different cell types. The largest observed distinction is between mRNAs encoding proteins associated with proliferation or differentiation. Nevertheless, precisely how codon optimality and tRNA expression levels regulate cell fate decisions and their role in malignancy is not fully understood. This review describes the current mechanistic understanding on codon optimality, its role in malignancy and discusses the potential to target codon optimality therapeutically in the context of cancer.


2021 ◽  
Vol 3 (4) ◽  
Author(s):  
Ruben Cabrera ◽  
Jhoana Díaz-Larrea ◽  
Arsenio J. Areces ◽  
Laura Nuñez García ◽  
J. RicardoJ. Ricardo Cruz-Aviña ◽  
...  

Recent advances in molecular biology make it possible to sequence not only genes or genomes, but also to understand codon dynamics. For the organelle genes of these organisms, a small set of preferred codons are used for encoding proteins. For the first time, this paper treats the divergence of synonymous codon usage and its bias in the rbcL gene within the Laurencia complex of red algae. We observed that the synonymous codon preference biases in rbcL are large and differ among species. A clear distinction in codon usage between genera is evident: the genera Dasya and Delesseria use a set that fluctuated between 53 and 58 codons. Whereas, in the genera Ceramium, Chondrophycus, Chlamydomonas, Chlorella, Laurencia s.s, Osmundea, and Palisada codon usage indicates a higher restriction fluctuating between 40 to 51 codons. Laurencia complex genera and other representative algae showed a defined composition pattern, with lower percentage values of NNC/G (7-24.9%). Dasya and Delesseria showed a selective pattern tendency because of high percentage values of NNC/G (54-55%). The estimated codon bias parameters were tested to infer systematic relationships and match suitable codons with the NNC / G codon percentages. Cluster analysis based on Codon Usage supports phylogenetic relationships between Chondrophycus, Palisada, Laurencia, Osmundea, and Yuzurua species.


2021 ◽  
Vol 22 (18) ◽  
pp. 9842
Author(s):  
Zheng-Shan He ◽  
Andan Zhu ◽  
Jun-Bo Yang ◽  
Weishu Fan ◽  
De-Zhu Li

Posttranscriptional modifications, including intron splicing and RNA editing, are common processes during regulation of gene expression in plant organelle genomes. However, the intermediate products of intron-splicing, and the interplay between intron-splicing and RNA-editing were not well studied. Most organelle transcriptome analyses were based on the Illumina short reads which were unable to capture the full spectrum of transcript intermediates within an organelle. To fully investigate the intermediates during intron splicing and the underlying relationships with RNA editing, we used PacBio DNA-seq and Iso-seq, together with Illumina short reads genome and transcriptome sequencing data to assemble the chloroplast and mitochondrial genomes of Nymphaea ‘Joey Tomocik’ and analyze their posttranscriptional features. With the direct evidence from Iso-seq, multiple intermediates partially or fully intron-spliced were observed, and we also found that both cis- and trans-splicing introns were spliced randomly. Moreover, by using rRNA-depleted and non-Oligo(dT)-enrichment strand-specific RNA-seq data and combining direct SNP-calling and transcript-mapping methods, we identified 98 and 865 RNA-editing sites in the plastome and mitogenome of N. ‘Joey Tomocik’, respectively. The target codon preference, the tendency of increasing protein hydrophobicity, and the bias distribution of editing sites are similar in both organelles, suggesting their common evolutionary origin and shared editing machinery. The distribution of RNA editing sites also implies that the RNA editing sites in the intron and exon regions may splice synchronously, except those exonic sites adjacent to intron which could only be edited after being intron-spliced. Our study provides solid evidence for the multiple intermediates co-existing during intron-splicing and their interplay with RNA editing in organelle genomes of a basal angiosperm.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Douglas Meyer ◽  
Jacob Kames ◽  
Haim Bar ◽  
Anton A. Komar ◽  
Aikaterini Alexaki ◽  
...  

Abstract Background Gene expression is highly variable across tissues of multi-cellular organisms, influencing the codon usage of the tissue-specific transcriptome. Cancer disrupts the gene expression pattern of healthy tissue resulting in altered codon usage preferences. The topic of codon usage changes as they relate to codon demand, and tRNA supply in cancer is of growing interest. Methods We analyzed transcriptome-weighted codon and codon pair usage based on The Cancer Genome Atlas (TCGA) RNA-seq data from 6427 solid tumor samples and 632 normal tissue samples. This dataset represents 32 cancer types affecting 11 distinct tissues. Our analysis focused on tissues that give rise to multiple solid tumor types and cancer types that are present in multiple tissues. Results We identified distinct patterns of synonymous codon usage changes for different cancer types affecting the same tissue. For example, a substantial increase in GGT-glycine was observed in invasive ductal carcinoma (IDC), invasive lobular carcinoma (ILC), and mixed invasive ductal and lobular carcinoma (IDLC) of the breast. Change in synonymous codon preference favoring GGT correlated with change in synonymous codon preference against GGC in IDC and IDLC, but not in ILC. Furthermore, we examined the codon usage changes between paired healthy/tumor tissue from the same patient. Using clinical data from TCGA, we conducted a survival analysis of patients based on the degree of change between healthy and tumor-specific codon usage, revealing an association between larger changes and increased mortality. We have also created a database that contains cancer-specific codon and codon pair usage data for cancer types derived from TCGA, which represents a comprehensive tool for codon-usage-oriented cancer research. Conclusions Based on data from TCGA, we have highlighted tumor type-specific signatures of codon and codon pair usage. Paired data revealed variable changes to codon usage patterns, which must be considered when designing personalized cancer treatments. The associated database, CancerCoCoPUTs, represents a comprehensive resource for codon and codon pair usage in cancer and is available at https://dnahive.fda.gov/review/cancercocoputs/. These findings are important to understand the relationship between tRNA supply and codon demand in cancer states and could help guide the development of new cancer therapeutics.


2021 ◽  
Vol 15 (1) ◽  
pp. 20-32
Author(s):  
Rui Yang ◽  
Ruishuang Sun ◽  
Wenlong Zhu ◽  
QingShuai Zhang ◽  
Debang Liu ◽  
...  

To realize the application and production of hagfish mucus protein, this experiment increased the protein expression and improved its purification method. According to codon preference, the hagfish mucus protein gene was optimized to increase the production of target protein. Then, the protein expression conditions of the host bacteria were optimized, and the IPTG concentration, induction time and supplementation amounts of glycine, threonine, and serine were evaluated in single-factor tests. On the basis of single-factor experiments, with the supplementation of glycine, threonine, and serine as independent variables, the target protein yield was the response value. According to the Box-Behnken central combination design principle of the response surface method, the influence of the respective variables and their interaction on the hagfish mucus protein yield were studied, and the induction conditions were optimized through a combination of Design-Expert software and response surface analysis. The results show that the best induction conditions for EsTKα shake flasks are IPTG concentration 0.6 mmol/L, induction for 12 h, and glycine, threonine, and serine added at 90 mg/L, 90 mg/L, and 9.91 mg/L, respectively, to achieve the highest protein yield of 153.482 mg/L. The IPTG concentration of EsTKγ was 0.8 mmol/L after induction for 12 h, and the amounts of glycine, threonine, and serine were 54 mg/L, 9.01 mg/L, and 11.4 mg/L, respectively. The theoretical best protein yield was 141.97 mg/L. Finally, based on the principle of specific self-assembly between proteins, the two proteins were subjected to gradient dialysis, and the gelled assembled protein was selected by the phase separation method to achieve separation and purification.


BMC Biology ◽  
2020 ◽  
Vol 18 (1) ◽  
Author(s):  
Yu Zhao ◽  
Dantong Shang ◽  
Ruhong Ying ◽  
Hanhua Cheng ◽  
Rongjia Zhou

Abstract Background Zebrafish is a model organism widely used for the understanding of gene function, including the fundamental basis of human disease, enabled by the presence in its genome of a high number of orthologs to human genes. CRISPR/Cas9 and next-generation gene-editing techniques using cytidine deaminase fused with Cas9 nickase provide fast and efficient tools able to induce sequence-specific single base mutations in various organisms and have also been used to generate genetically modified zebrafish for modeling pathogenic mutations. However, the editing efficiency in zebrafish of currently available base editors is lower than other model organisms, frequently inducing indel formation, which limits the applicability of these tools and calls for the search of more accurate and efficient editors. Results Here, we generated a new base editor (zAncBE4max) with a length of 5560 bp following a strategy based on the optimization of codon preference in zebrafish. Our new editor effectively created C-to-T base substitution while maintaining a high product purity at multiple target sites. Moreover, zAncBE4max successfully generated the Twist2 p.E78K mutation in zebrafish, recapitulating pathological features of human ablepharon macrostomia syndrome (AMS). Conclusions Overall, the zAncBE4max system provides a promising tool to perform efficient base editing in zebrafish and enhances its capacity to precisely model human diseases.


2020 ◽  
Author(s):  
Panpan Wang ◽  
Yong Mao ◽  
Yongquan Su ◽  
Jun Wang

Abstract Background: Marsupenaeus japonicus, a major commercial shrimp species in the world, has two cryptic or sibling species, Marsupenaeus japonicus and Marsupenaeus pulchricaudatus. Due to the lack of genomic information, little is known about the correlations among codon usage bias, gene expression, and evolutionary trends in Marsupenaeus orthologs.Results: Using the CodonW 1.4.2 software, we performed the codon bias analysis of two Marsupenaeus species transcriptomes. The average contents of GC and ENc were 51.61% and 52.1 for VI (M. japonicus), 51.54% and 52.22 for VII (M. pulchricaudatus), respectively. Parity Rule 2 (PR2) plot analysis showed that purines (A and G) were used more frequently than pyrimidines (C and T) in two Marsupenaeus species. The average ENc value was 52.1 and 52.22 for M. japonicus and M. pulchricaudatus, respectively. Overall, orthologous genes that underwent positive selection (ω > 1) had a higher correlation coefficient than that experienced purifying selection (ω < 1). In M. japonicus, the relationships were highly significant positive about Axis 1 and A3, T3 and ENc (p < 0.01). However, all relationships in M. pulchricaudatus were the opposite. We determined 12 and 14 optimal codons for M. japonicus and M. pulchricaudatus, respectively. Two Marsupenaeus species had 31 different codon pairs. The results of multi-species clustering based on codon preference were consistent with traditional classification. Conclusions: We characterized the codon usage patterns of the two Marsupenaeus species and the evolutionary trends in Marsupenaeus orthologs, which provides new insights into the genetic divergence and the phylogenetic relationships of two Marsupenaeus species.


Marine Drugs ◽  
2020 ◽  
Vol 18 (9) ◽  
pp. 464
Author(s):  
Xinjia Li ◽  
Wanyi Chen ◽  
Dongting Zhangsun ◽  
Sulan Luo

The venom of various Conus species is composed of a rich variety of unique bioactive peptides, commonly referred to as conotoxins (conopeptides). Most conopeptides have specific receptors or ion channels as physiologically relevant targets. In this paper, high-throughput transcriptome sequencing was performed to analyze putative conotoxin transcripts from the venom duct of a vermivorous cone snail species, Conus litteratus native to the South China Sea. A total of 128 putative conotoxins were identified, most of them belonging to 22 known superfamilies, with 43 conotoxins being regarded as belonging to new superfamilies. Notably, the M superfamily was the most abundant in conotoxins among the known superfamilies. A total of 15 known cysteine frameworks were also described. The largest proportion of cysteine frameworks were VI/VII (C-C-CC-C-C), IX (C-C-C-C-C-C) and XIV (C-C-C-C). In addition, five novel cysteine patterns were also discovered. Simple sequence repeat detection results showed that di-nucleotide was the major type of repetition, and the codon usage bias results indicated that the codon usage bias of the conotoxin genes was weak, but the M, O1, O2 superfamilies differed in codon preference. Gene cloning indicated that there was no intron in conotoxins of the B1- or J superfamily, one intron with 1273–1339 bp existed in a mature region of the F superfamily, which is different from the previously reported gene structure of conotoxins from other superfamilies. This study will enhance our understanding of conotoxin diversity, and the new conotoxins discovered in this paper will provide more potential candidates for the development of pharmacological probes and marine peptide drugs.


Sign in / Sign up

Export Citation Format

Share Document