scholarly journals Equivalence classes of circular codes induced by permutation groups

2021 ◽  
Vol 140 (1) ◽  
pp. 107-121
Author(s):  
Fariba Fayazi ◽  
Elena Fimmel ◽  
Lutz Strüngmann

AbstractIn the 1950s, Crick proposed the concept of so-called comma-free codes as an answer to the frame-shift problem that biologists have encountered when studying the process of translating a sequence of nucleotide bases into a protein. A little later it turned out that this proposal unfortunately does not correspond to biological reality. However, in the mid-90s, a weaker version of comma-free codes, so-called circular codes, was discovered in nature in J Theor Biol 182:45–58, 1996. Circular codes allow to retrieve the reading frame during the translational process in the ribosome and surprisingly the circular code discovered in nature is even circular in all three possible reading-frames ($$C^3$$ C 3 -property). Moreover, it is maximal in the sense that it contains 20 codons and is self-complementary which means that it consists of pairs of codons and corresponding anticodons. In further investigations, it was found that there are exactly 216 codes that have the same strong properties as the originally found code from J Theor Biol 182:45–58. Using an algebraic approach, it was shown in J Math Biol, 2004 that the class of 216 maximal self-complementary $$C^3$$ C 3 -codes can be partitioned into 27 equally sized equivalence classes by the action of a transformation group $$L \subseteq S_4$$ L ⊆ S 4 which is isomorphic to the dihedral group. Here, we extend the above findings to circular codes over a finite alphabet of even cardinality $$|\Sigma |=2n$$ | Σ | = 2 n for $$n \in {\mathbb {N}}$$ n ∈ N . We describe the corresponding group $$L_n$$ L n using matrices and we investigate what classes of circular codes are split into equally sized equivalence classes under the natural equivalence relation induced by $$L_n$$ L n . Surprisingly, this is not always the case. All results and constructions are illustrated by examples.

Author(s):  
Elena Fimmel ◽  
Christian J. Michel ◽  
Lutz Strüngmann

The circular code theory proposes that genes are constituted of two trinucleotide codes: the classical genetic code with 61 trinucleotides for coding the 20 amino acids (except the three stop codons { TAA , TAG , TGA }) and a circular code based on 20 trinucleotides for retrieving, maintaining and synchronizing the reading frame. It relies on two main results: the identification of a maximal C 3 self-complementary trinucleotide circular code X in genes of bacteria, eukaryotes, plasmids and viruses (Michel 2015 J. Theor. Biol. 380, 156–177. ( doi:10.1016/j.jtbi.2015.04.009 ); Arquès & Michel 1996 J. Theor. Biol. 182, 45–58. ( doi:10.1006/jtbi.1996.0142 )) and the finding of X circular code motifs in tRNAs and rRNAs, in particular in the ribosome decoding centre (Michel 2012 Comput. Biol. Chem. 37, 24–37. ( doi:10.1016/j.compbiolchem.2011.10.002 ); El Soufi & Michel 2014 Comput. Biol. Chem. 52, 9–17. ( doi:10.1016/j.compbiolchem.2014.08.001 )). The univerally conserved nucleotides A1492 and A1493 and the conserved nucleotide G530 are included in X circular code motifs. Recently, dinucleotide circular codes were also investigated (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. ( doi:10.1155/2013/538631 ); Fimmel et al. 2015 J. Theor. Biol. 386, 159–165. ( doi:10.1016/j.jtbi.2015.08.034 )). As the genetic motifs of different lengths are ubiquitous in genes and genomes, we introduce a new approach based on graph theory to study in full generality n -nucleotide circular codes X , i.e. of length 2 (dinucleotide), 3 (trinucleotide), 4 (tetranucleotide), etc. Indeed, we prove that an n -nucleotide code X is circular if and only if the corresponding graph is acyclic. Moreover, the maximal length of a path in corresponds to the window of nucleotides in a sequence for detecting the correct reading frame. Finally, the graph theory of tournaments is applied to the study of dinucleotide circular codes. It has full equivalence between the combinatorics theory (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. ( doi:10.1155/2013/538631 )) and the group theory (Fimmel et al. 2015 J. Theor. Biol. 386, 159–165. ( doi:10.1016/j.jtbi.2015.08.034 )) of dinucleotide circular codes while its mathematical approach is simpler.


2004 ◽  
Vol 377 (2) ◽  
pp. 459-467 ◽  
Author(s):  
Jose M. LAPLAZA ◽  
Magnolia BOSTICK ◽  
Derek T. SCHOLES ◽  
M. Joan CURCIO ◽  
Judy CALLIS

In Saccharomyces cerevisiae, the ubiquitin-like protein Rub1p (related to ubiquitin 1 protein) covalently attaches to the cullin protein Cdc53p (cell division cycle 53 protein), a subunit of a class of ubiquitin E3 ligases named SCF (Skp1–Cdc53–F-box protein) complex. We identified Rtt101p (regulator of Ty transposition 101 protein, where Ty stands for transposon of yeast), initially found during a screen for proteins to confer retrotransposition suppression, and Cul3p (cullin 3 protein), a protein encoded by the previously uncharacterized open reading frame YGR003w, as two new in vivo targets for Rub1p conjugation. These proteins show significant identity with Cdc53p and, therefore, are cullin proteins. Modification of Cul3p is eliminated by deletion of the Rub1p pathway through disruption of either RUB1 or its activating enzyme ENR2/ULA1. The same disruptions in the Rub pathway decreased the percentage of total Rtt101p that is modified from approx. 60 to 30%. This suggests that Rtt101p has an additional RUB1- and ENR2-independent modification. All modified forms of Rtt101p and Cul3p were lost when a single lysine residue in a conserved region near the C-terminus was replaced by an arginine residue. These results suggest that this lysine residue is the site of Rub1p-dependent and -independent modifications in Rtt101p and of Rub1p-dependent modification in Cul3p. An rtt101Δ strain was hypersensitive to thiabendazole, isopropyl (N-3-chlorophenyl) carbamate and methyl methanesulphonate, but rub1Δ strains were not. Whereas rtt101Δ strains exhibited a 14-fold increase in Ty1 transposition, isogenic rub1Δ strains did not show statistically significant increases. Rtt101K791Rp, which cannot be modified, complemented for Rtt101p function in a transposition assay. Altogether, these results suggest that neither the RUB1-dependent nor the RUB1-independent form of Rtt101p is required for Rtt101p function. The identification of additional Rub1p targets in S. cerevisiae suggests an expanded role for Rub in this organism.


Life ◽  
2019 ◽  
Vol 9 (1) ◽  
pp. 18 ◽  
Author(s):  
Christian J. Michel

We study the distribution of new classes of motifs in genes, a research field that has not been investigated to date. A single-frame motif SF has no trinucleotide in reading frame (frame 0) that occurs in a shifted frame (frame 1 or 2), e.g., the dicodon AAACAA is SF as the trinucleotides AAA and CAA do not occur in a shifted frame. A motif which is not single-frame SF is multiple-frame MF. Several classes of MF motifs are defined and analysed. The distributions of single-frame SF motifs (associated with an unambiguous trinucleotide decoding in the two 5'–3' and 3'–5' directions) and 5′ unambiguous motifs 5'U (associated with an unambiguous trinucleotide decoding in the 5'–3' direction only) are analysed without and with constraints. The constraints studied are: initiation and stop codons, periodic codons AAA,CCC,GGG,TTT, antiparallel complementarity and parallel complementarity. Taken together, these results suggest that the complementarity property involved in the antiparallel (DNA double helix, RNA stem) and parallel sequences could also be fundamental for coding genes with an unambiguous trinucleotide decoding in the two 5'–3' and 3'–5' directions or the 5'–3' direction only. Furthermore, the single-frame motifs SF with a property of trinucleotide decoding and the framing motifs F (also called circular code motifs; first introduced by Michel (2012)) with a property of reading frame decoding may have been involved in the early life genes to build the modern genetic code and the extant genes. They could have been involved in the stage without anticodon-amino acid interactions or in the Implicated Site Nucleotides (ISN) of RNA interacting with the amino acids. Finally, the SF and MF dipeptides associated with the SF and MF dicodons, respectively, are studied and their importance for biology and the origin of life discussed.


1995 ◽  
Vol 305 (1) ◽  
pp. 197-203 ◽  
Author(s):  
J A Gutierrez ◽  
V Guerriero

A cDNA clone for the stress-inducible 70 kDa heat-shock protein (Hsp70) has been isolated from a bovine skeletal-muscle cDNA library. This mRNA encodes a protein with a calculated molecular mass of 70250 Da. The cDNA has one continuous open reading frame capable of encoding a 641-amino-acid protein. Expression of this cDNA in a bacterial expression system produced a protein with a mobility identical with that of the inducible Hsp70 protein from bovine skeletal muscle as determined by SDS/PAGE. Two-dimensional gel electrophoresis demonstrated this protein to have focusing properties identical with that of a minor isoform from bovine skeletal muscle. Upon carbamylation of this bacterially expressed protein, a train of charged proteins with charge differences of -1 were produced. These carbamylated proteins were shown to have similar focusing mobilities to the Hsp70 isoforms isolated from bovine skeletal muscle. These results demonstrate the identification of a skeletal-muscle inducible Hsp70 gene and suggest that the presence of multiple Hsp70 isoforms may be the product of post-translational modifications to the Hsp70 proteins.


2010 ◽  
Vol 84 (14) ◽  
pp. 7029-7038 ◽  
Author(s):  
Sabrina Schreiner ◽  
Peter Wimmer ◽  
Hüseyin Sirma ◽  
Roger D. Everett ◽  
Paola Blanchette ◽  
...  

ABSTRACT The death-associated protein Daxx found in PML (promyelocytic leukemia protein) nuclear bodies (PML-NBs) is involved in transcriptional regulation and cellular intrinsic antiviral resistence against incoming viruses. We found that knockdown of Daxx in a nontransformed human hepatocyte cell line using RNA interference (RNAi) techniques results in significantly increased adenoviral (Ad) replication, including enhanced viral mRNA synthesis and viral protein expression. This Daxx restriction imposed upon adenovirus growth is counteracted by early protein E1B-55K (early region 1B 55-kDa protein), a multifunctional regulator of cell-cycle-independent Ad5 replication. The viral protein binds to Daxx and induces its degradation through a proteasome-dependent pathway. We show that this process is independent of Ad E4orf6 (early region 4 open reading frame 6), known to promote the proteasomal degradation of cellular p53, Mre11, DNA ligase IV, and integrin α3 in combination with E1B-55K. These results illustrate the importance of the PML-NB-associated factor Daxx in virus growth restriction and suggest that E1B-55K antagonizes innate antiviral activities of Daxx and PML-NBs to stimulate viral replication at a posttranslational level.


2005 ◽  
Vol 71 (6) ◽  
pp. 3068-3076 ◽  
Author(s):  
Muthusamy Kunnimalaiyaan ◽  
Patricia S. Vary

ABSTRACT Strain QM B1551 of Bacillus megaterium contains seven compatible plasmids: two small rolling circle plasmids and five theta-replicating plasmids with cross-hybridizing replicons. To expand our understanding of these plasmids, the replicon region (6.7 kb) from pBM300 was cloned, sequenced, and functionally characterized. Sequence analysis showed that the replication protein (RepM300) was highly homologous to two other plasmid Rep proteins of the same strain but to no other known proteins. Furthermore, the location of the replication origin was within the RepM300 coding region, and the origin contained three 12-base direct repeats. Deletion analysis of the replicon confirmed the role of the Rep protein and showed that open reading frame 2 (ORF2) was required for stability. However, the protein encoded by ORF2 is entirely different from the replicon stability proteins encoded by the other two replicons. The entire plasmid was isolated from the plasmid array by integrating a spectinomycin resistance gene and transforming a plasmidless strain, PV361. Complete sequencing showed that pBM300 was 26,300 bp long, had a G+C content of 35.2%, and contained 20 ORFs, two of which encoded proteins that had no similarity to other proteins in the database. The proteins encoded by the plasmid ORFs had similarity to proteins for mobilization and transfer, an integrase, a rifampin resistance protein, a cell wall hydrolase, glutathione synthase, and a biotin carboxylase. The similarities were to several gram-positive genera and a few gram-negative genera and archaea. oriT and ssoT-like regions were detected near two mob genes. These results suggest that pBM300 is a mobilizable hybrid plasmid that confers increased metabolic and germination ability on its host. Its replicon also helps define a new plasmid family.


Author(s):  
O. G. Babak ◽  
N. A. Nekrashevich ◽  
T. V. Nikitinskaya ◽  
K. K. Yatsevich ◽  
A. V. Kilchevsky

Anthocyanins are high-value plant antioxidants; they also determine biotic and abiotic stress resistance. The aim of our research was to study the allelic polymorphism of Antocyanin 1 orthologs in the vegetable Solanaceae crops of C. annuum and S. melongena. The search revealed the following closest genes in C. annuum: Myb113-like1 TF and Myb113like2 transcription factors and Myb1 in S. melongena. Exon amplicons of those genes were obtained and then sequenced in the pepper and eggplant samples with contrasting anthocyanin fruit coloration. Primers to the identified polymorphisms were developed and their correlation with the anthocyanin accumulation in fruits was studied. A close correlation was found between a minimum accumulation or the complete absence of anthocyanin synthesis in fruits with a single nucleotide deletion (Myb113-like1), and in the pepper samples, 2 SNP (Myb113-like2) was detected using the CAPS marker Myb 113-AccI. In the eggplant samples, the deletions of 6 and 26 bp were detected using the SCAR marker MybMel and the CAPS marker MybmelPst1. The disturbance of anthocyanin synthesis in pepper forms with 1Indel in Myb113-like1 TF was determined by a shift in the reading frame and SNPs in Myb113-like2 TF lead to amino acid substitutions: Lys → Arg and Thr → Lys. In the eggplant, a deletion of 6 bp leads to the loss of ala and arg in the protein; a deletion of 26 bp causes disorder during the mRNA maturation. The developed markers allow identifying the Myb-like TF alleles under study, resulting in anthocyanin synthesis disturbance in fruits. C. annuum and S. melongena samples with different alleles were selected for a further study and new varieties in agriculture. 


Genome ◽  
1991 ◽  
Vol 34 (1) ◽  
pp. 6-12 ◽  
Author(s):  
Shiv S. Prasad ◽  
Linda J. Harris ◽  
David L. Baillie ◽  
Ann M. Rose

In this paper we present the sequence of an intact Caenorhabditis briggsae transposable element, Tcb2. Tcb2 is 1606 base pairs in length and contains 80 base pair imperfect terminal repeats and a single open reading frame. We have identified blocks of T-rich repeats in the regions 150–200 and 1421–1476 of this element which are conserved in the Caenorhabditis elegans element Tc1. The sequence conservation of these regions in elements from different Caenorhabditis species suggests that they are of functional importance. A single open reading frame corresponding to the major open reading frame of Tc1 is conserved among Tc1, Tcb1, and Tcb2. Comparison of the first 550 nucleotides of the sequence among the three elements has allowed the evaluation of a model proposing an extension of the major open reading frame. Our data support the suggestion that Tc1 is capable of producing a 335 amino acid protein. A comparison of the sequence coding for the amino and carboxy termini of the 273 amino acid transposase from Caenorhabditis Tc1-like elements and Drosophila HB1 showed different amounts of divergence for each of these regions, indicating that the two functional domains have undergone different amounts of selection. Our data are not compatible with the proposal that Tc1-related sequences have been acquired via horizontal transmission. The divergence of Tc1 from the two C. briggsae elements, Tcb1 and Tcb2, indicated that all three elements have been diverging from each other for approximately the same amount of time as the genomes of the two species.Key words: Caenorhabditis, transposable element, sequence comparison.


2009 ◽  
Vol 83 (24) ◽  
pp. 12833-12841 ◽  
Author(s):  
Rachel Condjella ◽  
Xuefeng Liu ◽  
Frank Suprynowicz ◽  
Hang Yuan ◽  
Sawali Sudarshan ◽  
...  

ABSTRACT The recently discovered Canis familiaris papillomavirus (PV) type 2 (CfPV2) provides a unique opportunity to study PV gene functions in vitro and in vivo. Unlike the previously characterized canine oral PV, CfPV2 contains an E5 open reading frame and is associated with progression to squamous cell carcinoma. In the current study, we have expressed and characterized the CfPV2-encoded E5 protein, a small, hydrophobic, 41-amino-acid polypeptide. We demonstrate that, similar to the E5 protein from high-risk human PV type 16, the CfPV2 E5 protein is localized in the endoplasmic reticulum (ER) and that its expression decreases keratinocyte proliferation and cell life span. E5 expression also increases the percentage of cells in the G1 phase of the cell cycle, with a concomitant decrease in the percentage of cells in S phase. To identify a potential mechanism for E5-mediated growth inhibition from the ER, we developed a real-time PCR method to quantify the splicing of XBP1 mRNA as a measure of ER stress. We found that the CfPV2 E5 protein induced ER stress and that this, as well as the observed growth inhibition, is tempered significantly by coexpression of the CfPV2 E6 and E7 genes. It is possible that the spatial/temporal regulation of E6/E7 gene expression during keratinocyte differentiation might therefore modulate E5 activity and ER stress.


Sign in / Sign up

Export Citation Format

Share Document