potential orfs
Recently Published Documents


TOTAL DOCUMENTS

4
(FIVE YEARS 0)

H-INDEX

2
(FIVE YEARS 0)

Author(s):  
Christian Jean Michel ◽  
Claudine Mayer ◽  
Olivier Poch ◽  
Julie Dawn Thompson

Abstract Background: The Covid19 infection is caused by the SARS-CoV-2 virus, a novel member of the coronavirus (CoV) family. CoV genomes code for a ORF1a / ORF1ab polyprotein and four structural proteins widely studied as major drug targets. The genomes also contain a variable number of open reading frames (ORFs) coding for accessory proteins that are not essential for virus replication, but appear to have a role in pathogenesis. The accessory proteins have been less well characterized and are difficult to predict by classical bioinformatics methods.Methods: We propose a computational tool GOFIX to characterize potential ORFs in virus genomes. In particular, ORF coding potential is estimated by searching for enrichment in motifs of the X circular code, that is known to be over-represented in the reading frames of viral genes.Results: We applied GOFIX to study the SARS-CoV-2 and related genomes including SARS-CoV and SARS-like viruses from bat, civet and pangolin hosts, focusing on the accessory proteins. Our analysis provides evidence supporting the presence of overlapping ORFs 7b, 9b and 9c in all the genomes and thus helps to resolve some differences in current genome annotations. In contrast, we predict that ORF3b is not functional in all genomes. Novel putative ORFs were also predicted, including a truncated form of the ORF10 previously identified in SARS-CoV-2 and a little known ORF overlapping the Spike protein in Civet-CoV and SARS-CoV.Conclusions: Our findings contribute to characterizing sequence properties of accessory genes of SARS coronaviruses, and especially the newly acquired genes making use of overlapping reading frames.



Author(s):  
Christian J. Michel ◽  
Claudine Mayer ◽  
Olivier Poch ◽  
Julie D. Thompson

AbstractThe Covid19 infection is caused by the SARS-CoV-2 virus, a novel member of the coronavirus (CoV) family. CoV genomes code for a ORF1a / ORF1ab polyprotein and four structural proteins widely studied as major drug targets. The genomes also contain a variable number of open reading frames (ORFs) coding for accessory proteins that are not essential for virus replication, but appear to have a role in pathogenesis. The accessory proteins have been less well characterized and are difficult to predict by classical bioinformatics methods. We propose a computational tool GOFIX to characterize potential ORFs in virus genomes. In particular, ORF coding potential is estimated by searching for enrichment in motifs of the X circular code, that is known to be over-represented in the reading frames of viral genes. We applied GOFIX to study the SARS-CoV-2 and related genomes including SARS-CoV and SARS-like viruses from bat, civet and pangolin hosts, focusing on the accessory proteins. Our analysis provides evidence supporting the presence of overlapping ORFs 7b, 9b and 9c in all the genomes and thus helps to resolve some differences in current genome annotations. In contrast, we predict that ORF3b is not functional in all genomes. Novel putative ORFs were also predicted, including a truncated form of the ORF10 previously identified in SARS-CoV-2 and a little known ORF overlapping the Spike protein in Civet-CoV and SARS-CoV. Our findings contribute to characterizing sequence properties of accessory genes of SARS coronaviruses, and especially the newly acquired genes making use of overlapping reading frames.



2004 ◽  
Vol 78 (13) ◽  
pp. 6723-6734 ◽  
Author(s):  
Yee-Joo Tan ◽  
Eileen Teng ◽  
Shuo Shen ◽  
Timothy H. P. Tan ◽  
Phuay-Yee Goh ◽  
...  

ABSTRACT The severe acute respiratory syndrome coronavirus (SARS-CoV) genome contains open reading frames (ORFs) that encode for several genes that are homologous to proteins found in all known coronaviruses. These are the replicase gene 1a/1b and the four structural proteins, nucleocapsid (N), spike (S), membrane (M), and envelope (E), and these proteins are expected to be essential for the replication of the virus. In addition, this genome also contains nine other potential ORFs varying in length from 39 to 274 amino acids. The largest among these is the first ORF of the second longest subgenomic RNA, and this protein (termed U274 in the present study) consists of 274 amino acids and contains three putative transmembrane domains. Using antibody specific for the C terminus of U274, we show U274 to be expressed in SARS-CoV-infected Vero E6 cells and, in addition to the full-length protein, two other processed forms were also detected. By indirect immunofluorescence, U274 was localized to the perinuclear region, as well as to the plasma membrane, in both transfected and infected cells. Using an N terminus myc-tagged U274, the topology of U274 and its expression on the cell surface were confirmed. Deletion of a cytoplasmic domain of U274, which contains Yxxφ and diacidic motifs, abolished its transport to the cell surface. In addition, U274 expressed on the cell surface can internalize antibodies from the culture medium into the cells. Coimmunoprecipitation experiments also showed that U274 could interact specifically with the M, E, and S structural proteins, as well as with U122, another protein that is unique to SARS-CoV.



1998 ◽  
Vol 64 (5) ◽  
pp. 1871-1877 ◽  
Author(s):  
Sarah K. Stephens ◽  
Bel�n Floriano ◽  
Declan P. Cathcart ◽  
Susan A. Bayley ◽  
Valerie F. Witt ◽  
...  

ABSTRACT A 4.5-kb region of chromosomal DNA carrying the locus responsible for the production of plantaricin S, a two-peptide bacteriocin produced by Lactobacillus plantarum LPCO10 (R. Jim�nez-Dı́az, J. L. Ruiz-Barba, D. P. Cathcart, H. Holo, I. F. Nes, K. H. Sletten, and P. J. Warner, Appl. Environ. Microbiol. 61:4459–4463, 1995), has been cloned, and the nucleotide sequence has been elucidated. Two genes, designatedplsA and plsB and encoding peptides α and β, respectively, of plantaricin S, plus an open reading frame (ORF), ORF2, were found to be organized in an operon. Northern blot analysis showed that these genes are cotranscribed, giving a ca. 0.7-kb mRNA, whose transcription start point was determined by primer extension. Nucleotide sequences of plsA and plsB revealed that both genes are translated as bacteriocin precursors which include N-terminal leader sequences of the double-glycine type. The role of ORF2 is unknown at the moment, although it might be expected to encode an immunity protein of the type described for other bacteriocin operons. In addition, several other potential ORFs have been found, including some which may be responsible for the regulation of bacteriocin production. Two of them, ORF8 and ORF14, show strong homology with histidine protein kinase and response regulator genes, respectively, which have been found to be involved in the regulation of the production of other bacteriocins from lactic acid bacteria. A third ORF, ORF5, shows homology with gene agrB fromStaphylococcus aureus, which is involved in the mechanism of regulation of the virulence phenotype in this species. Thus, anagr-like regulatory system for the production of plantaricin S is postulated.



Sign in / Sign up

Export Citation Format

Share Document