Pervasive RNA Secondary Structure in the Genomes of SARS-CoV-2 and Other Coronaviruses

ABSTRACT The ultimate outcome of the coronavirus disease 2019 (COVID-19) pandemic is unknown and is dependent on a complex interplay of its pathogenicity, transmissibility, and population immunity. In the current study, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was investigated for the presence of large-scale internal RNA base pairing in its genome. This property, termed genome-scale ordered RNA structure (GORS) has been previously associated with host persistence in other positive-strand RNA viruses, potentially through its shielding effect on viral RNA recognition in the cell. Genomes of SARS-CoV-2 were remarkably structured, with minimum folding energy differences (MFEDs) of 15%, substantially greater than previously examined viruses such as hepatitis C virus (HCV) (MFED of 7 to 9%). High MFED values were shared with all coronavirus genomes analyzed and created by several hundred consecutive energetically favored stem-loops throughout the genome. In contrast to replication-associated RNA structure, GORS was poorly conserved in the positions and identities of base pairing with other sarbecoviruses—even similarly positioned stem-loops in SARS-CoV-2 and SARS-CoV rarely shared homologous pairings, indicative of more rapid evolutionary change in RNA structure than in the underlying coding sequences. Sites predicted to be base paired in SARS-CoV-2 showed less sequence diversity than unpaired sites, suggesting that disruption of RNA structure by mutation imposes a fitness cost on the virus that is potentially restrictive to its longer evolution. Although functionally uncharacterized, GORS in SARS-CoV-2 and other coronaviruses represents important elements in their cellular interactions that may contribute to their persistence and transmissibility. IMPORTANCE The detection and characterization of large-scale RNA secondary structure in the genome of SARS-CoV-2 indicate an extraordinary and unsuspected degree of genome structural organization; this could be effectively visualized through a newly developed contour plotting method that displays positions, structural features, and conservation of RNA secondary structure between related viruses. Such RNA structure imposes a substantial evolutionary cost; paired sites showed greater restriction in diversity and represent a substantial additional constraint in reconstructing its molecular epidemiology. Its biological relevance arises from previously documented associations between possession of structured genomes and persistence, as documented for HCV and several other RNA viruses infecting humans and mammals. Shared properties potentially conferred by large-scale structure in SARS-CoV-2 include increasing evidence for prolonged infections and induced immune dysfunction that prevents development of protective immunity. The findings provide an additional element to cellular interactions that potentially influences the natural history of SARS-CoV-2, its pathogenicity, and its transmission.

Download Full-text

Pervasive RNA secondary structure in the genomes of SARS-CoV-2 and other coronaviruses – an endeavour to understand its biological purpose

10.1101/2020.06.17.155200 ◽

2020 ◽

Cited By ~ 1

Author(s):

P. Simmonds

Keyword(s):

Rna Structure ◽

Large Scale ◽

Shielding Effect ◽

Cellular Interactions ◽

Folding Energy ◽

Base Pairing ◽

Population Immunity ◽

Ultimate Outcome ◽

Positive Strand Rna ◽

Genome Scale

ABSTRACTThe ultimate outcome of the COVID-19 pandemic is unknown and is dependent on a complex interplay of its pathogenicity, transmissibility and population immunity. In the current study, SARS coronavirus 2 (SARS-CoV-2) was investigated for the presence of large scale internal RNA base pairing in its genome. This property, termed genome scale ordered RNA structure (GORS) has been previously associated with host persistence in other positive-strand RNA viruses, potentially through its shielding effect on viral RNA recognition in the cell. Genomes of SARS-CoV-2 were remarkably structured, with minimum folding energy differences (MFEDs) of 15%, substantially greater than previously examined viruses such as HCV (MFED 7-9%). High MFED values were shared with all coronavirus genomes analysed created by several hundred consecutive energetically favoured stem-loops throughout the genome. In contrast to replication-association RNA structure, GORS was poorly conserved in the positions and identities of base pairing with other sarbecoviruses – even similarly positioned stem-loops in SARS-CoV-2 and SARS-CoV rarely shared homologous pairings, indicative of more rapid evolutionary change in RNA structure than in the underlying coding sequences. Sites predicted to be base-paired in SARS-CoV-2 showed substantially less sequence diversity than unpaired sites, suggesting that disruption of RNA structure by mutation imposes a fitness cost on the virus which is potentially restrictive to its longer evolution. Although functionally uncharacterised, GORS in SARS-CoV-2 and other coronaviruses represent important elements in their cellular interactions that may contribute to their persistence and transmissibility.

Download Full-text

Bioinformatic and Physical Characterizations of Genome-Scale Ordered RNA Structure in Mammalian RNA Viruses

Journal of Virology ◽

10.1128/jvi.01078-08 ◽

2008 ◽

Vol 82 (23) ◽

pp. 11824-11836 ◽

Cited By ~ 66

Author(s):

Matthew Davis ◽

Selena M. Sagan ◽

John P. Pezacki ◽

David J. Evans ◽

Peter Simmonds

Keyword(s):

Secondary Structure ◽

Rna Structure ◽

Rna Secondary Structure ◽

Large Scale ◽

Rna Viruses ◽

Plant Viruses ◽

Large Set ◽

Conserved Sequence ◽

Evolutionarily Conserved ◽

Genome Scale

ABSTRACT By the analysis of thermodynamic RNA secondary structure predictions, we previously obtained evidence for evolutionarily conserved large-scale ordering of RNA virus genomes (P. Simmonds, A. Tuplin, and D. J. Evans, RNA 10:1337-1351, 2004). Genome-scale ordered RNA structure (GORS) was widely distributed in many animal and plant viruses, much greater in extent than RNA structures required for viral translation or replication, but in mammalian viruses was associated with host persistence. To substantiate the existence of large-scale RNA structure differences between viruses, a large set of alignments of mammalian RNA viruses and rRNA sequences as controls were examined by thermodynamic methods (to calculate minimum free energy differences) and by algorithmically independent RNAz and Pfold methods. These methods produced generally concordant results and identified substantial differences in the degrees of evolutionarily conserved, sequence order-dependent RNA secondary structure between virus genera and groups. A probe hybridization accessibility assay was used to investigate the physical nature of GORS. Transcripts of hepatitis C virus (HCV), hepatitis G virus/GB virus-C (HGV/GBV-C), and murine norovirus, which are predicted to be structured, were largely inaccessible to hybridization in solution, in contrast to the almost universal binding of probes to a range of unstructured virus transcripts irrespective of G+C content. Using atomic force microscopy, HCV and HGV/GBV-C RNA was visualized as tightly compacted prolate spheroids, while under the same experimental conditions the predicted unstructured poliovirus and rubella virus RNA were pleomorphic and had extensively single-stranded RNA on deposition. Bioinformatic and physical characterization methods both identified fundamental differences in the configurations of viral genomic RNA that may modify their interactions with host cell defenses and their ability to persist.

Download Full-text

Importance of the Positive-Strand RNA Secondary Structure of a Murine Coronavirus Defective Interfering RNA Internal Replication Signal in Positive-Strand RNA Synthesis

Journal of Virology ◽

10.1128/jvi.72.10.7926-7933.1998 ◽

1998 ◽

Vol 72 (10) ◽

pp. 7926-7933 ◽

Cited By ~ 17

Author(s):

John F. Repass ◽

Shinji Makino

Keyword(s):

Secondary Structure ◽

Rna Structure ◽

Rna Secondary Structure ◽

Rna Synthesis ◽

Biological Function ◽

Hepatitis Virus ◽

Positive Strand ◽

Positive Strand Rna ◽

Genomic Regions ◽

Interfering Rna

ABSTRACT The RNA elements that are required for replication of defective interfering (DI) RNA of the JHM strain of mouse hepatitis virus (MHV) consist of three discontinuous genomic regions: about 0.46 to 0.47 kb from both terminal sequences and an internal 58-nucleotide (nt)-long sequence (58-nt region) present at about 0.9 kb from the 5′ end of the DI genome. The internal region is important for positive-strand DI RNA synthesis (Y. N. Kim and S. Makino, J. Virol. 69:4963–4971, 1995). We further characterized the 58-nt region in the present study and obtained the following results. (i) The positive-strand RNA structure in solution was comparable with that predicted by computer modeling. (ii) Positive-strand RNA secondary structure, but not negative-strand RNA structure, was important for the biological function of the region. (iii) The biological function had a sequence-specific requirement. We discuss possible mechanisms by which the internal cis-acting signal drives MHV positive-strand DI RNA synthesis.

Download Full-text

Predicting RNA Secondary Structure Using a Improved BPSO Based on Stem Combination

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.325-326.1551 ◽

2013 ◽

Vol 325-326 ◽

pp. 1551-1554

Author(s):

Yi Qi

Keyword(s):

Secondary Structure ◽

Sensitivity And Specificity ◽

Rna Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Experimental Results ◽

New Method ◽

Rna Structure Prediction ◽

New Strategies

In this paper, we present an improved BPSO to predict RNA secondary structure to improve the performance with two new strategies. First one is to reduce the searching space of PSO through super stem set construction. Second is to modify the general BPSO updating process to settle stem permutation and combination problems. The experimental results show that the new method is effective for RNA structure prediction in terms of sensitivity and specificity by different sequence datasets including simple pseudoknot.

Download Full-text

Quantitative high-throughput tests of ubiquitous RNA secondary structure prediction algorithms via RNA/protein binding

10.1101/571588 ◽

2019 ◽

Cited By ~ 2

Author(s):

Winston R. Becker ◽

Inga Jarmoskaite ◽

Kalli Kappel ◽

Pavanapuresan P. Vaidyanathan ◽

Sarah K. Denny ◽

...

Keyword(s):

Secondary Structure ◽

Protein Binding ◽

Rna Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Nearest Neighbor ◽

Secondary Structure Prediction ◽

Structural Features ◽

Vast Number ◽

Prediction Algorithms

AbstractNearest-neighbor (NN) rules provide a simple and powerful quantitative framework for RNA structure prediction that is strongly supported for canonical Watson-Crick duplexes from a plethora of thermodynamic measurements. Predictions of RNA secondary structure based on nearest-neighbor (NN) rules are routinely used to understand biological function and to engineer and control new functions in biotechnology. However, NN applications to RNA structural features such as internal and terminal loops rely on approximations and assumptions, with sparse experimental coverage of the vast number of possible sequence and structural features. To test to what extent NN rules accurately predict thermodynamic stabilities across RNAs with non-WC features, we tested their predictions using a quantitative high-throughput assay platform, RNA-MaP. Using a thermodynamic assay with coupled protein binding, we carried out equilibrium measurements for over 1000 RNAs with a range of predicted secondary structure stabilities. Our results revealed substantial scatter and systematic deviations between NN predictions and observed stabilities. Solution salt effects and incorrect or omitted loop parameters contribute to these observed deviations. Our results demonstrate the need to independently and quantitatively test NN computational algorithms to identify their capabilities and limitations. RNA-MaP and related approaches can be used to test computational predictions and can be adapted to obtain experimental data to improve RNA secondary structure and other prediction algorithms.Significance statementRNA secondary structure prediction algorithms are routinely used to understand, predict and design functional RNA structures in biology and biotechnology. Given the vast number of RNA sequence and structural features, these predictions rely on a series of approximations, and independent tests are needed to quantitatively evaluate the accuracy of predicted RNA structural stabilities. Here we measure the stabilities of over 1000 RNA constructs by using a coupled protein binding assay. Our results reveal substantial deviations from the RNA stabilities predicted by popular algorithms, and identify factors contributing to the observed deviations. We demonstrate the importance of quantitative, experimental tests of computational RNA structure predictions and present an approach that can be used to routinely test and improve the prediction accuracy.

Download Full-text

RNAStructViz: Graphical base pairing analysis

10.1101/2021.01.20.427505 ◽

2021 ◽

Author(s):

Maxie Dion Schmidt ◽

Anna Kirkpatrick ◽

Christine Heitsch

Keyword(s):

Secondary Structure ◽

Structure Analysis ◽

Rna Secondary Structure ◽

Central Feature ◽

Style File ◽

Base Pairing ◽

Circular Arc ◽

Secondary Structure Analysis ◽

Graphical Tool ◽

Similarities And Differences

AbstractSummaryWe present a new graphical tool for RNA secondary structure analysis. The central feature is the ability to visually compare/contrast up to three base pairing configurations for a given sequence in a compact, standardized circular arc diagram layout. This is complemented by a built-in CT-style file viewer and radial layout substructure viewer which are directly linked to the arc diagram window via the zoom selection tool. Additional functionality includes the computation of some numerical information, and the ability to export images and data for later use. This tool should be of use to researchers seeking to better understand similarities and differences between structural alternatives for an RNA sequence.Availability and implementationhttps://github.com/gtDMMB/RNAStructViz/wikiAuthor [email protected], [email protected], and [email protected]

Download Full-text

Impact of virus subtype and host IFNL4 genotype on large-scale RNA structure formation in the genome of hepatitis C virus

10.1101/2020.06.16.155150 ◽

2020 ◽

Cited By ~ 1

Author(s):

P. Simmonds ◽

L. Cuypers ◽

W.L. Irving ◽

J. McLauchlan ◽

G.S. Cooke ◽

...

Keyword(s):

Hepatitis C Virus ◽

Hepatitis C ◽

Rna Structure ◽

Large Scale ◽

Rna Virus ◽

Energy Difference ◽

Folding Energy ◽

Stem Loop ◽

Coding Regions ◽

Progressive Liver Disease

ABSTRACTMechanisms underlying the ability of hepatitis C virus (HCV) to establish persistent infections and induce progressive liver disease remain poorly understood. HCV is one of several positive-stranded RNA viruses capable of establishing persistence in their immunocompetent vertebrate hosts, an attribute associated with formation of large scale RNA structure in their genomic RNA. We developed novel methods to analyse and visualise genome-scale ordered RNA structure (GORS) predicted from the increasingly large datasets of complete genome sequences of HCV. Structurally conserved RNA secondary structure in coding regions of HCV localised exclusively to polyprotein ends (core, NS5B). Coding regions elsewhere were also intensely structured based on elevated minimum folding energy difference (MFED) values, but the actual stem-loop elements involved in genome folding were structurally entirely distinct, even between subtypes 1a and 1b. Dynamic remodelling was further evident from comparison of HCV strains in different host genetic background. Significantly higher MFED values, greater suppression of UpA dinucleotide frequencies and restricted diversification were found in subjects with the TT genotype of the rs12979860 SNP in the IFNL4 gene compared to the CC (non-expressing) allele. These structural and compositional associations with expression of interferon-λ4 were recapitulated on a larger scale by higher MFED values and greater UpA suppression of genotype 1 compared to genotype 3a, associated with previously reported HCV genotype-associated differences in hepatic interferon-stimulated gene induction. Associations between innate cellular responses with HCV structure and further evolutionary constraints represents an important new element in RNA virus evolution and the adaptive interplay between virus and host.

Download Full-text

RNA Secondary Structurome Revealed Distinct Thermoregulation in Plasmodium falciparum

Frontiers in Cell and Developmental Biology ◽

10.3389/fcell.2021.766532 ◽

2022 ◽

Vol 9 ◽

Author(s):

Yanwei Qi ◽

Yuhong Zhang ◽

Quankai Mu ◽

Guixing Zheng ◽

Mengxin Zhang ◽

...

Keyword(s):

Plasmodium Falciparum ◽

Secondary Structure ◽

Rna Structure ◽

Rna Secondary Structure ◽

Structural Changes ◽

Environmental Changes ◽

Parasite Development ◽

Temperature Dependent ◽

Trophozoite Stage ◽

Temperature Changes

The development of Plasmodium parasites, a causative agent of malaria, requests two hosts and the completion of 11 different parasite stages during development. Therefore, an efficient and fast response of parasites to various complex environmental changes, such as ambient temperature, pH, ions, and nutrients, is essential for parasite development and survival. Among many of these environmental changes, temperature is a decisive factor for parasite development and pathogenesis, including the thermoregulation of rRNA expression, gametogenesis, and parasite sequestration in cerebral malaria. However, the exact mechanism of how Plasmodium parasites rapidly respond and adapt to temperature change remains elusive. As a fundamental and pervasive regulator of gene expression, RNA structure can be a specific mechanism for fine tuning various biological processes. For example, dynamic and temperature-dependent changes in RNA secondary structures can control the expression of different gene programs, as shown by RNA thermometers. In this study, we applied the in vitro and in vivo transcriptomic-wide secondary structurome approach icSHAPE to measure parasite RNA structure changes with temperature alteration at single-nucleotide resolution for ring and trophozoite stage parasites. Among 3,000 probed structures at different temperatures, our data showed structural changes in the global transcriptome, such as S-type rRNA, HRPII gene, and the erythrocyte membrane protein family. When the temperature drops from 37°C to 26°C, most of the genes in the trophozoite stage cause significantly more changes to the RNA structure than the genes in the ring stage. A multi-omics analysis of transcriptome data from RNA-seq and RNA structure data from icSHAPE reveals that the specific RNA secondary structure plays a significant role in the regulation of transcript expression for parasites in response to temperature changes. In addition, we identified several RNA thermometers (RNATs) that responded quickly to temperature changes. The possible thermo-responsive RNAs in Plasmodium falciparum were further mapped. To this end, we identified dynamic and temperature-dependent RNA structural changes in the P. falciparum transcriptome and performed a comprehensive characterization of RNA secondary structures over the course of temperature stress in blood stage development. These findings not only contribute to a better understanding of the function of the RNA secondary structure but may also provide novel targets for efficient vaccines or drugs.

Download Full-text

DSARna: RNA Secondary Structure Alignment Based on Digital Sequence Representation

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207323666200811100338 ◽

2020 ◽

Vol 23 ◽

Author(s):

Longjian Gao ◽

Chengzhen Xu ◽

Wangan Song ◽

Feng Xiao ◽

Xiaomin Wu ◽

...

Keyword(s):

Secondary Structure ◽

Rna Structure ◽

Rna Secondary Structure ◽

High Throughput Sequencing ◽

Structural Information ◽

Dynamic Programming Algorithm ◽

Structure Alignment ◽

Programming Algorithm ◽

Structure Information ◽

Alignment Algorithms

Background: With increasing applications and development of high-throughput sequencing, knowledge of the primary structure of RNA has expanded exponentially. Moreover, the function of RNA is determined by the secondary or higher RNA structure, and similar structures are related to similar functions, such as the secondary clover structure of tRNA. Therefore, RNA structure alignment is an important subject in computational biology and bioinformatics to accurately predict function. However, the traditional RNA structure alignment algorithms have some drawbacks such as high complexity and easy loss of secondary structure information. Objective: To study RNA secondary structure alignment according to the shortcomings of existing secondary structure alignment algorithms and the characteristics of RNA secondary structure. Method: We propose a new digital sequence RNA structure representation algorithm named “DSARna” . Then based on a dynamic programming algorithm, the scoring matrix and binary path matrix are simultaneously constructed. The backtracking path is identified in the path matrix, and the optimal result is predicted according to the path length. Conclusions: Upon comparison with the existing SimTree algorithm through experimental analysis, the proposed method showed higher accuracy and could ensure that the structural information is not easily lost in terms of improved specificity, sensitivity, and the Matthews correlation coefficient.

Download Full-text

Prediction of RNA Secondary Structure Using Quantum-inspired Genetic Algorithms

Current Bioinformatics ◽

10.2174/1574893614666190916154103 ◽

2020 ◽

Vol 15 (2) ◽

pp. 135-143

Author(s):

Sha Shi ◽

Xin-Li Zhang ◽

Le Yang ◽

Wei Du ◽

Xian-Li Zhao ◽

...

Keyword(s):

Quantum Computing ◽

Secondary Structure ◽

Rna Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Prediction Accuracy ◽

Secondary Structure Prediction ◽

Minimum Free Energy ◽

Rna Sequences ◽

New Strategy

Background: The prediction of RNA secondary structure using optimization algorithms is key to understand the real structure of an RNA. Evolutionary algorithms (EAs) are popular strategies for RNA secondary structure prediction. However, compared to most state-of-the-art software based on DPAs, the performances of EAs are a bit far from satisfactory. Objective: Therefore, a more powerful strategy is required to improve the performances of EAs when applied to the prediciton of RNA secondary structures. Methods: The idea of quantum computing is introduced here yielding a new strategy to find all possible legal paired-bases with the constraint of minimum free energy. The sate of a stem pool with size N is encoded as a population of QGA, which is represented by N quantum bits but not classical bits. The updating of populations is accomplished by so-called quantum crossover operations, quantum mutation operations and quantum rotation operations. Results: The numerical results show that the performances of traditional EAs are significantly improved by using QGA with regard to not only prediction accuracy and sensitivity but also complexity. Moreover, for RNA sequences with middle-short length, QGA even improves the state-of-art software based on DPAs in terms of both prediction accuracy and sensitivity. Conclusion: This work sheds an interesting light on the applications of quantum computing on RNA structure prediction.

Download Full-text