Estimating RNA structure chemical probing reactivities from reverse transcriptase stops and mutations

ABSTRACTChemical probing experiments interrogate RNA structures by creating covalent adducts on RNA molecules in structure-dependent patterns. Adduct positions are then detected through conversion of the modified RNAs into complementary DNA (cDNA) by reverse transcription (RT) as either stops (RT-stops) or mutations (RT-mutations). Statistical analysis of the frequencies of RT-stops and RT-mutations can then be used to estimate a measure of chemical probing reactivity at each nucleotide of an RNA, which reveals properties of the underlying RNA structure. Inspired by recent work that showed that different reverse transcriptase enzymes show distinct biases for detecting adducts as either RT-stops or RT-mutations, here we use a statistical modeling framework to derive an equation for chemical probing reactivity using experimental signatures from both RT-stops and RT-mutations within a single experiment. The resulting formula intuitively matches the expected result from considering reactivity to be defined as the fraction of adduct observed at each position in an RNA at the end of a chemical probing experiment. We discuss assumptions and implementation of the model, as well as ways in which the model may be experimentally validated.

Download Full-text

RNAStat: An Integrated Tool for Statistical Analysis of RNA 3D Structures

Frontiers in Bioinformatics ◽

10.3389/fbinf.2021.809082 ◽

2022 ◽

Vol 1 ◽

Author(s):

Zhi-Hao Guo ◽

Li Yuan ◽

Ya-Lan Tan ◽

Ben-Gong Zhang ◽

Ya-Zhou Shi

Keyword(s):

Statistical Analysis ◽

Rna Structure ◽

Structure Prediction ◽

Local Coordinate System ◽

3D Structure ◽

Scoring Function ◽

Rna Structures ◽

Cellular Functions ◽

3D Structures ◽

Structure Annotation

The 3D architectures of RNAs are essential for understanding their cellular functions. While an accurate scoring function based on the statistics of known RNA structures is a key component for successful RNA structure prediction or evaluation, there are few tools or web servers that can be directly used to make comprehensive statistical analysis for RNA 3D structures. In this work, we developed RNAStat, an integrated tool for making statistics on RNA 3D structures. For given RNA structures, RNAStat automatically calculates RNA structural properties such as size and shape, and shows their distributions. Based on the RNA structure annotation from DSSR, RNAStat provides statistical information of RNA secondary structure motifs including canonical/non-canonical base pairs, stems, and various loops. In particular, the geometry of base-pairing/stacking can be calculated in RNAStat by constructing a local coordinate system for each base. In addition, RNAStat also supplies the distribution of distance between any atoms to the users to help build distance-based RNA statistical potentials. To test the usability of the tool, we established a non-redundant RNA 3D structure dataset, and based on the dataset, we made a comprehensive statistical analysis on RNA structures, which could have the guiding significance for RNA structure modeling. The python code of RNAStat, the dataset used in this work, and corresponding statistical data files are freely available at GitHub (https://github.com/RNA-folding-lab/RNAStat).

Download Full-text

Cotranscriptional kinetic folding of RNA secondary structures

10.1101/2020.07.10.196972 ◽

2020 ◽

Author(s):

Vo Hong Thanh ◽

Pekka Orponen

Keyword(s):

Structure Formation ◽

Rna Structure ◽

Rna Folding ◽

Computational Prediction ◽

Simulation Method ◽

Rna Structures ◽

Rna Molecules ◽

Primitive Operation ◽

Nucleotide Resolution ◽

Kinetics Of

Computational prediction of RNA structures is an important problem in computational structural biology. Studies of RNA structure formation often assume that the process starts from a fully synthesized sequence. Experimental evidence, however, has shown that RNA folds concurrently with its elongation. We investigate RNA structure formation, taking into account also the cotranscriptional effects. We propose a single-nucleotide resolution kinetic model of the folding process of RNA molecules, where the polymerase-driven elongation of an RNA strand by a new nucleotide is included as a primitive operation, together with a stochastic simulation method that implements this folding concurrently with the transcriptional synthesis. Numerical case studies show that our cotranscriptional RNA folding model can predict the formation of metastable conformations that are favored in actual biological systems. Our new computational tool can thus provide quantitative predictions and offer useful insights into the kinetics of RNA folding.

Download Full-text

A novel SHAPE reagent enables the analysis of RNA structure in living cells with unprecedented accuracy

Nucleic Acids Research ◽

10.1093/nar/gkaa1255 ◽

2021 ◽

Author(s):

Tycho Marinus ◽

Adam B Fessler ◽

Craig A Ogle ◽

Danny Incarnato

Keyword(s):

Rna Structure ◽

Structure Prediction ◽

Critical Role ◽

Living Cells ◽

Pathological Process ◽

Rna Structures ◽

Rna Molecules ◽

Derived Data

Abstract Due to the mounting evidence that RNA structure plays a critical role in regulating almost any physiological as well as pathological process, being able to accurately define the folding of RNA molecules within living cells has become a crucial need. We introduce here 2-aminopyridine-3-carboxylic acid imidazolide (2A3), as a general probe for the interrogation of RNA structures in vivo. 2A3 shows moderate improvements with respect to the state-of-the-art selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) reagent NAI on naked RNA under in vitro conditions, but it significantly outperforms NAI when probing RNA structure in vivo, particularly in bacteria, underlining its increased ability to permeate biological membranes. When used as a restraint to drive RNA structure prediction, data derived by SHAPE-MaP with 2A3 yields more accurate predictions than NAI-derived data. Due to its extreme efficiency and accuracy, we can anticipate that 2A3 will rapidly take over conventional SHAPE reagents for probing RNA structures both in vitro and in vivo.

Download Full-text

Distributed Biotin-Streptavidin Transcription Roadblocks for Mapping Cotranscriptional RNA Folding

10.1101/100073 ◽

2017 ◽

Author(s):

Eric J. Strobel ◽

Kyle E. Watters ◽

Julius B. Lucks

Keyword(s):

Experimental Study ◽

Rna Structure ◽

Rna Folding ◽

Folding Pathway ◽

Decision Making Process ◽

Important Process ◽

Rna Structures ◽

Rna Molecules ◽

Fundamental Properties ◽

Nucleotide Resolution

AbstractRNA molecules fold cotranscriptionally as they emerge from RNA polymerase. Cotranscriptional folding is an important process for proper RNA structure formation as the order of folding can determine an RNA molecule’s structure, and thus its functional properties. Despite its fundamental importance, the experimental study of RNA cotranscriptional folding has been limited by the lack of easily approachable methods that can interrogate nascent RNA structures at nucleotide resolution during transcription. We previously developed cotranscriptional selective 2’-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-seq) to simultaneously probe all of the intermediate structures an RNA molecule transitions through during transcription elongation. Here, we improve the broad applicability of cotranscriptional SHAPE-Seq by developing a sequence-independent streptavidin roadblocking strategy to simplify the preparation of roadblocking transcription templates. We determine the fundamental properties of streptavidin roadblocks and show that randomly distributed streptavidin roadblocks can be used in cotranscriptional SHAPE-Seq experiments to measure the Bacillus cereus crcB fluoride riboswitch folding pathway. Comparison of EcoRIE111Q and streptavidin roadblocks in cotranscriptional SHAPE-Seq data shows that both strategies identify the same RNA structural transitions related to the riboswitch decision-making process. Finally, we propose guidelines to leverage the complementary strengths of each transcription roadblock for use in studying cotranscriptional folding.

Download Full-text

SHAPER: A Web Server for Fast and Accurate SHAPE Reactivity Prediction

Frontiers in Molecular Biosciences ◽

10.3389/fmolb.2021.721955 ◽

2021 ◽

Vol 8 ◽

Author(s):

Yuanzhe Zhou ◽

Jun Li ◽

Travis Hurst ◽

Shi-Jie Chen

Keyword(s):

Rna Structure ◽

Structural Information ◽

3D Structure ◽

Web Server ◽

Rna Structures ◽

Structure Selection ◽

Chemical Probing ◽

Local Flexibility ◽

Multiple Conformations ◽

Shape Data

Selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) chemical probing serves as a convenient and efficient experiment technique for providing information about RNA local flexibility. The local structural information contained in SHAPE reactivity data can be used as constraints in 2D/3D structure predictions. Here, we present SHAPE predictoR (SHAPER), a web server for fast and accurate SHAPE reactivity prediction. The main purpose of the SHAPER web server is to provide a portal that uses experimental SHAPE data to refine 2D/3D RNA structure selection. Input structures for the SHAPER server can be obtained through experimental or computational modeling. The SHAPER server can accept RNA structures with single or multiple conformations, and the predicted SHAPE profile and correlation with experimental SHAPE data (if provided) for each conformation can be freely downloaded through the web portal. The SHAPER web server is available at http://rna.physics.missouri.edu/shaper/.

Download Full-text

A novel SHAPE reagent enables the analysis of RNA structure in living cells with unprecedented accuracy

10.1101/2020.08.31.274761 ◽

2020 ◽

Author(s):

Tycho Marinus ◽

Adam B. Fessler ◽

Craig A. Ogle ◽

Danny Incarnato

Keyword(s):

Rna Structure ◽

Structure Prediction ◽

Critical Role ◽

Living Cells ◽

Pathological Process ◽

Rna Structures ◽

Rna Molecules ◽

Derived Data

ABSTRACTDue to the mounting evidence that RNA structure plays a critical role in regulating almost any physiological as well as pathological process, being able to accurately define the folding of RNA molecules within living cells has become a crucial need. We introduce here 2-aminopyridine-3-carboxylic acid imidazolide (2A3), as a general probe for the interrogation of RNA structures in vivo. 2A3 shows moderate improvements with respect to the state-of-the-art SHAPE reagent NAI on naked RNA under in vitro conditions, but it significantly outperforms NAI when probing RNA structure in vivo, particularly in bacteria, underlining its increased ability to permeate biological membranes. When used as a restraint to drive RNA structure prediction, data derived by SHAPE-MaP with 2A3 yields more accurate predictions than NAI-derived data. Due to its extreme efficiency and accuracy, we can anticipate that 2A3 will rapidly take over conventional SHAPE reagents for probing RNA structures both in vitro and in vivo.

Download Full-text

EVALUATING MIXTURE MODELS FOR BUILDING RNA KNOWLEDGE-BASED POTENTIALS

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720012410107 ◽

2012 ◽

Vol 10 (02) ◽

pp. 1241010 ◽

Cited By ~ 7

Author(s):

ADELENE Y. L. SIM ◽

OLIVIER SCHWANDER ◽

MICHAEL LEVITT ◽

JULIE BERNAUER

Keyword(s):

Mixture Models ◽

Dirichlet Process ◽

Rna Structure ◽

Structural Information ◽

A Priori ◽

Comparable Efficacy ◽

Rna Structures ◽

Rna Molecules ◽

Knowledge Based ◽

Spline Fitting

Ribonucleic acid (RNA) molecules play important roles in a variety of biological processes. To properly function, RNA molecules usually have to fold to specific structures, and therefore understanding RNA structure is vital in comprehending how RNA functions. One approach to understanding and predicting biomolecular structure is to use knowledge-based potentials built from experimentally determined structures. These types of potentials have been shown to be effective for predicting both protein and RNA structures, but their utility is limited by their significantly rugged nature. This ruggedness (and hence the potential's usefulness) depends heavily on the choice of bin width to sort structural information (e.g. distances) but the appropriate bin width is not known a priori. To circumvent the binning problem, we compared knowledge-based potentials built from inter-atomic distances in RNA structures using different mixture models (Kernel Density Estimation, Expectation Minimization and Dirichlet Process). We show that the smooth knowledge-based potential built from Dirichlet process is successful in selecting native-like RNA models from different sets of structural decoys with comparable efficacy to a potential developed by spline-fitting — a commonly taken approach — to binned distance histograms. The less rugged nature of our potential suggests its applicability in diverse types of structural modeling.

Download Full-text

RNAProbe: a web server for normalization and analysis of RNA structure probing data

Nucleic Acids Research ◽

10.1093/nar/gkaa396 ◽

2020 ◽

Vol 48 (W1) ◽

pp. W292-W299 ◽

Cited By ~ 2

Author(s):

Tomasz K Wirecki ◽

Katarzyna Merdas ◽

Agata Bernat ◽

Michał J Boniecki ◽

Janusz M Bujnicki ◽

...

Keyword(s):

Secondary Structure ◽

Rna Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Structural Characteristics ◽

Web Server ◽

Rna Molecules ◽

Chemical Probing ◽

Structure Probing ◽

Low Pass

Abstract RNA molecules play key roles in all living cells. Knowledge of the structural characteristics of RNA molecules allows for a better understanding of the mechanisms of their action. RNA chemical probing allows us to study the susceptibility of nucleotides to chemical modification, and the information obtained can be used to guide secondary structure prediction. These experimental results can be analyzed using various computational tools, which, however, requires additional, tedious steps (e.g., further normalization of the reactivities and visualization of the results), for which there are no fully automated methods. Here, we introduce RNAProbe, a web server that facilitates normalization, analysis, and visualization of the low-pass SHAPE, DMS and CMCT probing results with the modification sites detected by capillary electrophoresis. RNAProbe automatically analyzes chemical probing output data and turns tedious manual work into a one-minute assignment. RNAProbe performs normalization based on a well-established protocol, utilizes recognized secondary structure prediction methods, and generates high-quality images with structure representations and reactivity heatmaps. It summarizes the results in the form of a spreadsheet, which can be used for comparative analyses between experiments. Results of predictions with normalized reactivities are also collected in text files, providing interoperability with bioinformatics workflows. RNAProbe is available at https://rnaprobe.genesilico.pl.

Download Full-text

Computational modeling of RNA 3D structure based on experimental data

Bioscience Reports ◽

10.1042/bsr20180430 ◽

2019 ◽

Vol 39 (2) ◽

Cited By ~ 13

Author(s):

Almudena Ponce-Salvatierra ◽

Astha ◽

Katarzyna Merdas ◽

Chandran Nithin ◽

Pritha Ghosh ◽

...

Keyword(s):

Experimental Data ◽

Computational Methods ◽

Rna Structure ◽

Structure Prediction ◽

3D Structure ◽

Rna Structures ◽

Data Types ◽

Rna Sequences ◽

Rna Molecules

Abstract RNA molecules are master regulators of cells. They are involved in a variety of molecular processes: they transmit genetic information, sense cellular signals and communicate responses, and even catalyze chemical reactions. As in the case of proteins, RNA function is dictated by its structure and by its ability to adopt different conformations, which in turn is encoded in the sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore the majority of known RNAs remain structurally uncharacterized. To address this problem, predictive computational methods were developed based on the accumulated knowledge of RNA structures determined so far, the physical basis of the RNA folding, and taking into account evolutionary considerations, such as conservation of functionally important motifs. However, all theoretical methods suffer from various limitations, and they are generally unable to accurately predict structures for RNA sequences longer than 100-nt residues unless aided by additional experimental data. In this article, we review experimental methods that can generate data usable by computational methods, as well as computational approaches for RNA structure prediction that can utilize data from experimental analyses. We outline methods and data types that can be potentially useful for RNA 3D structure modeling but are not commonly used by the existing software, suggesting directions for future development.

Download Full-text

Machine learning a model for RNA structure prediction

NAR Genomics and Bioinformatics ◽

10.1093/nargab/lqaa090 ◽

2020 ◽

Vol 2 (4) ◽

Author(s):

Nicola Calonaci ◽

Alisha Jones ◽

Francesca Cuturello ◽

Michael Sattler ◽

Giovanni Bussi

Keyword(s):

Free Energy ◽

Rna Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Structural Information ◽

Minimum Free Energy ◽

Experimental Information ◽

Rna Structures ◽

Chemical Probing ◽

Validation Set

Abstract RNA function crucially depends on its structure. Thermodynamic models currently used for secondary structure prediction rely on computing the partition function of folding ensembles, and can thus estimate minimum free-energy structures and ensemble populations. These models sometimes fail in identifying native structures unless complemented by auxiliary experimental data. Here, we build a set of models that combine thermodynamic parameters, chemical probing data (DMS and SHAPE) and co-evolutionary data (direct coupling analysis) through a network that outputs perturbations to the ensemble free energy. Perturbations are trained to increase the ensemble populations of a representative set of known native RNA structures. In the chemical probing nodes of the network, a convolutional window combines neighboring reactivities, enlightening their structural information content and the contribution of local conformational ensembles. Regularization is used to limit overfitting and improve transferability. The most transferable model is selected through a cross-validation strategy that estimates the performance of models on systems on which they are not trained. With the selected model we obtain increased ensemble populations for native structures and more accurate predictions in an independent validation set. The flexibility of the approach allows the model to be easily retrained and adapted to incorporate arbitrary experimental information.

Download Full-text