scholarly journals Secondary Structure Prediction for RNA Sequences Including N6-methyladenosine

2021 ◽  
Author(s):  
Elzbieta Kierzek ◽  
Xiaoju Zhang ◽  
Richard M. Watson ◽  
Ryszard Kierzek ◽  
David H. Mathews

AbstractThere is increasing interest in the roles played by covalently modified nucleotides in mRNAs and non-coding RNAs. New high-throughput sequencing technologies localize these modifications to exact nucleotide positions. There has been, however, and inability to account for these modifications in secondary structure prediction because of a lack of software tools for handling modifications and a lack of thermodynamic parameters for modifications. Here, we report that we solved these issues for N6-methyladenosine (m6A), for the first time allowing secondary structure prediction for a nucleotide alphabet of A, C, G, U, and m6A. We revised the RNAstructure software package to work with any user-defined alphabet of nucleotides. We also developed a set of nearest neighbor parameters for helices and loops containing m6A, using a set of 45 optical melting experiments. Interestingly, N6-methylation decreases the folding stability of structures with adenosines in the middle of a helix, has little effect on the folding stability of adenosines at the ends of helices, and stabilizes the folding stability for structures with unpaired adenosines stacked on the end of a helix. The parameters were tested against an additional two melting experiments, including a consensus sequence for methylation and an m6A dangling end. The utility of the new software was tested using predictions of the structure of a molecular switch in the MALAT1 lncRNA, for which a conformation change is triggered by methylation. Additionally, human transcriptome-wide calculations for the effect of N6-methylation on the probability of an adenosine being buried in a helix compare favorably with PARS structure mapping data. Now users of RNAstructure are able to develop hypothesis for structure-function relationships for RNAs with m6A, including conformational switching triggered by methylation.

RNA ◽  
2010 ◽  
Vol 16 (6) ◽  
pp. 1108-1117 ◽  
Author(s):  
S. Quarrier ◽  
J. S. Martin ◽  
L. Davis-Neulander ◽  
A. Beauregard ◽  
A. Laederach

2010 ◽  
Vol 08 (05) ◽  
pp. 867-884 ◽  
Author(s):  
YUZHONG ZHAO ◽  
BABAK ALIPANAHI ◽  
SHUAI CHENG LI ◽  
MING LI

Accurate determination of protein secondary structure from the chemical shift information is a key step for NMR tertiary structure determination. Relatively few work has been done on this subject. There needs to be a systematic investigation of algorithms that are (a) robust for large datasets; (b) easily extendable to (the dynamic) new databases; and (c) approaching to the limit of accuracy. We introduce new approaches using k-nearest neighbor algorithm to do the basic prediction and use the BCJR algorithm to smooth the predictions and combine different predictions from chemical shifts and based on sequence information only. Our new system, SUCCES, improves the accuracy of all existing methods on a large dataset of 805 proteins (at 86% Q3 accuracy and at 92.6% accuracy when the boundary residues are ignored), and it is easily extendable to any new dataset without requiring any new training. The software is publicly available at .


2019 ◽  
Author(s):  
Winston R. Becker ◽  
Inga Jarmoskaite ◽  
Kalli Kappel ◽  
Pavanapuresan P. Vaidyanathan ◽  
Sarah K. Denny ◽  
...  

AbstractNearest-neighbor (NN) rules provide a simple and powerful quantitative framework for RNA structure prediction that is strongly supported for canonical Watson-Crick duplexes from a plethora of thermodynamic measurements. Predictions of RNA secondary structure based on nearest-neighbor (NN) rules are routinely used to understand biological function and to engineer and control new functions in biotechnology. However, NN applications to RNA structural features such as internal and terminal loops rely on approximations and assumptions, with sparse experimental coverage of the vast number of possible sequence and structural features. To test to what extent NN rules accurately predict thermodynamic stabilities across RNAs with non-WC features, we tested their predictions using a quantitative high-throughput assay platform, RNA-MaP. Using a thermodynamic assay with coupled protein binding, we carried out equilibrium measurements for over 1000 RNAs with a range of predicted secondary structure stabilities. Our results revealed substantial scatter and systematic deviations between NN predictions and observed stabilities. Solution salt effects and incorrect or omitted loop parameters contribute to these observed deviations. Our results demonstrate the need to independently and quantitatively test NN computational algorithms to identify their capabilities and limitations. RNA-MaP and related approaches can be used to test computational predictions and can be adapted to obtain experimental data to improve RNA secondary structure and other prediction algorithms.Significance statementRNA secondary structure prediction algorithms are routinely used to understand, predict and design functional RNA structures in biology and biotechnology. Given the vast number of RNA sequence and structural features, these predictions rely on a series of approximations, and independent tests are needed to quantitatively evaluate the accuracy of predicted RNA structural stabilities. Here we measure the stabilities of over 1000 RNA constructs by using a coupled protein binding assay. Our results reveal substantial deviations from the RNA stabilities predicted by popular algorithms, and identify factors contributing to the observed deviations. We demonstrate the importance of quantitative, experimental tests of computational RNA structure predictions and present an approach that can be used to routinely test and improve the prediction accuracy.


Microbiology ◽  
2000 ◽  
Vol 81 (1) ◽  
pp. 257-266 ◽  
Author(s):  
Ulrich Melcher

Relationships among the amino acid sequences of viral movement proteins related to the 30 kDa (‘30K’) movement protein of tobacco mosaic virus – the 30K superfamily – were explored. Sequences were grouped into 18 families. A comparison of secondary structure predictions for each family revealed a common predicted core structure flanked by variable N- and C-terminal domains. The core consisted of a series of β-elements flanked by an α-helix on each end. Consensus sequences for each of the families were generated and aligned with one another. From this alignment an overall secondary structure prediction was generated and a consensus sequence that can recognize each family in database searches was obtained. The analysis led to criteria that were used to evaluate other virus-encoded proteins for possible membership of the 30K superfamily. A rhabdoviral and a tenuiviral protein were identified as 30K superfamily members, as were plant-encoded phloem proteins. Parsimony analysis grouped tubule-forming movement proteins separate from others. Establishment of the alignment of residues of diverse families facilitates comparison of mutagenesis experiments done on different movement proteins and should serve as a guide for further such experiments.


Sign in / Sign up

Export Citation Format

Share Document