Developing an Updated Strategy for Estimating the Free-Energy Parameters in RNA Duplexes

Mesoscopic models can be used to describe the thermodynamic properties of RNA duplexes. Parametrizing such models with experimental melting temperatures can provide important insights into hydrogen bonding and stacking interactions, as has been done for high sodium concentrations. However, an RNA parametrization for lower salt concentrations is still missing because of the limited amount of published melting temperature data. While the Peyrard-Bishop (PB) parametrization was found to be largely independent of strand concentration, it requires that all melting temperatures be provided at the same strand concentration. Here we adapted the PB model to handle multiple strand concentrations, which allowed us to use an experimental set of melting temperatures to model the hydrogen bond and stacking interactions at low and intermediate sodium concentrations. For the parametrizations we distinguish between terminal and internal base pairs, and the resulting potentials were qualitatively similar to those we obtained previously for DNA. The main difference from the DNA parameters was the Morse potential at low sodium concentrations for terminal r(AU), which is stronger than that for d(AT), suggesting higher hydrogen bond strength.
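As a rough illustration of the kind of energy terms the abstract refers to, the sketch below evaluates a Morse potential for each base pair plus a coupling between neighbouring pairs. It is a minimal sketch, not the authors' parametrization: the harmonic stacking term is a simplifying assumption, and the function names and all numerical values (`depth`, `width`, `k`) are hypothetical placeholders.

```python
import math

def morse(y, depth, width):
    """Morse potential V(y) = depth * (exp(-y / width) - 1)^2 for a base-pair stretch y."""
    return depth * (math.exp(-y / width) - 1.0) ** 2

def stacking(y_a, y_b, k):
    """Harmonic coupling between neighbouring base pairs (a simplifying assumption)."""
    return 0.5 * k * (y_a - y_b) ** 2

def duplex_energy(stretches, depths, width, k):
    """Sum of on-site Morse terms and nearest-neighbour stacking terms."""
    on_site = sum(morse(y, d, width) for y, d in zip(stretches, depths))
    coupling = sum(stacking(a, b, k) for a, b in zip(stretches[1:], stretches[:-1]))
    return on_site + coupling

# Hypothetical values: terminal pairs get their own Morse depth,
# mirroring the terminal/internal distinction made in the abstract.
terminal_depth, internal_depth = 0.03, 0.05
depths = [terminal_depth, internal_depth, internal_depth, terminal_depth]
energy = duplex_energy([0.1, 0.05, 0.05, 0.1], depths, width=0.3, k=2.0)
```

In this form a larger `depth` means a stronger effective hydrogen bond, which is the sense in which the abstract compares terminal r(AU) with d(AT).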


Salt Dependent Mesoscopic Model for RNA with Multiple Strand Concentrations

10.26434/chemrxiv.13271594 ◽ 2020

Author(s): Izabela Ferreira ◽ Tauanne Dias Amarante ◽ Gerald Weber

Keyword(s): Hydrogen Bond ◽ Stacking Interactions ◽ Base Pairs ◽ High Sodium ◽ Melting Temperatures ◽ Mesoscopic Models ◽ Low Sodium ◽ RNA Duplexes ◽ Morse Potentials ◽ Experimental Melting


Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function

Journal of Computational Chemistry ◽ 10.1002/(sici)1096-987x(19981115)19:14<1639::aid-jcc10>3.0.co;2-b ◽ 1998 ◽ Vol 19 (14) ◽ pp. 1639-1662 ◽ Cited By ~ 6130

Author(s): Garrett M. Morris ◽ David S. Goodsell ◽ Robert S. Halliday ◽ Ruth Huey ◽ William E. Hart ◽ ...

Keyword(s): Genetic Algorithm ◽ Free Energy ◽ Energy Function ◽ Binding Free Energy ◽ Free Energy Function ◽ Lamarckian Genetic Algorithm ◽ Automated Docking


Hybrid models for suspended sediment prediction: optimized random forest and multi-layer perceptron through genetic algorithm and stochastic gradient descent methods

Neural Computing and Applications ◽ 10.1007/s00521-021-06550-1 ◽ 2021

Author(s): Saeed Samadianfard ◽ Katayoun Kargar ◽ Sadra Shadkani ◽ Sajjad Hashemi ◽ Akram Abbaspour ◽ ...

Keyword(s): Genetic Algorithm ◽ Random Forest ◽ Suspended Sediment ◽ Gradient Descent ◽ Hybrid Models ◽ Stochastic Gradient ◽ Stochastic Gradient Descent ◽ Descent Methods ◽ Multi Layer Perceptron ◽ Gradient Descent Methods


Prospects for recurrent neural network models to learn RNA biophysics from high-throughput data

10.1101/227611 ◽ 2017

Author(s): Michelle J Wu ◽ Johan OL Andreasson ◽ Wipapat Kladwang ◽ William J Greenleaf ◽ Rhiju Das ◽ ...

Keyword(s): Neural Network ◽ Neural Networks ◽ Free Energy ◽ Recurrent Neural Network ◽ Neural Network Model ◽ Data Augmentation ◽ RNA Folding ◽ Large Datasets ◽ Turner Model ◽ RNA Complexes

Abstract: RNA is a functionally versatile molecule that plays key roles in genetic regulation and in emerging technologies to control biological processes. Computational models of RNA secondary structure are well-developed but often fall short in making quantitative predictions of the behavior of multi-RNA complexes. Recently, large datasets characterizing hundreds of thousands of individual RNA complexes have emerged as rich sources of information about RNA energetics. Meanwhile, advances in machine learning have enabled the training of complex neural networks from large datasets. Here, we assess whether a recurrent neural network model, Ribonet, can learn from high-throughput binding data, using simulation and experimental studies to test model accuracy and to determine whether the model learned meaningful information about the biophysics of RNA folding. We began by evaluating the model on energetic values predicted by the Turner model to assess whether the neural network could learn a representation that recovered known biophysical principles. First, we trained Ribonet to predict the simulated free energy of an RNA in complex with multiple input RNAs. Our model accurately predicts free energies of new sequences and also shows evidence of having learned base pairing information, as assessed by in silico double mutant analysis. Next, we extended this model to predict the simulated affinity between an arbitrary RNA sequence and a reporter RNA. While these more indirect measurements precluded the learning of basic principles of RNA biophysics, the resulting model achieved sub-kcal/mol accuracy and enabled design of simple RNA input responsive riboswitches with high activation ratios predicted by the Turner model from which the training data were generated. Finally, we compiled and trained on an experimental dataset comprising over 600,000 experimental affinity measurements published on the Eterna open laboratory. Though our tests revealed that the model likely did not learn a physically realistic representation of RNA interactions, it nevertheless achieved good performance of 0.76 kcal/mol on test sets with the application of transfer learning and novel sequence-specific data augmentation strategies. These results suggest that recurrent neural network architectures, despite being naïve to the physics of RNA folding, have the potential to capture complex biophysical information. However, more diverse datasets, ideally involving more direct free energy measurements, may be necessary to train de novo predictive models that are consistent with the fundamentals of RNA biophysics.

Author Summary: The precise design of RNA interactions is essential to gaining greater control over RNA-based biotechnology tools, including designer riboswitches and CRISPR-Cas9 gene editing. However, the classic model for energetics governing these interactions fails to quantitatively predict the behavior of RNA molecules. We developed a recurrent neural network model, Ribonet, to quantitatively predict these values from sequence alone. Using simulated data, we show that this model is able to learn simple base pairing rules, despite having no a priori knowledge about RNA folding encoded in the network architecture. This model also enables design of new switching RNAs that are predicted to be effective by the “ground truth” simulated model. We applied transfer learning to retrain Ribonet using hundreds of thousands of RNA-RNA affinity measurements and demonstrate simple data augmentation techniques that improve model performance. At the same time, the limited diversity of currently available data constrains Ribonet’s accuracy. Recurrent neural networks are a promising tool for modeling nucleic acid biophysics and may enable design of complex RNAs for novel applications.


The thermal stability of RNA duplexes containing modified base pairs placed at internal and terminal positions of the oligoribonucleotides

Biophysical Chemistry ◽ 10.1016/s0301-4622(02)00074-1 ◽ 2002 ◽ Vol 97 (2-3) ◽ pp. 233-241 ◽ Cited By ~ 14

Author(s): Krzysztof Ziomek ◽ Elżbieta Kierzek ◽ Ewa Biała ◽ Ryszard Kierzek

Keyword(s): Thermal Stability ◽ Base Pairs ◽ RNA Duplexes ◽ Thermal Stability Of


Updated Free Energy Parameters Increase MicroRNA Prediction Performance

IFMBE Proceedings - World Congress on Medical Physics and Biomedical Engineering, June 7-12, 2015, Toronto, Canada ◽ 10.1007/978-3-319-19387-8_359 ◽ 2015 ◽ pp. 1477-1480

Author(s): R. J. Peace ◽ J. R. Green

Keyword(s): Free Energy ◽ Prediction Performance ◽ Energy Parameters


Some linear free energy parameters

Free Energy Relationships in Organic and Bio-Organic Chemistry ◽ 10.1039/9781847550927-00282 ◽ 2007 ◽ pp. 282-290

Keyword(s): Free Energy ◽ Energy Parameters ◽ Linear Free Energy


Multi-Frequency Matched-Field Inversion of Benchmark Data Using a Genetic Algorithm

Journal of Computational Acoustics ◽ 10.1142/s0218396x98000119 ◽ 1998 ◽ Vol 06 (01n02) ◽ pp. 135-150 ◽ Cited By ~ 10

Author(s): D. G. Simons ◽ M. Snellen

Keyword(s): Genetic Algorithm ◽ Test Cases ◽ Data Sets ◽ True Parameter ◽ Benchmark Data ◽ Line Array ◽ Standard Normal ◽ Water Test ◽ Parameter Values ◽ Field Inversion

For a selected number of shallow water test cases of the 1997 Geoacoustic Inversion Workshop we have applied Matched-Field Inversion to determine the geoacoustic and geometric (source location, water depth) parameters. A genetic algorithm has been applied for performing the optimization, whereas the replica fields have been calculated using a standard normal-mode model. The energy function to be optimized is based on the incoherent multi-frequency Bartlett processor. We have used the data sets provided at a few frequencies in the band 25–500 Hz for a vertical line array positioned at 5 km from the source. A comparison between the inverted and true parameter values is made.
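To make the energy function mentioned in this abstract more concrete, the sketch below computes a normalized Bartlett power per frequency and averages it incoherently, that is, after taking magnitudes. It is only an illustration under stated assumptions: the normalization, the `1 - mean power` form of the mismatch, and the function names are choices made here, not details taken from the paper. The replica fields would come from a propagation model, which is not shown.

```python
def bartlett_power(data, replica):
    """Normalized Bartlett power |w^H d|^2 / (|w|^2 |d|^2), between 0 and 1."""
    inner = sum(r.conjugate() * d for r, d in zip(replica, data))
    norm = sum(abs(r) ** 2 for r in replica) * sum(abs(d) ** 2 for d in data)
    return abs(inner) ** 2 / norm

def incoherent_mismatch(data_by_freq, replica_by_freq):
    """Mismatch = 1 - mean Bartlett power over frequencies (phases are not combined)."""
    powers = [bartlett_power(d, r) for d, r in zip(data_by_freq, replica_by_freq)]
    return 1.0 - sum(powers) / len(powers)

# Toy array data at two frequencies (hypothetical complex pressures on two hydrophones).
measured = [[1 + 1j, 2 - 1j], [0.5j, 1 + 0j]]
perfect = incoherent_mismatch(measured, measured)               # replicas equal the data
worst = incoherent_mismatch([[1 + 0j, 0j]], [[0j, 1 + 0j]])     # orthogonal fields
```

An optimizer such as a genetic algorithm would then search the geoacoustic and geometric parameters for the replica set that minimizes this mismatch.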


Application and Need-Based Architecture Design of Deep Neural Networks

International Journal of Pattern Recognition and Artificial Intelligence ◽ 10.1142/s021800142052014x ◽ 2020 ◽ Vol 34 (13) ◽ pp. 2052014 ◽ Cited By ~ 1

Author(s): Soniya ◽ Sandeep Paul ◽ Lotika Singh

Keyword(s): Genetic Algorithm ◽ Network Structure ◽ Gradient Descent ◽ Stochastic Gradient Descent ◽ Number Of Layers ◽ Effective Manner ◽ Compact Genetic Algorithm ◽ Benchmark Datasets ◽ The Cost ◽ Optimal Set

This paper applies a hybrid evolutionary approach to a convolutional neural network (CNN) and determines the number of layers and filters based on the application and the user's needs. It integrates a compact genetic algorithm with stochastic gradient descent (SGD) to evolve the structure and parameters of the CNN simultaneously. It defines an effective string representation that combines the structure and parameters of the CNN. The compact genetic algorithm evolves the network structure by optimizing the number of convolutional layers and the number of filters in each convolutional layer. At the same time, an optimal set of weight parameters of the network is obtained using SGD. This approach combines exploration in network space by the compact genetic algorithm with exploitation in weight space by SGD. The proposed approach also incorporates user-defined parameters in the cost function, which control the network structure and hence the performance of the network according to the user's needs. The effectiveness of the proposed approach has been demonstrated on four benchmark datasets, namely MNIST, COIL-100, CIFAR-10 and CIFAR-100. The results demonstrate the potential of the proposed approach by evolving architectures based on the nature of the application and the needs of the user.
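For readers unfamiliar with the compact genetic algorithm named in this abstract, the sketch below shows its generic form on a toy bitstring problem. It is not the paper's method: there is no CNN, no SGD step, and no structure encoding here, and the function name, the `pop_size` and `max_iters` defaults, and the bit-counting fitness are all assumptions chosen for illustration. The key idea is that a probability vector stands in for an explicit population and is nudged toward the winner of each pairwise competition.

```python
import random

def compact_ga(fitness, n_bits, pop_size=50, max_iters=20000, seed=0):
    """Generic compact GA: evolve a probability vector instead of a population."""
    rng = random.Random(seed)
    p = [0.5] * n_bits            # probability that each bit is 1
    step = 1.0 / pop_size         # update size of a virtual population
    for _ in range(max_iters):
        # Sample two candidate bitstrings from the current probability vector.
        a = [1 if rng.random() < pi else 0 for pi in p]
        b = [1 if rng.random() < pi else 0 for pi in p]
        winner, loser = (a, b) if fitness(a) >= fitness(b) else (b, a)
        # Shift probabilities toward the winner where the two candidates differ.
        for i in range(n_bits):
            if winner[i] != loser[i]:
                p[i] += step if winner[i] == 1 else -step
                p[i] = min(1.0, max(0.0, p[i]))
        # Stop once every probability has collapsed to 0 or 1.
        if all(pi in (0.0, 1.0) for pi in p):
            break
    return p

# Toy fitness: count of ones in the bitstring.
probabilities = compact_ga(sum, n_bits=8)
```

In a setting like the one the abstract describes, the bitstring would instead encode structural choices such as layer and filter counts, and the fitness would come from training and evaluating the corresponding network.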
