scholarly journals Empirical optimization of molecular simulation force fields by Bayesian inference

2021 ◽  
Vol 94 (12) ◽  
Author(s):  
Jürgen Köfinger ◽  
Gerhard Hummer

Abstract The demands on the accuracy of force fields for classical molecular dynamics simulations are steadily growing as larger and more complex systems are studied over longer times. One way to meet these growing demands is to hand over the learning of force fields and their parameters to machines in a systematic (semi)automatic manner. Doing so, we can take full advantage of exascale computing, the increasing availability of experimental data, and advances in quantum mechanical computations and the calculation of experimental observables from molecular ensembles. Here, we discuss and illustrate the challenges one faces in this endeavor and explore a way forward by adapting the Bayesian inference of ensembles (BioEn) method [Hummer and Köfinger, J. Chem. Phys. (2015)] for force field parameterization. In the Bayesian inference of force fields (BioFF) method developed here, the optimization problem is regularized by a simplified prior on the force field parameters and an entropic prior acting on the ensemble. The latter compensates for the unavoidable over simplifications in the parameter prior. We determine optimal force field parameters using an iterative predictor–corrector approach, in which we run simulations, determine the reference ensemble using the weighted histogram analysis method (WHAM), and update the force field according to the BioFF posterior. We illustrate this approach for a simple polymer model, using the distance between two labeled sites as the experimental observable. By systematically resolving force field issues, instead of just reweighting a structural ensemble, the BioFF corrections extend to observables not included in ensemble reweighting. We envision future force field optimization as a formalized, systematic, and (semi)automatic machine-learning effort that incorporates a wide range of data from experiment and high-level quantum chemical calculations, and takes advantage of exascale computing resources. Graphic abstract

Author(s):  
Alice Allen ◽  
Michael J. Robertson ◽  
Michael C. Payne ◽  
Daniel Cole

<div><div><div><p>Molecular mechanics force field parameters for macromolecules, such as proteins, are traditionally fit to reproduce experimental properties of small molecules, and thus they neglect system-specific polarization. In this paper, we introduce a complete QUantum mechanical BEspoke (QUBE) protein force field, which derives non-bonded parameters directly from the electron density of the specific protein under study. The main backbone and sidechain protein torsional parameters are re-derived in this work by fitting to quantum mechanical dihedral scans for compatibibility with QUBE non-bonded parameters. Software is provided for the preparation of QUBE input files. The accuracy of the new force field, and the derived torsional parameters, are tested by comparing the conformational preferences of a range of peptides and proteins with experimental measurements. Accurate backbone and sidechain conformations are obtained in molecular dynamics simulations of dipeptides, with NMR J coupling errors comparable to the widely-used OPLS force field. In simulations of five folded proteins, the secondary structure is generally retained and the NMR J coupling errors are similar to standard transferable force fields, although some loss of the experimental structure is observed in certain regions of the proteins. Overall, with several avenues for further development, the use of system-specific non-bonded force field parameters is a promising approach for next-generation simulations of biological molecules.</p></div></div></div>


Author(s):  
Joshua Horton ◽  
Alice Allen ◽  
Leela Dodda ◽  
Daniel Cole

<div><div><div><p>Modern molecular mechanics force fields are widely used for modelling the dynamics and interactions of small organic molecules using libraries of transferable force field parameters. For molecules outside the training set, parameters may be missing or inaccurate, and in these cases, it may be preferable to derive molecule-specific parameters. Here we present an intuitive parameter derivation toolkit, QUBEKit (QUantum mechanical BEspoke Kit), which enables the automated generation of system-specific small molecule force field parameters directly from quantum mechanics. QUBEKit is written in python and combines the latest QM parameter derivation methodologies with a novel method for deriving the positions and charges of off-center virtual sites. As a proof of concept, we have re-derived a complete set of parameters for 109 small organic molecules, and assessed the accuracy by comparing computed liquid properties with experiment. QUBEKit gives highly competitive results when compared to standard transferable force fields, with mean unsigned errors of 0.024 g/cm3, 0.79 kcal/mol and 1.17 kcal/mol for the liquid density, heat of vaporization and free energy of hydration respectively. This indicates that the derived parameters are suitable for molecular modelling applications, including computer-aided drug design.</p></div></div></div>


Author(s):  
Joshua Horton ◽  
Alice Allen ◽  
Leela Dodda ◽  
Daniel Cole

<div><div><div><p>Modern molecular mechanics force fields are widely used for modelling the dynamics and interactions of small organic molecules using libraries of transferable force field parameters. For molecules outside the training set, parameters may be missing or inaccurate, and in these cases, it may be preferable to derive molecule-specific parameters. Here we present an intuitive parameter derivation toolkit, QUBEKit (QUantum mechanical BEspoke Kit), which enables the automated generation of system-specific small molecule force field parameters directly from quantum mechanics. QUBEKit is written in python and combines the latest QM parameter derivation methodologies with a novel method for deriving the positions and charges of off-center virtual sites. As a proof of concept, we have re-derived a complete set of parameters for 109 small organic molecules, and assessed the accuracy by comparing computed liquid properties with experiment. QUBEKit gives highly competitive results when compared to standard transferable force fields, with mean unsigned errors of 0.024 g/cm3, 0.79 kcal/mol and 1.17 kcal/mol for the liquid density, heat of vaporization and free energy of hydration respectively. This indicates that the derived parameters are suitable for molecular modelling applications, including computer-aided drug design.</p></div></div></div>


RSC Advances ◽  
2014 ◽  
Vol 4 (89) ◽  
pp. 48621-48631 ◽  
Author(s):  
Eleanor R. Turpin ◽  
Sam Mulholland ◽  
Andrew M. Teale ◽  
Boyan B. Bonev ◽  
Jonathan D. Hirst

2017 ◽  
Vol 114 (31) ◽  
pp. 8265-8270 ◽  
Author(s):  
Simon Olsson ◽  
Hao Wu ◽  
Fabian Paul ◽  
Cecilia Clementi ◽  
Frank Noé

Accurate mechanistic description of structural changes in biomolecules is an increasingly important topic in structural and chemical biology. Markov models have emerged as a powerful way to approximate the molecular kinetics of large biomolecules while keeping full structural resolution in a divide-and-conquer fashion. However, the accuracy of these models is limited by that of the force fields used to generate the underlying molecular dynamics (MD) simulation data. Whereas the quality of classical MD force fields has improved significantly in recent years, remaining errors in the Boltzmann weights are still on the order of a few kT, which may lead to significant discrepancies when comparing to experimentally measured rates or state populations. Here we take the view that simulations using a sufficiently good force-field sample conformations that are valid but have inaccurate weights, yet these weights may be made accurate by incorporating experimental data a posteriori. To do so, we propose augmented Markov models (AMMs), an approach that combines concepts from probability theory and information theory to consistently treat systematic force-field error and statistical errors in simulation and experiment. Our results demonstrate that AMMs can reconcile conflicting results for protein mechanisms obtained by different force fields and correct for a wide range of stationary and dynamical observables even when only equilibrium measurements are incorporated into the estimation process. This approach constitutes a unique avenue to combine experiment and computation into integrative models of biomolecular structure and dynamics.


1999 ◽  
Vol 103 (33) ◽  
pp. 6998-7014 ◽  
Author(s):  
Carl S. Ewig ◽  
Thomas S. Thacher ◽  
Arnold T. Hagler

2012 ◽  
Vol 18 (8) ◽  
pp. 3455-3466 ◽  
Author(s):  
Gül Altınbaş Özpınar ◽  
Frank R. Beierlein ◽  
Wolfgang Peukert ◽  
Dirk Zahn ◽  
Timothy Clark

2004 ◽  
Vol 03 (03) ◽  
pp. 339-358 ◽  
Author(s):  
YOSHITAKE SAKAE ◽  
YUKO OKAMOTO

We optimized five existing sets of force-field parameters for protein systems by our recently proposed method. The five force fields are AMBER parm94, AMBER parm96, AMBER parm99, CHARMM version 22, and OPLS-AA. The method consists of minimizing the sum of the square of the force acting on each atom in the proteins with the structures from the Protein Data Bank (PDB). We selected the partial-charge and backbone torsion-energy parameters for this optimization, and 100 molecules from the PDB were used. We gave detailed comparisons of the optimized force fields and found that there is a tendency of convergence towards the same function for the torsion-energy term.


Sign in / Sign up

Export Citation Format

Share Document