scholarly journals Perturbation Free-Energy Toolkit: Automated Alchemical Topology Builder and Optimized Simulation Update Scheme

Author(s):  
Drazen Petrov

Free-energy calculations play an important role in the application of computational chemistry to a range of fields, including protein biochemistry, rational drug design or material science. Importantly, the free energy difference is directly related to experimentally measurable quantities such as partition and adsorption coefficients, water activity and binding affinities. Among several techniques aimed at predicting the free-energy differences, perturbation approaches, involving alchemical transformation of one molecule into another through intermediate states, stand out as rigorous methods based on statistical mechanics. However, despite the importance of efficient and accurate free energy predictions, applicability of the perturbation approaches is still largely impeded by a number of challenges. This study aims at addressing two of them: 1) the definition of the perturbation path, i.e., alchemical changes leading to the transformation of one molecule to the other, and 2) determining the amount of sampling along the path to reach desired convergence. In particular, an automatic perturbation builder based on a graph matching algorithm is developed, that is able to identify the maximum common substructure of two molecules and provide the perturbation topologies suitable for free-energy calculations using GROMOS and GROMACS simulation packages. Moreover, it was used to calculate the changes in free energy of a set of post-translational modifications and analyze their convergence behavior. Different methods were tested, which showed that MBAR and extended thermodynamic integration (TI) in combination with MBAR show better performance as compared to BAR, extended TI with linear interpolation and plain TI. Also, a number of error estimators were explored and how they relate to the true error, estimated as the difference in free energy from an extensive set of simulation data. This analysis shows that most of the estimators provide only a qualitative agreement to the true error, with little quantitative predictive power. This notwithstanding, the preformed analyses provided insight into the convergence of free-energy calculations, which allowed for development of an iterative update scheme for perturbation simulations that aims at minimizing the simulation time to reach the convergence, i.e., optimizing the efficiency. Importantly, this toolkit is made available online as an open-source python package (https://github.com/drazen-petrov/SMArt).

2021 ◽  
Author(s):  
Drazen Petrov

Free-energy calculations play an important role in the application of computational chemistry to a range of fields, including protein biochemistry, rational drug design or material science. Importantly, the free energy difference is directly related to experimentally measurable quantities such as partition and adsorption coefficients, water activity and binding affinities. Among several techniques aimed at predicting the free-energy differences, perturbation approaches, involving alchemical transformation of one molecule into another through intermediate states, stand out as rigorous methods based on statistical mechanics. However, despite the importance of efficient and accurate free energy predictions, applicability of the perturbation approaches is still largely impeded by a number of challenges. This study aims at addressing two of them: 1) the definition of the perturbation path, i.e., alchemical changes leading to the transformation of one molecule to the other, and 2) determining the amount of sampling along the path to reach desired convergence. In particular, an automatic perturbation builder based on a graph matching algorithm is developed, that is able to identify the maximum common substructure of two molecules and provide the perturbation topologies suitable for free-energy calculations using GROMOS and GROMACS simulation packages. Moreover, it was used to calculate the changes in free energy of a set of post-translational modifications and analyze their convergence behavior. Different methods were tested, which showed that MBAR and extended thermodynamic integration (TI) in combination with MBAR show better performance as compared to BAR, extended TI with linear interpolation and plain TI. Also, a number of error estimators were explored and how they relate to the true error, estimated as the difference in free energy from an extensive set of simulation data. This analysis shows that most of the estimators provide only a qualitative agreement to the true error, with little quantitative predictive power. This notwithstanding, the preformed analyses provided insight into the convergence of free-energy calculations, which allowed for development of an iterative update scheme for perturbation simulations that aims at minimizing the simulation time to reach the convergence, i.e., optimizing the efficiency. Importantly, this toolkit is made available online as an open-source python package (https://github.com/drazen-petrov/SMArt).


2019 ◽  
Author(s):  
Braden Kelly ◽  
William Smith

We present an algorithm to calculate hydration free energies in explicit solvent that incorporates polarization of the solute molecule in conjunction with the use of a classical fixed--charge force field. The goal is to improve the accuracy over the alternative approach of developing a polarizable force field with adjustable parameters. We incorporate polarization by implementing on--the--fly periodic updating of the solute's partial charges during a standard molecular dynamics (MD) alchemical change simulation by the use of mixed QM/MM calculations. We decouple the polarizing solvent's electric field along with the normal MD solute Coulomb decoupling to calculate the free energy difference between an unpolarized solute in vacuum and a fully polarized solute in solution. This approach is in contrast to the common approach of GAFF, which calculates the difference between a solute in vacuum that is over--polarized by the use of fixed charges calculated using HF/6-31G*, and correspondingly under--polarized by the same partial charge set in the solution phase. We apply our methodology to a test set of 31 molecules, ranging from small polar to large drug--like molecules. We find that results using our method with Minimum Basis Iterative Stockholder (MBIS) charges and using RESP charges with B3LYP/cc-pVTZ are superior to results calculated using the current ``gold standard" AM1--BCC method. We show results using MBIS partial charges using B3LYP/cc-pVTZ and MP2/cc-pVTZ, RESP partial charges using B3LYP/cc-pVTZ and HF/6-31G*, and AM1-BCC partial charges. Our method using MBIS in conjunction with MP2/cc-pVTZ yields an AAD that is 2.91 kJ$\cdot$mol$^{-1}$ (0.70 kcal$\cdot$mol$^{-1}$) lower than that of AM1--BCC for our test set. AM1-BCC was within experimental uncertainty on 13 \% of the data, while our method using MP2 was within experimental uncertainty on 43 \% of the data. We conjecture that results can be further improved by using Lennard--Jones and torsional parameters that are fitted to the MBIS charge method and that using RESP with our method can be improved by using a higher level of theory than B3LYP, for instance MP2 or $\omega$B97X-D.


2020 ◽  
Author(s):  
Maximilian Kuhn ◽  
Stuart Firth-Clark ◽  
Paolo Tosco ◽  
Antonia S. J. S. Mey ◽  
Mark Mackey ◽  
...  

Free energy calculations have seen increased usage in structure-based drug design. Despite the rising interest, automation of the complex calculations and subsequent analysis of their results are still hampered by the restricted choice of available tools. In this work, an application for automated setup and processing of free energy calculations is presented. Several sanity checks for assessing the reliability of the calculations were implemented, constituting a distinct advantage over existing open-source tools. The underlying workflow is built on top of the software Sire, SOMD, BioSimSpace and OpenMM and uses the AMBER14SB and GAFF2.1 force fields. It was validated on two datasets originally composed by Schrödinger, consisting of 14 protein structures and 220 ligands. Predicted binding affinities were in good agreement with experimental values. For the larger dataset the average correlation coefficient Rp was 0.70 ± 0.05 and average Kendall’s τ was 0.53 ± 0.05 which is broadly comparable to or better than previously reported results using other methods. <br>


Sign in / Sign up

Export Citation Format

Share Document