Binding Affinity Estimation From Restrained Umbrella Sampling Simulations

Accelerating the Calculation of Protein-Ligand Binding Free Energy and Residence Times using Dynamically Optimized Collective Variables

10.1101/417915 ◽

2018 ◽

Author(s):

Z. Faidon Brotzakis ◽

Vittorio Limongelli ◽

Michele Parrinello

Keyword(s):

Free Energy ◽

Ligand Binding ◽

Degrees Of Freedom ◽

Binding Free Energy ◽

Conformational Dynamics ◽

Computational Cost ◽

Binding Pocket ◽

Binding Mode ◽

Collective Variables ◽

Binding Interaction

AbstractElucidation of the ligand/protein binding interaction is of paramount relevance in pharmacology to increase the success rate of drug design. To this end a number of computational methods have been proposed, however all of them suffer from limitations since the ligand binding/unbinding transitions to the molecular target involve many slow degrees of freedom that hamper a full characterization of the binding process. Being able to express this transition in simple and general slow degrees of freedom, would give a distinctive advantage, since it would require minimal knowledge of the system under study, while in turn it would elucidate its physics and accelerate the convergence speed of enhanced sampling methods relying on collective variables. In this study we pursuit this goal by combining for the first time Variation Approach to Conformational dynamics with Funnel-Metadynamics. In so doing, we predict for the benzamidine/trypsin system the ligand binding mode, and we accurately compute the absolute protein-ligand binding free energy and unbinding rate at unprecedented low computational cost. Finally, our simulation protocol reveals the energetics and structural details of the ligand binding mechanism and shows that water and binding pocket solvation/desolvation are the dominant slow degrees of freedom.

Download Full-text

Automated, Accurate, and Scalable Relative Protein-Ligand Binding Free Energy Calculations using Lambda Dynamics

10.26434/chemrxiv.12781310.v1 ◽

2020 ◽

Author(s):

E. Prabhu Raman ◽

Thomas J. Paul ◽

Ryan L. Hayes ◽

Charles L. Brooks III

Keyword(s):

Free Energy ◽

Ligand Binding ◽

Binding Affinity ◽

Binding Free Energy ◽

Computational Cost ◽

Combinatorial Libraries ◽

Free Energy Calculations ◽

Lead Optimization ◽

Efficient Estimation ◽

Lead Compound

Accurate predictions of changes to protein-ligand binding affinity in response to chemical modifications are of utility in small molecule lead optimization. Relative free energy perturbation (FEP) approaches are one of the most widely utilized for this goal, but involve significant computational cost, thus limiting their application to small sets of compounds. Lambda dynamics, also rigorously based on the principles of statistical mechanics, provides a more efficient alternative. In this paper, we describe the development of a workflow to setup, execute, and analyze Multi-Site Lambda Dynamics (MSLD) calculations run on GPUs with CHARMm implemented in BIOVIA Discovery Studio and Pipeline Pilot. The workflow establishes a framework for setting up simulation systems for exploratory screening of modifications to a lead compound, enabling the calculation of relative binding affinities of combinatorial libraries. To validate the workflow, a diverse dataset of congeneric ligands for seven proteins with experimental binding affinity data is examined. A protocol to automatically tailor fit biasing potentials iteratively to flatten the free energy landscape of any MSLD system is developed that enhances sampling and allows for efficient estimation of free energy differences. The protocol is first validated on a large number of ligand subsets that model diverse substituents, which shows accurate and reliable performance. The scalability of the workflow is also tested to screen more than a hundred ligands modeled in a single system, which also resulted in accurate predictions. With a cumulative sampling time of 150ns or less, the method results in average unsigned errors of under 1 kcal/mol in most cases for both small and large combinatorial libraries. For the multi-site systems examined, the method is estimated to be more than an order of magnitude more efficient than contemporary FEP applications. The results thus demonstrate the utility of the presented MSLD workflow to efficiently screen combinatorial libraries and explore chemical space around a lead compound, and thus are of utility in lead optimization.

Download Full-text

Optimal Designs of Pairwise Calculation: an Application to Free Energy Perturbation in Minimizing Prediction Variability

10.26434/chemrxiv.7965140 ◽

2019 ◽

Author(s):

Qingyi Yang ◽

Woodrow W. Burchett ◽

Gregory S. Steeno ◽

David L. Mobley ◽

Xinjun Hou

Keyword(s):

Free Energy ◽

Least Squares ◽

Binding Affinity ◽

Binding Free Energy ◽

The Other ◽

Computer Hardware ◽

Free Energy Perturbation ◽

Implicit Assumption ◽

Reference Ligand ◽

Energy Perturbation

Predicting binding free energy of ligand-protein complexes has been a grand challenge in the field of computational chemistry since the early days of molecular modeling. Multiple computational methodologies exist to predict ligand binding affinities. Pathway-based Free Energy Perturbation (FEP), Thermodynamic Integration (TI) as well as Linear Interaction Energy (LIE), and Molecular Mechanics-Poisson Boltzmann/Generalized Born Surface Area (MM-PBSA/GBSA) have been applied to a variety of biologically relevant problems and achieved different levels of predictive accuracy. Recent advancements in computer hardware and simulation algorithms of molecular dynamics and Monte Carlo sampling, as well as improved general force field parameters, have made FEP a principal approach for calculating the free energy differences, especially when calculating the host-guest binding affinity differences upon chemical modification. Since the FEP-calculated binding free energy difference, denoted ddGFEP only characterizes the difference in free energy between pairs of ligands or complexes, not the absolute binding free energy value of each individual host-guest system, denoted dG, we examine two rarely asked questions in FEP application: 1) Which values would be more appropriate as the prediction to assess the ligands prospectively: the calculated pairwise free energy differences, ddGFEP, or the estimated absolute binding energies, d^G, transformed from ddGFEP? 2) In the situation where only a limited number of ligand pairs can be calculated in FEP, can the perturbation pairs be optimally selected with respect to the reference ligand(s) to maximize the prediction precision? These two questions underline the viability of an often-neglected assumption in pairwise comparisons: that the pairwise value is sufficient to make a quantitative and reliable characterization of an individual ligand's properties or activities. This implicit assumption would be true if there was no error in each pairwise calculation. Recently pair designs such as multiple pathways or cycle closure analyses provided calculation error estimation but did not address the statistical impact of the two questions above. The error impact is fully minimized by conducting an exhaustive study that obtains all NC2 = N(N-1)/2 pairs for a set N molecules; more if there is directionality (dGi,j != dGj,i). Obviously, that study design is impractical and unnecessary. Thus, we desire to collect the right amount of data that is 1) feasibly attainable, 2) topologically sufficient, and 3) mathematically synthesizable so that we can mitigate inherent calculation errors and have higher confidence in our conclusions. The significance of above questions can be illustrated by a motivating example shown in Figure 1 and Table 1, which considers two different perturbation graph designs for 20 ligands with the same number of FEP perturbation pairs, 19, and the same reference, Ligand 1. These two designs reached different conclusions in rank ordering ligand potencies due to errors inherent in the FEP derived estimates. Based on design A, ligands 5, 7, 14, 15 would be selected as the best four (20%) picks since those d^G estimates are the most favorable. Design B would yield ligands 5, 12, 18, 19 as best for the same reason. Without knowing the true value, dGTrue of the other 19 ligands, we lack a prospective metric to assess which design could be more precise even though, retrospectively, we know that both designs had reasonably good agreement with the true values, as measured through correlation and error metrics. However, the top picks from neither design were consistent with the true top four ligands, which are ligands 7, 10, 12, 18. Yet, if all of the 20C2 =190 pairs could have been calculated as listed in the last column of Table 1, the best four ligands would have been correctly identified. Additionally, the other metrics included in Table 1 were significantly improved. However, as mentioned above, calculating all possible pairs, or even a significant fraction of all possible pairs, is unlikely in practice, especially when number of molecules are large. Given this restriction, is it possible to objectively determine whether design A or B will give more precise predictions? In this report, we investigated the performance of the calculated ddGFEP values compared to the pairwise differences in least squares derived d^G estimates both analytically and through simulations. Based on our findings, we recommend applying weighted least squares to transforming ddGFEP values into d^G estimates. Second, we investigated the factors that contribute to the precision of the d^G estimates, such as the total number of computed pairs, the selection of computed pairs, and the uncertainty in the computed ddGFEP values. The mean squared error, denoted MSE and Spearman's rank correlation, are used as performance metrics. To illustrate, we demonstrated how the structural similarity can be included in design and its potential impact on prediction precision. As in the majority of reported FEP studies on binding affinity prediction, the ddGFEP pairs were selected based on chemical structure similarity. Pairs with small chemical differences are assumed to be more likely to have smaller errors in ddGFEP calculation. Together using the constructed mathematic system and literature examples, we demonstrate that some of pair-selection schemes (designs) are better than the others. To minimize the prediction uncertainty, it is recommended to wisely select design optimality criterion to suit practical applications accordingly.

Download Full-text

Docking, Binding Free Energy Estimation, and MD Simulation of Newly Designed CQ and HCQ Analogues Against the Spike-ACE2 Complex of SARS-CoV-2

International Journal of Quantitative Structure-Property Relationships ◽

10.4018/ijqspr.2021100105 ◽

2021 ◽

Vol 6 (4) ◽

pp. 77-89

Author(s):

Anjoomaara H. Patel ◽

Riya B. Patel ◽

MahammadHussain J. Memon ◽

Samiya S. Patel ◽

Sharav A. Desai ◽

...

Keyword(s):

Free Energy ◽

Md Simulation ◽

Binding Free Energy ◽

Docking Studies ◽

Radius Of Gyration ◽

Ligand Interaction ◽

Energy Estimation ◽

Efficacious Treatment ◽

Protein Ligand Interaction ◽

Derivatives Of

The coronavirus disease 2019 (COVID-19) virus has been spreading rapidly, and scientists are endeavouring to discover drugs for its efficacious treatment. Chloroquine phosphate, an old drug for treatment of malaria, has shown to have apparent efficacy and acceptable safety against COVID-19. As a part of Drug Discovery Hackathon-2020, in this study, the authors have tried making the derivatives of CQ and HCQ using MarvinSketch by ChemAxon. Molecular docking studies of these ligands were performed using Glide by Schrodinger, and ADME profiles were obtained by using QikProp. The obtained results after data analysis demonstrated that ligands HCQ_imidazoll, choloroquine_3c, HCQ_pyrrolC had good binding affinity and complied with all the ADME parameters. The molecular dynamic simulation of these ligands in complex with the 2019-nCoV RBD/ACE-2-B0AT1 complex PDB ID: 6M17 were carried out, and the parameters like RMSD, RMSF, and radius of gyration were observed to understand the fluctuations and protein-ligand interaction.

Download Full-text

Current Trends in Docking Methodologies

Oncology ◽

10.4018/978-1-5225-0549-5.ch032 ◽

2017 ◽

pp. 829-847

Author(s):

Shubhandra Tripathi ◽

Akhil Kumar ◽

Amandeep Kaur Kahlon ◽

Ashok Sharma

Keyword(s):

Free Energy ◽

Molecular Docking ◽

Binding Affinity ◽

Binding Free Energy ◽

Energy Calculation ◽

Molecular Binding ◽

New Horizons ◽

Current Trends ◽

Complex Scheme ◽

Dynamics Simulations

Molecular docking was earlier considered to predict the binding affinity of the receptor and ligand molecules. With the progress in computational power and developing approaches, new horizons are now opening for accurate prediction of molecular binding affinity. In the current book chapter, recent strategies for Computer-Aided Drug Designing (CADD) including virtual screening and molecular docking, encompassing molecular dynamics simulations, and binding free energy calculation methods are discussed. Brief overview of different binding free energy methods MMPBSA, MMGBSA, LIE and TI have also been given along with the recent Relaxed Complex Scheme protocol.

Download Full-text

Optimal Designs of Pairwise Calculation: an Application to Free Energy Perturbation in Minimizing Prediction Variability

10.26434/chemrxiv.7965140.v2 ◽

2019 ◽

Author(s):

Qingyi Yang ◽

Woodrow W. Burchett ◽

Gregory S. Steeno ◽

David L. Mobley ◽

Xinjun Hou

Keyword(s):

Free Energy ◽

Least Squares ◽

Binding Affinity ◽

Binding Free Energy ◽

The Other ◽

Computer Hardware ◽

Free Energy Perturbation ◽

Implicit Assumption ◽

Reference Ligand ◽

Energy Perturbation

Predicting binding free energy of ligand-protein complexes has been a grand challenge in the field of computational chemistry since the early days of molecular modeling. Multiple computational methodologies exist to predict ligand binding affinities. Pathway-based Free Energy Perturbation (FEP), Thermodynamic Integration (TI) as well as Linear Interaction Energy (LIE), and Molecular Mechanics-Poisson Boltzmann/Generalized Born Surface Area (MM-PBSA/GBSA) have been applied to a variety of biologically relevant problems and achieved different levels of predictive accuracy. Recent advancements in computer hardware and simulation algorithms of molecular dynamics and Monte Carlo sampling, as well as improved general force field parameters, have made FEP a principal approach for calculating the free energy differences, especially when calculating the host-guest binding affinity differences upon chemical modification. Since the FEP-calculated binding free energy difference, denoted ddGFEP only characterizes the difference in free energy between pairs of ligands or complexes, not the absolute binding free energy value of each individual host-guest system, denoted dG, we examine two rarely asked questions in FEP application: 1) Which values would be more appropriate as the prediction to assess the ligands prospectively: the calculated pairwise free energy differences, ddGFEP, or the estimated absolute binding energies, d^G, transformed from ddGFEP? 2) In the situation where only a limited number of ligand pairs can be calculated in FEP, can the perturbation pairs be optimally selected with respect to the reference ligand(s) to maximize the prediction precision? These two questions underline the viability of an often-neglected assumption in pairwise comparisons: that the pairwise value is sufficient to make a quantitative and reliable characterization of an individual ligand's properties or activities. This implicit assumption would be true if there was no error in each pairwise calculation. Recently pair designs such as multiple pathways or cycle closure analyses provided calculation error estimation but did not address the statistical impact of the two questions above. The error impact is fully minimized by conducting an exhaustive study that obtains all NC2 = N(N-1)/2 pairs for a set N molecules; more if there is directionality (dGi,j != dGj,i). Obviously, that study design is impractical and unnecessary. Thus, we desire to collect the right amount of data that is 1) feasibly attainable, 2) topologically sufficient, and 3) mathematically synthesizable so that we can mitigate inherent calculation errors and have higher confidence in our conclusions. The significance of above questions can be illustrated by a motivating example shown in Figure 1 and Table 1, which considers two different perturbation graph designs for 20 ligands with the same number of FEP perturbation pairs, 19, and the same reference, Ligand 1. These two designs reached different conclusions in rank ordering ligand potencies due to errors inherent in the FEP derived estimates. Based on design A, ligands 5, 7, 14, 15 would be selected as the best four (20%) picks since those d^G estimates are the most favorable. Design B would yield ligands 5, 12, 18, 19 as best for the same reason. Without knowing the true value, dGTrue of the other 19 ligands, we lack a prospective metric to assess which design could be more precise even though, retrospectively, we know that both designs had reasonably good agreement with the true values, as measured through correlation and error metrics. However, the top picks from neither design were consistent with the true top four ligands, which are ligands 7, 10, 12, 18. Yet, if all of the 20C2 =190 pairs could have been calculated as listed in the last column of Table 1, the best four ligands would have been correctly identified. Additionally, the other metrics included in Table 1 were significantly improved. However, as mentioned above, calculating all possible pairs, or even a significant fraction of all possible pairs, is unlikely in practice, especially when number of molecules are large. Given this restriction, is it possible to objectively determine whether design A or B will give more precise predictions? In this report, we investigated the performance of the calculated ddGFEP values compared to the pairwise differences in least squares derived d^G estimates both analytically and through simulations. Based on our findings, we recommend applying weighted least squares to transforming ddGFEP values into d^G estimates. Second, we investigated the factors that contribute to the precision of the d^G estimates, such as the total number of computed pairs, the selection of computed pairs, and the uncertainty in the computed ddGFEP values. The mean squared error, denoted MSE and Spearman's rank correlation, are used as performance metrics. To illustrate, we demonstrated how the structural similarity can be included in design and its potential impact on prediction precision. As in the majority of reported FEP studies on binding affinity prediction, the ddGFEP pairs were selected based on chemical structure similarity. Pairs with small chemical differences are assumed to be more likely to have smaller errors in ddGFEP calculation. Together using the constructed mathematic system and literature examples, we demonstrate that some of pair-selection schemes (designs) are better than the others. To minimize the prediction uncertainty, it is recommended to wisely select design optimality criterion to suit practical applications accordingly.

Download Full-text

A Benchmark of Popular Free Energy Approaches Revealing the Inhibitors Binding to SARS-CoV2 Mpro

10.26434/chemrxiv.13318523.v1 ◽

2020 ◽

Author(s):

Son Tung Ngo ◽

Nguyen Minh Tam ◽

Pham Minh Quan ◽

Trung Hai Nguyen

Keyword(s):

Free Energy ◽

Ligand Binding ◽

Binding Affinity ◽

Drug Target ◽

Binding Free Energy ◽

Essential Elements ◽

Dominant Factor ◽

Linear Interaction ◽

Energy Perturbation ◽

Binding Cleft

COVID-19 pandemic has killed millions of people worldwide since its outbreak in Dec 2019. The pandemic is caused by the SARS-CoV-2 virus whose main protease (Mpro) is a promising drug target since it plays a key role in viral proliferation and replication. Currently, designing an effective therapy is an urgent task, which requires accurately estimating ligand-binding free energy to the SARS-CoV-2 Mpro. However, it should be noted that the accuracy of a free energy method probably depends on the protein target. A highly accurate approach for some targets may fail to produce a reasonable correlation with experiment when a novel enzyme is considered as a drug target. Therefore, in this context, the ligand-binding affinity to SARS-CoV-2 Mpro was calculated via various approaches. The Autodock Vina (Vina) and Autodock4 (AD4) packages were manipulated to preliminary investigate the ligand-binding affinity and pose to the SARS-CoV-2 Mpro. The binding free energy was then refined using the fast pulling of ligand (FPL), linear interaction energy (LIE), molecular mechanics-Poission Boltzmann surface area (MM-PBSA), and free energy perturbation (FEP) methods. The benchmark results indicated that for docking calculations, Vina is more accurate than AD4 and for free energy methods, FEP is the most accurate followed by LIE, FPL and MM-PBSA (FEP > LIE > FPL > MM-PBSA). Moreover, the binding mechanism was also revealed by atomistic simulations. The vdW interaction is the dominant factor. The residues Thr25, Thr26, His41, Ser46, Asn142, Gly143, Cys145, Glu166, and Gln189 are essential elements affecting on the binding process. Furthermore, the Ser46 and related residues probably are important elements affecting the enlarge/dwindle of the SARS-CoV-2 Mpro binding cleft. The benchmark probably guide for further investigations using computational approaches.

Download Full-text

Drug Repurposing Against SARS-CoV-2 Using E-Pharmacophore Based Virtual Screening and Molecular Docking with Main Protease as the Target

10.26434/chemrxiv.12199610 ◽

2020 ◽

Author(s):

arun kumar ◽

Sharanya C.S ◽

Abhithaj J ◽

Dileep Francis ◽

Sadasivan C

Keyword(s):

Free Energy ◽

Molecular Docking ◽

Drug Target ◽

Binding Free Energy ◽

Drug Repurposing ◽

Energy Calculation ◽

Viral Protease ◽

Energy Estimation ◽

Main Protease ◽

Potential Inhibitors

Since its first report in December 2019 from China the COVID-19 pandemic caused by the beta-coronavirus SARS-CoV-2 has spread at an alarming pace infecting about 26 lakh, and claiming the lives of more than 1.8 lakh individuals across the globe. Although social quarantine measures have succeeded in containing the spread of the virus to some extent, the lack of a clinically approved vaccine or drug remains the biggest bottleneck in combating the pandemic. Drug repurposing can expedite the process of drug development by identifying known drugs which are effective against SARS-CoV-2. The SARS-CoV-2 main protease is a promising drug target due to its indispensable role in viral multiplication inside the host. In the present study an E-pharmacophore hypothesis was generated using the crystal structure of the viral protease in complex with an imidazole carbaximide inhibitor as the drug target. Drugs available in the superDRUG2 database were used to identify candidate drugs for repurposing. The hits were further screened using a structure based approach involving molecular docking at different precisions. The most promising drugs were subjected to binding free energy estimation using MM-GBSA. Among the 4600 drugs screened 17 drugs were identified as candidate inhibitors of the viral protease based on the glide scores obtained from molecular docking. Binding free energy calculation showed that six drugs viz, Binifibrate, Macimorelin acetate, Bamifylline, Rilmazafon, Afatinib and Ezetimibe can act as potential inhibitors of the viral protease.

Download Full-text