scholarly journals De Novo Molecule Design Through Molecular Generative Model Conditioned by 3D Information of Protein Binding Sites

Author(s):  
Mingyuan Xu ◽  
Ting Ran ◽  
Hongming Chen

<p><i>De novo</i> molecule design through molecular generative model is gaining increasing attention in recent years. Here a novel generative model was proposed by integrating the 3D structural information of the protein binding pocket into the conditional RNN (cRNN) model to control the generation of drug-like molecules. In this model, the composition of protein binding pocket is effectively characterized through a coarse-grain strategy and the three-dimensional information of the pocket can be represented by the sorted eigenvalues of the coulomb matrix (EGCM) of the coarse-grained atoms composing the binding pocket. In current work, we used our EGCM method and a previously reported binding pocket descriptor DeeplyTough to train cRNN models and compared their performance. It has been shown that the molecules generated with the control of protein environment information have a clear tendency on generating compounds with higher similarity to the original X-ray bound ligand than normal RNN model and also achieving better performance in terms of docking scores. Our results demonstrate the potential application of EGCM controlled generative model for the targeted molecule generation and guided exploration on the drug-like chemical space. </p><p> </p>

2020 ◽  
Author(s):  
Mingyuan Xu ◽  
Ting Ran ◽  
Hongming Chen

<p><i>De novo</i> molecule design through molecular generative model is gaining increasing attention in recent years. Here a novel generative model was proposed by integrating the 3D structural information of the protein binding pocket into the conditional RNN (cRNN) model to control the generation of drug-like molecules. In this model, the composition of protein binding pocket is effectively characterized through a coarse-grain strategy and the three-dimensional information of the pocket can be represented by the sorted eigenvalues of the coulomb matrix (EGCM) of the coarse-grained atoms composing the binding pocket. In current work, we used our EGCM method and a previously reported binding pocket descriptor DeeplyTough to train cRNN models and compared their performance. It has been shown that the molecules generated with the control of protein environment information have a clear tendency on generating compounds with higher similarity to the original X-ray bound ligand than normal RNN model and also achieving better performance in terms of docking scores. Our results demonstrate the potential application of EGCM controlled generative model for the targeted molecule generation and guided exploration on the drug-like chemical space. </p><p> </p>


Author(s):  
Oleksii Prykhodko ◽  
Simon Viet Johansson ◽  
Panagiotis-Christos Kotsias ◽  
Esben Jannik Bjerrum ◽  
Ola Engkvist ◽  
...  

<p>Recently deep learning method has been used for generating novel structures. In the current study, we proposed a new deep learning method, LatentGAN, which combine an autoencoder and a generative adversarial neural network for doing de novo molecule design. We applied the method for structure generation in two scenarios, one is to generate random drug-like compounds and the other is to generate target biased compounds. Our results show that the method works well in both cases, in which sampled compounds from the trained model can largely occupy the same chemical space of the training set and still a substantial fraction of the generated compound are novel. The distribution of drug-likeness score for compounds sampled from LatentGAN is also similar to that of the training set.</p>


2021 ◽  
Author(s):  
Kostas Papadopoulos ◽  
Kathryn A. Giblin ◽  
Jon Paul Janet ◽  
Atanas Patronov ◽  
Ola Engkvist

<p>We have demonstrated the utility of a 3D shape and pharmacophore similarity scoring component in molecular design with a deep generative model trained with reinforcement learning. Using Dopamine receptor type 2 (DRD2) as an example and its antagonist haloperidol <b>1</b> as a starting point in a ligand based design context, we have shown in a retrospective study that a 3D similarity enabled generative model can discover new leads in the absence of any other information. It can be efficiently used for scaffold hopping and generation of novel series. 3D similarity based models were compared against 2D QSAR based, indicating a significant degree of orthogonality of the generated outputs and with the former having a more diverse output. In addition, when the two scoring components are combined together for training of the generative model, it results in more efficient exploration of desirable chemical space compared to the individual components. </p>


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Shohei Konno ◽  
Takao Namiki ◽  
Koichiro Ishimori

Abstract To quantitatively categorize protein structures, we developed a quantitative coarse-grained model of protein structures with a novel amino acid network, the interaction selective network (ISN), characterized by the links based on interactions in both the main and side chains. We found that the ISN is a novel robust network model to show the higher classification probability in the plots of average vertex degree (k) versus average clustering coefficient (C), both of which are typical network parameters for protein structures, and successfully distinguished between “all-α” and “all-β” proteins. On the other hand, one of the typical conventional networks, the α-carbon network (CAN), was found to be less robust than the ISN, and another typical network, atomic distance network (ADN), failed to distinguish between these two protein structures. Considering that the links in the CAN and ADN are defined by the interactions only between the main chain atoms and by the distance of the closest atom pair between the two amino acid residues, respectively, we can conclude that reflecting structural information from both secondary and tertiary structures in the network parameters improves the quantitative evaluation and robustness in network models, resulting in a quantitative and more robust description of three-dimensional protein structures in the ISN.


Author(s):  
Radek Erban

Three coarse-grained molecular dynamics (MD) models are investigated with the aim of developing and analysing multi-scale methods which use MD simulations in parts of the computational domain and (less detailed) Brownian dynamics (BD) simulations in the remainder of the domain. The first MD model is formulated in one spatial dimension. It is based on elastic collisions of heavy molecules (e.g. proteins) with light point particles (e.g. water molecules). Two three-dimensional MD models are then investigated. The obtained results are applied to a simplified model of protein binding to receptors on the cellular membrane. It is shown that modern BD simulators of intracellular processes can be used in the bulk and accurately coupled with a (more detailed) MD model of protein binding which is used close to the membrane.


2019 ◽  
Author(s):  
Fergus Imrie ◽  
Anthony R. Bradley ◽  
Mihaela van der Schaar ◽  
Charlotte M. Deane

AbstractRational compound design remains a challenging problem for both computational methods and medicinal chemists. Computational generative methods have begun to show promising results for the design problem. However, they have not yet used the power of 3D structural information. We have developed a novel graph-based deep generative model that combines state-of-the-art machine learning techniques with structural knowledge. Our method (“DeLinker”) takes two fragments or partial structures and designs a molecule incorporating both. The generation process is protein context dependent, utilising the relative distance and orientation between the partial structures. This 3D information is vital to successful compound design, and we demonstrate its impact on the generation process and the limitations of omitting such information. In a large scale evaluation, DeLinker designed 60% more molecules with high 3D similarity to the original molecule than a database baseline. When considering the more relevant problem of longer linkers with at least five atoms, the outperformance increased to 200%. We demonstrate the effectiveness and applicability of this approach on a diverse range of design problems: fragment linking, scaffold hopping, and proteolysis targeting chimera (PROTAC) design. As far as we are aware, this is the first molecular generative model to incorporate 3D structural information directly in the design process. Code is available at https://github.com/oxpig/DeLinker.


Author(s):  
Oleksii Prykhodko ◽  
Simon Viet Johansson ◽  
Panagiotis-Christos Kotsias ◽  
Esben Jannik Bjerrum ◽  
Ola Engkvist ◽  
...  

<p>Recently deep learning method has been used for generating novel structures. In the current study, we proposed a new deep learning method, LatentGAN, which combine an autoencoder and a generative adversarial neural network for doing de novo molecule design. We applied the method for structure generation in two scenarios, one is to generate random drug-like compounds and the other is to generate target biased compounds. Our results show that the method works well in both cases, in which sampled compounds from the trained model can largely occupy the same chemical space of the training set and still a substantial fraction of the generated compound are novel. The distribution of drug-likeness score for compounds sampled from LatentGAN is also similar to that of the training set.</p>


2021 ◽  
Author(s):  
Longxing Cao ◽  
Brian Coventry ◽  
Inna Goreshnik ◽  
Buwei Huang ◽  
Joon Sung Park ◽  
...  

The design of proteins that bind to a specific site on the surface of a target protein using no information other than the three-dimensional structure of the target remains an outstanding challenge. We describe a general solution to this problem which starts with a broad exploration of the very large space of possible binding modes and interactions, and then intensifies the search in the most promising regions. We demonstrate its very broad applicability by de novo design of binding proteins to 12 diverse protein targets with very different shapes and surface properties. Biophysical characterization shows that the binders, which are all smaller than 65 amino acids, are hyperstable and bind their targets with nanomolar to picomolar affinities. We succeeded in solving crystal structures of four of the binder-target complexes, and all four are very close to the corresponding computational design models. Experimental data on nearly half a million computational designs and hundreds of thousands of point mutants provide detailed feedback on the strengths and limitations of the method and of our current understanding of protein-protein interactions, and should guide improvement of both. Our approach now enables targeted design of binders to sites of interest on a wide variety of proteins for therapeutic and diagnostic applications.


2001 ◽  
Vol 45 (9) ◽  
pp. 2571-2576 ◽  
Author(s):  
Alex M. Aronov ◽  
Narsimha R. Munagala ◽  
Irwin D. Kuntz ◽  
Ching C. Wang

ABSTRACT Parasitic protozoa lack the ability to synthesize purine nucleotides de novo, relying instead on purine salvage enzymes for their survival. Guanine phosphoribosyltransferase (GPRT) from the protozoan parasite Giardia lamblia is a potential target for rational antiparasitic drug design, based on the experimental evidence, which indicates the lack of interconversion between adenine and guanine nucleotide pools. The present study is a continuation of our efforts to use three-dimensional structures of parasitic phosphoribosyltransferases (PRTs) to design novel antiparasitic agents. Two micromolar phthalimide-based GPRT inhibitors were identified by screening the in-house phthalimide library. A combination of structure-based scaffold selection using virtual library screening across the PRT gene family and solid phase library synthesis led to identification of smaller (molecular weight, <300) ligands with moderate to low specificity for GPRT; the best inhibitors, GP3 and GP5, had K i values in the 23 to 25 μM range. These results represent significant progress toward the goal of designing potent inhibitors of purine salvage inGiardia parasites. As a second step in this process, altering the phthalimide moiety to optimize interactions in the guanine-binding pocket of GPRT is expected to lead to compounds with promising activity against G. lamblia PRT.


Sign in / Sign up

Export Citation Format

Share Document