scholarly journals PDBspheres - a method for finding 3D similarities in local regions in proteins

2022 ◽  
Author(s):  
Adam Zemla ◽  
Jonathan E. Allen ◽  
Dan Kirshner ◽  
Felice C. Lightstone

We present a structure-based method for finding and evaluating structural similarities in protein regions relevant to ligand binding. PDBspheres comprises an exhaustive library of protein structure regions (spheres) adjacent to complexed ligands derived from the Protein Data Bank (PDB), along with methods to find and evaluate structural matches between a protein of interest and spheres in the library. Currently, PDBspheres library contains more than 2 million spheres, organized to facilitate searches by sequence and/or structure similarity of protein-ligand binding sites or interfaces between interacting molecules. PDBspheres uses the LGA structure alignment algorithm as the main engine for detecting structure similarities between the protein of interest and library spheres. An all-atom structure similarity metric ensures that sidechain placement is taken into account in the PDBspheres primary assessment of confidence in structural matches. In this paper, we (1) describe the PDBspheres method, (2) demonstrate how PDBspheres can be used to detect and characterize binding sites in protein structures, (3) compare PDBspheres use for binding site prediction with seven other binding site prediction methods using a curated dataset of 2,528 ligand-bound and ligand-free crystal structures, and (4) use PDBspheres to cluster pockets and assess structural similarities among protein binding sites of the 4,876 structures in the refined set of PDBbind 2019 dataset. The PDBspheres library is made publicly available for download at https://proteinmodel.org/AS2TS/PDBspheres

2019 ◽  
Vol 47 (W1) ◽  
pp. W345-W349 ◽  
Author(s):  
Lukas Jendele ◽  
Radoslav Krivak ◽  
Petr Skoda ◽  
Marian Novotny ◽  
David Hoksza

AbstractPrankWeb is an online resource providing an interface to P2Rank, a state-of-the-art method for ligand binding site prediction. P2Rank is a template-free machine learning method based on the prediction of local chemical neighborhood ligandability centered on points placed on a solvent-accessible protein surface. Points with a high ligandability score are then clustered to form the resulting ligand binding sites. In addition, PrankWeb provides a web interface enabling users to easily carry out the prediction and visually inspect the predicted binding sites via an integrated sequence-structure view. Moreover, PrankWeb can determine sequence conservation for the input molecule and use this in both the prediction and result visualization steps. Alongside its online visualization options, PrankWeb also offers the possibility of exporting the results as a PyMOL script for offline visualization. The web frontend communicates with the server side via a REST API. In high-throughput scenarios, therefore, users can utilize the server API directly, bypassing the need for a web-based frontend or installation of the P2Rank application. PrankWeb is available at http://prankweb.cz/, while the web application source code and the P2Rank method can be accessed at https://github.com/jendelel/PrankWebApp and https://github.com/rdk/p2rank, respectively.


2019 ◽  
Author(s):  
Lukas Jendele ◽  
Radoslav Krivak ◽  
Petr Skoda ◽  
Marian Novotny ◽  
David Hoksza

ABSTRACTPrankWeb is an online resource providing an interface to P2Rank, a state-of-the-art ligand binding site prediction method. P2Rank is a template-free machine learning method which is based on the prediction of ligandability of local chemical neighborhoods centered on points placed on a solvent accessible surface of a protein. Points with high ligandability score are then clustered to form the resulting ligand binding sites. On top of that, PrankWeb then provides a web interface enabling users to easily carry out the prediction and visually inspect the predicted binding sites via an integrated sequence-structure view. Moreover, PrankWeb can determine sequence conservation for the input molecule and use it in both the prediction and results visualization steps. Alongside its online visualization options, PrankWeb also offers the possibility to export the results as a PyMOL script for offline visualization. The web frontend communicates with the serer side via a REST API. Therefore, in high-throughput scenarios users can utilize the server API directly, bypassing the need for a webbased front end or installation of the P2Rank application. PrankWeb is available at http://prankweb.cz/. The source code of the web application and the P2Rank method can be accessed at https://github.com/jendelel/PrankWebApp and https://github.com/rdk/p2rank, respectively.


2021 ◽  
Vol 19 (02) ◽  
pp. 2150006
Author(s):  
Fatemeh Nazem ◽  
Fahimeh Ghasemi ◽  
Afshin Fassihi ◽  
Alireza Mehri Dehnavi

Binding site prediction for new proteins is important in structure-based drug design. The identified binding sites may be helpful in the development of treatments for new viral outbreaks in the world when there is no information available about their pockets with COVID-19 being a case in point. Identification of the pockets using computational methods, as an alternative method, has recently attracted much interest. In this study, the binding site prediction is viewed as a semantic segmentation problem. An improved 3D version of the U-Net model based on the dice loss function is utilized to predict the binding sites accurately. The performance of the proposed model on the independent test datasets and SARS-COV-2 shows the segmentation model could predict the binding sites with a more accurate shape than the recently published deep learning model, i.e. DeepSite. Therefore, the model may help predict the binding sites of proteins and could be used in drug design for novel proteins.


2019 ◽  
Author(s):  
Sebastian Daberdaku

Protein pockets and cavities usually coincide with the active sites of biological processes, and their identification is significant since it constitutes an important step for structure-based drug design and protein-ligand docking applications. This research presents PoCavEDT, an automated purely geometric technique for the identification of binding pockets and occluded cavities in proteins based on the 3D Euclidean Distance Transform. Candidate protein pocket regions are identified between two Solvent-Excluded surfaces generated with the Euclidean Distance Transform using different probe spheres, which depend on the size of the binding ligand. The application of simple, yet effective geometrical heuristics ensures that the proposed method obtains very good ligand binding site prediction results. The method was applied to a representative set of protein-ligand complexes and their corresponding unbound protein structures to evaluate its ligand binding site prediction capabilities. Its performance was compared to the results achieved with several purely geometric pocket and cavity prediction methods, namely SURFNET, PASS, CAST, LIGSITE, LIGSITECS, PocketPicker and POCASA. Success rates PoCavEDT were comparable to those of POCASA and outperformed the other software.


2021 ◽  
Author(s):  
Rishal Aggarwal ◽  
Akash Gupta ◽  
Vineeth Chelur ◽  
C. V. Jawahar ◽  
U. Deva Priyakumar

<div> A structure-based drug design pipeline involves the development of potential drug molecules or ligands that form stable complexes with a given receptor at its binding site. A prerequisite to this is finding druggable and functionally relevant binding sites on the 3D structure of the protein. Although several methods for detecting binding sites have been developed beforehand, a majority of them surprisingly fail in the identification and ranking of binding sites accurately. The rapid adoption and success of deep learning algorithms in various sections of structural biology beckons the usage of such algorithms for accurate binding site detection. As a combination of geometry based software and deep learning, we report a novel framework, DeepPocket that utilises 3D convolutional neural networks for the rescoring of pockets identified by Fpocket and further segments these identified cavities on the protein surface. Apart from this, we also propose another dataset SC6K containing protein structures submitted in the Protein Data Bank (PDB) from January 2018 till February 2020 for ligand binding site (LBS) detection. DeepPocket's results on various binding site datasets and SC6K highlights its better performance over current state-of-the-art methods and good generalization ability over novel structures. </div><div><br></div>


Sign in / Sign up

Export Citation Format

Share Document