Featurization strategies for protein–ligand interactions and their applications in scoring function development

Author(s):  
Guoli Xiong ◽  
Chao Shen ◽  
Ziyi Yang ◽  
Dejun Jiang ◽  
Shao Liu ◽  
...  
2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Xujun Zhang ◽  
Chao Shen ◽  
Xueying Guo ◽  
Zhe Wang ◽  
Gaoqi Weng ◽  
...  

AbstractVirtual screening (VS) based on molecular docking has emerged as one of the mainstream technologies of drug discovery due to its low cost and high efficiency. However, the scoring functions (SFs) implemented in most docking programs are not always accurate enough and how to improve their prediction accuracy is still a big challenge. Here, we propose an integrated platform called ASFP, a web server for the development of customized SFs for structure-based VS. There are three main modules in ASFP: (1) the descriptor generation module that can generate up to 3437 descriptors for the modelling of protein–ligand interactions; (2) the AI-based SF construction module that can establish target-specific SFs based on the pre-generated descriptors through three machine learning (ML) techniques; (3) the online prediction module that provides some well-constructed target-specific SFs for VS and an additional generic SF for binding affinity prediction. Our methodology has been validated on several benchmark datasets. The target-specific SFs can achieve an average ROC AUC of 0.973 towards 32 targets and the generic SF can achieve the Pearson correlation coefficient of 0.81 on the PDBbind version 2016 core set. To sum up, the ASFP server is a powerful tool for structure-based VS.


2013 ◽  
Vol 19 (11) ◽  
pp. 5015-5030 ◽  
Author(s):  
Yingtao Liu ◽  
Zhijian Xu ◽  
Zhuo Yang ◽  
Kaixian Chen ◽  
Weiliang Zhu

2020 ◽  
Vol 21 (15) ◽  
pp. 5183 ◽  
Author(s):  
Eric D. Boittier ◽  
Yat Yin Tang ◽  
McKenna E. Buckley ◽  
Zachariah P. Schuurs ◽  
Derek J. Richard ◽  
...  

A promising protein target for computational drug development, the human cluster of differentiation 38 (CD38), plays a crucial role in many physiological and pathological processes, primarily through the upstream regulation of factors that control cytoplasmic Ca2+ concentrations. Recently, a small-molecule inhibitor of CD38 was shown to slow down pathways relating to aging and DNA damage. We examined the performance of seven docking programs for their ability to model protein-ligand interactions with CD38. A test set of twelve CD38 crystal structures, containing crystallized biologically relevant substrates, were used to assess pose prediction. The rankings for each program based on the median RMSD between the native and predicted were Vina, AD4 > PLANTS, Gold, Glide, Molegro > rDock. Forty-two compounds with known affinities were docked to assess the accuracy of the programs at affinity/ranking predictions. The rankings based on scoring power were: Vina, PLANTS > Glide, Gold > Molegro >> AutoDock 4 >> rDock. Out of the top four performing programs, Glide had the only scoring function that did not appear to show bias towards overpredicting the affinity of the ligand-based on its size. Factors that affect the reliability of pose prediction and scoring are discussed. General limitations and known biases of scoring functions are examined, aided in part by using molecular fingerprints and Random Forest classifiers. This machine learning approach may be used to systematically diagnose molecular features that are correlated with poor scoring accuracy.


2013 ◽  
Vol 53 (3) ◽  
pp. 592-600 ◽  
Author(s):  
Guo-Bo Li ◽  
Ling-Ling Yang ◽  
Wen-Jing Wang ◽  
Lin-Li Li ◽  
Sheng-Yong Yang

2015 ◽  
Vol 21 (6) ◽  
Author(s):  
Zhuo Yang ◽  
Yingtao Liu ◽  
Zhaoqiang Chen ◽  
Zhijian Xu ◽  
Jiye Shi ◽  
...  

Author(s):  
Zhiqiang Yan ◽  
Jin Wang

Scoring function of protein-ligand interactions is used to recognize the “native” binding pose of a ligand on the protein and to predict the binding affinity, so that the active small molecules can be discriminated from the non-active ones. Scoring function is widely used in computationally molecular docking and structure-based drug discovery. The development and improvement of scoring functions have broad implications in pharmaceutical industry and academic research. During the past three decades, much progress have been made in methodology and accuracy for scoring functions, and many successful cases have be witnessed in virtual database screening. In this chapter, the authors introduced the basic types of scoring functions and their derivations, the commonly-used evaluation methods and benchmarks, as well as the underlying challenges and current solutions. Finally, the authors discussed the promising directions to improve and develop scoring functions for future molecular docking-based drug discovery.


Sign in / Sign up

Export Citation Format

Share Document