scholarly journals VISAR: an interactive tool for dissecting chemical features learned by deep neural network QSAR models

2020 ◽  
Vol 36 (11) ◽  
pp. 3610-3612
Author(s):  
Qingyang Ding ◽  
Siyu Hou ◽  
Songpeng Zu ◽  
Yonghui Zhang ◽  
Shao Li

Abstract Summary Although many quantitative structure–activity relationship (QSAR) models are trained and evaluated for their predictive merits, understanding what models have been learning is of critical importance. However, the interpretation and visualization of QSAR model results remain challenging, especially for ‘black box’ models such as deep neural network (DNN). Here, we take a step forward to interpret the learned chemical features from DNN QSAR models, and present VISAR, an interactive tool for visualizing the structure–activity relationship. VISAR first provides functions to construct and train DNN models. Then VISAR builds the activity landscapes based on a series of compounds using the trained model, showing the correlation between the chemical feature space and the experimental activity space after model training, and allowing for knowledge mining from a global perspective. VISAR also maps the gradients of the chemical features to the corresponding compounds as contribution weights for each atom, and visualizes the positive and negative contributor substructures suggested by the models from a local perspective. Using the web application of VISAR, users could interactively explore the activity landscape and the color-coded atom contributions. We propose that VISAR could serve as a helpful tool for training and interactive analysis of the DNN QSAR model, providing insights for drug design, and an additional level of model validation. Availability and implementation The source code and usage instructions for VISAR are available on github https://github.com/qid12/visar. Contact [email protected] Supplementary information Supplementary data are available at Bioinformatics online.

2021 ◽  
Vol 2021 ◽  
pp. 1-8
Author(s):  
Qingwei Bu ◽  
Qingshan Li ◽  
Yun Liu ◽  
Chun Cai

Assessing the ecotoxicity of pharmaceuticals is of urgent need due to the recognition of their possible adverse effects on nontarget organisms in the aquatic environment. The reality of ecotoxicity data scarcity promotes the development and application of quantitative structure activity relationship (QSAR) models. In the present study, we aimed to clarify whether a QSAR model of ecotoxicity specifically for pharmaceuticals is needed considering that pharmaceuticals are a class of chemicals with complex structures, multiple functional groups, and reactive properties. To this end, we conducted a performance comparison of two previously developed and validated QSAR models specifically for pharmaceuticals with the commonly used narcosis toxicity prediction model, i.e., Ecological Structure Activity Relationship (ECOSAR), using a subset of pharmaceuticals produced in China that had not been included in the training datasets of QSAR models under consideration. A variety of statistical measures demonstrated that the pharmaceutical specific model outperformed ECOSAR, indicating the necessity of developing a specific QSAR model of ecotoxicity for the active pharmaceutical contaminants. ECOSAR, which was generally used to predict the baseline or the minimum toxicity of a compound, generally underestimated the ecotoxicity of the analyzed pharmaceuticals. This could possibly be because some pharmaceuticals can react through specific modes of action. Nonetheless, it should be noted that 95% prediction intervals spread over approximately four orders of magnitude for both tested QSAR models specifically for pharmaceuticals.


2021 ◽  
Vol 43 (1) ◽  
Author(s):  
Toshio Kasamatsu ◽  
Airi Kitazawa ◽  
Sumie Tajima ◽  
Masahiro Kaneko ◽  
Kei-ichi Sugiyama ◽  
...  

Abstract Background Food flavors are relatively low molecular weight chemicals with unique odor-related functional groups that may also be associated with mutagenicity. These chemicals are often difficult to test for mutagenicity by the Ames test because of their low production and peculiar odor. Therefore, application of the quantitative structure–activity relationship (QSAR) approach is being considered. We used the StarDrop™ Auto-Modeller™ to develop a new QSAR model. Results In the first step, we developed a new robust Ames database of 406 food flavor chemicals consisting of existing Ames flavor chemical data and newly acquired Ames test data. Ames results for some existing flavor chemicals have been revised by expert reviews. We also collected 428 Ames test datasets for industrial chemicals from other databases that are structurally similar to flavor chemicals. A total of 834 chemicals’ Ames test datasets were used to develop the new QSAR models. We repeated the development and verification of prototypes by selecting appropriate modeling methods and descriptors and developed a local QSAR model. A new QSAR model “StarDrop NIHS 834_67” showed excellent performance (sensitivity: 79.5%, specificity: 96.4%, accuracy: 94.6%) for predicting Ames mutagenicity of 406 food flavors and was better than other commercial QSAR tools. Conclusions A local QSAR model, StarDrop NIHS 834_67, was customized to predict the Ames mutagenicity of food flavor chemicals and other low molecular weight chemicals. The model can be used to assess the mutagenicity of food flavors without actual testing.


RSC Advances ◽  
2015 ◽  
Vol 5 (70) ◽  
pp. 57030-57037 ◽  
Author(s):  
Arafeh Bigdeli ◽  
Mohammad Reza Hormozi-Nezhad ◽  
Hadi Parastar

A nano-quantitative structure-activity relationship (nano-QSAR) model is proposed to indicate the determining factors responsible in the exocytosis of gold nanoparticles in macrophages.


Foods ◽  
2019 ◽  
Vol 8 (12) ◽  
pp. 628 ◽  
Author(s):  
Rosa Perestrelo ◽  
Catarina Silva ◽  
Miguel X. Fernandes ◽  
José S. Câmara

Terpenoids, including monoterpenoids (C10), norisoprenoids (C13), and sesquiterpenoids (C15), constitute a large group of plant-derived naturally occurring secondary metabolites with highly diverse chemical structures. A quantitative structure–activity relationship (QSAR) model to predict terpenoid toxicity and to evaluate the influence of their chemical structures was developed in this study by assessing in real time the toxicity of 27 terpenoid standards using the Gram-negative bioluminescent Vibrio fischeri. Under the test conditions, at a concentration of 1 µM, the terpenoids showed a toxicity level lower than 5%, with the exception of geraniol, citral, (S)-citronellal, geranic acid, (±)-α-terpinyl acetate, and geranyl acetone. Moreover, the standards tested displayed a toxicity level higher than 30% at concentrations of 50–100 µM, with the exception of (+)-valencene, eucalyptol, (+)-borneol, guaiazulene, β-caryophellene, and linalool oxide. Regarding the functional group, terpenoid toxicity was observed in the following order: alcohol > aldehyde ~ ketone > ester > hydrocarbons. The CODESSA software was employed to develop QSAR models based on the correlation of terpenoid toxicity and a pool of descriptors related to each chemical structure. The QSAR models, based on t-test values, showed that terpenoid toxicity was mainly attributed to geometric (e.g., asphericity) and electronic (e.g., maximum partial charge for a carbon (C) atom (Zefirov’s partial charge (PC)) descriptors. Statistically, the most significant overall correlation was the four-parameter equation with a training coefficient and test coefficient correlation higher than 0.810 and 0.535, respectively, and a square coefficient of cross-validation (Q2) higher than 0.689. According to the obtained data, the QSAR models are suitable and rapid tools to predict terpenoid toxicity in a diversity of food products.


2015 ◽  
Vol 14 (06) ◽  
pp. 1550040 ◽  
Author(s):  
Anuradha Sharma ◽  
Poonam Piplani

Alzheimer's disease (AD) is the most common cause of dementia in old aged people and clinically used drugs for treatment are associated with side effects. Thus, there is a current demand for the discovery and development of new potential molecules. However, the recent advances in drug therapy have challenged the predominance of the disease. In this manuscript, an attempt has been made to develop the 2D and 3D quantitative structure–activity relationship (QSAR) models for a series of rutaecarpine, quinazolines and 7,8-dehydrorutaecarpine derivatives to obtain insights to Acetylcholinesterase (AChE) inhibition. Five different QSAR models have been generated and validated using a set of 52 compounds comprising of varying scaffolds with IC50 values ranging from 11,000 nM to 0.6 nM. These AChE-specific prediction models (M1–M5) adequately reflect the structure–activity relationship of the existing AChE inhibitors. Out of all developed models, QSAR model generated using ADME properties has been found to be the best with satisfactory statistical significance (regression (r2) of 0.9309 and regression adjusted coefficient of variation [Formula: see text] of 0.9194). The QSAR models highlight the importance of aromatic moiety as their presence in the structure influence the biological activity. Additional insights on the compounds show that acyclic amines attached to side chain have lower activity than cyclic amines. The QSAR models pinpointing structural basis for the AChEIs suggest new guidelines for the design of novel molecules.


2017 ◽  
Vol 2017 ◽  
pp. 1-11 ◽  
Author(s):  
Manman Zhao ◽  
Lin Wang ◽  
Linfeng Zheng ◽  
Mengying Zhang ◽  
Chun Qiu ◽  
...  

Epidermal growth factor receptor (EGFR) is an important target for cancer therapy. In this study, EGFR inhibitors were investigated to build a two-dimensional quantitative structure-activity relationship (2D-QSAR) model and a three-dimensional quantitative structure-activity relationship (3D-QSAR) model. In the 2D-QSAR model, the support vector machine (SVM) classifier combined with the feature selection method was applied to predict whether a compound was an EGFR inhibitor. As a result, the prediction accuracy of the 2D-QSAR model was 98.99% by using tenfold cross-validation test and 97.67% by using independent set test. Then, in the 3D-QSAR model, the model with q2=0.565 (cross-validated correlation coefficient) and r2=0.888 (non-cross-validated correlation coefficient) was built to predict the activity of EGFR inhibitors. The mean absolute error (MAE) of the training set and test set was 0.308 log units and 0.526 log units, respectively. In addition, molecular docking was also employed to investigate the interaction between EGFR inhibitors and EGFR.


Sign in / Sign up

Export Citation Format

Share Document