QSAR Models to Predict Effect of Concentration on the Adsorption of Phenolic Compounds onto XAD-4 and ZH-01

It is assumed that the experimental adsorption capacity of phenolic compounds onto resin depends upon the molecular properties as well as background concentration of the aquatic system. The utility of this concept has been demonstrated by incorporating concentration as a parameter in quantitative structure-activity relationship (QSAR). DFT-B3LYP method, with the basis set 6-311G **, was employed to calculate quantum mechanical and physicochemical descriptors of phenolic compounds. The logarithm of the adsorption capacity of phenolic compounds on XAD-4 and ZH-01 investigated from the static experiment along with the descriptors mentioned above were used to establish QSAR models. The variables were reduced using stepwise multiple regression method, and the statistical results indicated that the correlation coefficient in the multiple linear regression (MLR) and cross-validation using leave-one-out(LOO) were 0.966, 0.920, 0.905 and 0.797, respectively. To validate the predictive power of resulting models, external validation was performed with Qext2 values of 0.927 and 0.849, respectively. The developed models suggest that the adsorption mechanism of phenolic compounds onto XAD-4 and ZH-01 is different. Concentration, hydrophobic parameter are dominant factors governing the adsorption capacity of phenolic compounds onto XAD-4, while concentration and energy of the highest occupied molecular orbital are dominant factors controlling that of phenolic compounds on ZH-01. The consistency between experimental and predicted values indicates that the developed models can be used for estimating adsorption capacity of phenolic compounds onto XAD-4 and ZH-01.

Download Full-text

QSAR studies for the acute toxicity of nitrobenzenes to the Tetrahymena pyriformis

Journal of the Serbian Chemical Society ◽

10.2298/jsc130910025w ◽

2014 ◽

Vol 79 (9) ◽

pp. 1111-1125 ◽

Cited By ~ 3

Author(s):

Dan-Dan Wang ◽

Lin-Lin Feng ◽

Guang-Yu He ◽

Hai-Qun Chen

Keyword(s):

Statistical Significance ◽

External Validation ◽

Tetrahymena Pyriformis ◽

Quantitative Structure Activity Relationship ◽

Partial Least Square ◽

Molecular Structures ◽

Least Square ◽

Qsar Studies ◽

Qsar Models ◽

Leave One Out

Quantitative structure-activity relationship (QSAR) models play a key role in finding the relationship between molecular structures and the toxicity of nitrobenzenes to Tetrahymena pyriformis. In this work, genetic algorithm, along with partial least square (GA-PLS) was employed to select optimal subset of descriptors that have significant contribution to the toxicity of nitrobenzenes to Tetrahymena pyriformis. A set of five descriptors, namely G2, HOMT, G(Cl?Cl), Mor03v and MAXDP, was used for the prediction of the toxicity of 45 nitrobenzene derivatives and then were used to build the model by multiple linear regression (MLR) method. It turned out that the built model, whose stability was confirmed using the leave-one-out validation and external validation test, showed high statistical significance (R2=0.963, Q2LOO=0.944). Moreover, Y-scrambling test indicated there was no chance correlation in this model.

Download Full-text

QSAR Studies of Halogenated Pyrimidine Derivatives as Inhibitors of Human Dihydroorotate Dehydrogenase Using Modified Bee Algorithm

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207321666180611092540 ◽

2018 ◽

Vol 21 (5) ◽

pp. 381-387 ◽

Cited By ~ 1

Author(s):

Hossein Atabati ◽

Kobra Zarei ◽

Hamid Reza Zare-Mehrjardi

Keyword(s):

External Validation ◽

Correlation Coefficients ◽

Quantitative Structure Activity Relationship ◽

Molecular Structures ◽

Dihydroorotate Dehydrogenase ◽

Pyrimidine Derivatives ◽

Qsar Studies ◽

Bee Algorithm ◽

Test Sets ◽

Leave One Out

Aim and Objective: Human dihydroorotate dehydrogenase (DHODH) catalyzes the fourth stage of the biosynthesis of pyrimidines in cells. Hence it is important to identify suitable inhibitors of DHODH to prevent virus replication. In this study, a quantitative structure-activity relationship was performed to predict the activity of one group of newly synthesized halogenated pyrimidine derivatives as inhibitors of DHODH. Materials and Methods: Molecular structures of halogenated pyrimidine derivatives were drawn in the HyperChem and then molecular descriptors were calculated by DRAGON software. Finally, the most effective descriptors for 32 halogenated pyrimidine derivatives were selected using bee algorithm. Results: The selected descriptors using bee algorithm were applied for modeling. The mean relative error and correlation coefficient were obtained as 2.86% and 0.9627, respectively, while these amounts for the leave one out−cross validation method were calculated as 4.18% and 0.9297, respectively. The external validation was also conducted using two training and test sets. The correlation coefficients for the training and test sets were obtained as 0.9596 and 0.9185, respectively. Conclusion: The results of modeling of present work showed that bee algorithm has good performance for variable selection in QSAR studies and its results were better than the constructed model with the selected descriptors using the genetic algorithm method.

Download Full-text

Prediction of the Toxicity of Binary Mixtures by QSAR Approach Using the Hypothetical Descriptors

International Journal of Molecular Sciences ◽

10.3390/ijms19113423 ◽

2018 ◽

Vol 19 (11) ◽

pp. 3423 ◽

Cited By ~ 12

Author(s):

Ting Wang ◽

Lili Tang ◽

Feng Luan ◽

M. Natália D. S. Cordeiro

Keyword(s):

Correlation Coefficient ◽

Binary Mixtures ◽

Quantitative Structure Activity Relationship ◽

Training Set ◽

Statistical Parameters ◽

Test Set ◽

Qsar Models ◽

Forward Stepwise ◽

Leave One Out ◽

External Test

Organic compounds are often exposed to the environment, and have an adverse effect on the environment and human health in the form of mixtures, rather than as single chemicals. In this paper, we try to establish reliable and developed classical quantitative structure–activity relationship (QSAR) models to evaluate the toxicity of 99 binary mixtures. The derived QSAR models were built by forward stepwise multiple linear regression (MLR) and nonlinear radial basis function neural networks (RBFNNs) using the hypothetical descriptors, respectively. The statistical parameters of the MLR model provided were N (number of compounds in training set) = 79, R2 (the correlation coefficient between the predicted and observed activities)= 0.869, LOOq2 (leave-one-out correlation coefficient) = 0.864, F (Fisher’s test) = 165.494, and RMS (root mean square) = 0.599 for the training set, and Next (number of compounds in external test set) = 20, R2 = 0.853, qext2 (leave-one-out correlation coefficient for test set)= 0.825, F = 30.861, and RMS = 0.691 for the external test set. The RBFNN model gave the statistical results, namely N = 79, R2 = 0.925, LOOq2 = 0.924, F = 950.686, RMS = 0.447 for the training set, and Next = 20, R2 = 0.896, qext2 = 0.890, F = 155.424, RMS = 0.547 for the external test set. Both of the MLR and RBFNN models were evaluated by some statistical parameters and methods. The results confirm that the built models are acceptable, and can be used to predict the toxicity of the binary mixtures.

Download Full-text

Chemometric Modeling of the Ecotoxicity of Industrial Chemicals to an Avian Species Anas Platyrhynchos

International Journal of Quantitative Structure-Property Relationships ◽

10.4018/ijqspr.2020040101 ◽

2020 ◽

Vol 5 (2) ◽

pp. 1-16

Author(s):

Shinjita Ghosh ◽

Supratik Kar ◽

Jerzy Leszczynski

Keyword(s):

Anas Platyrhynchos ◽

External Validation ◽

Quantitative Structure Activity Relationship ◽

Randomization Test ◽

Avian Species ◽

Industrial Chemicals ◽

Linear Discriminant ◽

Topological Distance ◽

Qsar Models ◽

Classification And Regression

Birds or avians have been imperative species in the ecology, having been evaluated in an effort to understand the toxic effects of endocrine disruption. The ecotoxicity of 56 industrial chemicals classified as endocrine disruptors were modeled employing classification and regression-based quantitative structure-activity relationship (QSAR) models to an important avian species, Anas platyrhynchos. The classification- and regression-based QSAR models were developed using linear discriminant analysis (LDA) and partial least squares (PLS) tools, respectively. All models were validated meticulously by employing internal and external validation metrics followed by randomization test, applicability domain (AD) study, and intelligent consensus prediction of all individual models. Features like topological distance of 1, 3, and 5 between atoms O-P, C-P, and N-S, correspondingly, along with the CR3X fragment, can be responsible for an increase in toxicity. On the contrary, the presence of S-Cl with topological distance 6 is accountable for lowering the toxicity of towards A. platyrhynchos. The developed chemometric models can offer significant evidence and guidance in the framework of virtual screening as well as a toxicity prediction of new and/or untested chemical libraries towards this specific avian species.

Download Full-text

A QSAR Study Based on SVM for the Compound of Hydroxyl Benzoic Esters

Bioinorganic Chemistry and Applications ◽

10.1155/2017/4914272 ◽

2017 ◽

Vol 2017 ◽

pp. 1-10 ◽

Cited By ~ 2

Author(s):

Li Wen ◽

Qing Li ◽

Wei Li ◽

Qiao Cai ◽

Yong-Ming Cai

Keyword(s):

Standard Deviation ◽

Correlation Coefficient ◽

Nonlinear Models ◽

Predictive Ability ◽

Quantitative Structure Activity Relationship ◽

Support Vector ◽

Qsar Study ◽

Stability Robustness ◽

Qsar Models ◽

Leave One Out

Hydroxyl benzoic esters are preservative, being widely used in food, medicine, and cosmetics. To explore the relationship between the molecular structure and antibacterial activity of these compounds and predict the compounds with similar structures, Quantitative Structure-Activity Relationship (QSAR) models of 25 kinds of hydroxyl benzoic esters with the quantum chemical parameters and molecular connectivity indexes are built based on support vector machine (SVM) by using R language. The External Standard Deviation Error of Prediction (SDEPext), fitting correlation coefficient (R2), and leave-one-out cross-validation (Q2LOO) are used to value the reliability, stability, and predictive ability of models. The results show that R2 and Q2LOO of 4 kinds of nonlinear models are more than 0.6 and SDEPext is 0.213, 0.222, 0.189, and 0.218, respectively. Compared with the multiple linear regression (MLR) model (R2=0.421, RSD = 0.260), the correlation coefficient and the standard deviation are both better than MLR. The reliability, stability, robustness, and external predictive ability of models are good, particularly of the model of linear kernel function and eps-regression type. This model can predict the antimicrobial activity of the compounds with similar structure in the applicability domain.

Download Full-text

A QSAR STUDY OF SUBSTITUTED PYRAZOLINE DERIVATIVES AS POTENTIAL ANTI-TUBERCULOSIS AGENTS

INDIAN DRUGS ◽

10.53879/id.54.04.10781 ◽

2017 ◽

Vol 54 (04) ◽

pp. 22-31

Author(s):

M. C Sharma ◽

Keyword(s):

Statistical Significance ◽

External Validation ◽

Model Development ◽

Qsar Model ◽

Quantitative Structure Activity Relationship ◽

Partial Least Square ◽

Least Square ◽

Qsar Study ◽

Partial Least Square Analysis ◽

Qsar Models

A quantitative structure–activity relationship (QSAR) of a series of substituted pyrazoline derivatives, in regard to their anti-tuberculosis activity, has been studied using the partial least square (PLS) analysis method. QSAR model development of 64 pyrazoline derivatives was carried out to predict anti-tubercular activity. Partial least square analysis was applied to derive QSAR models, which were further evaluated for statistical significance and predictive power by internal and external validation. The best QSAR model with good external and internal predictivity for the training and test set has shown cross validation (q2) and external validation (pred_r2) values of 0.7426 and 0.7903, respectively. Two-dimensional QSAR analyses of such pyrazoline derivatives provide important structural insights for designing potent antituberculosis drugs.

Download Full-text

In-silico Design of Aryl and Aralkyl Amine-Based Triazolopyrimidine Derivatives with Enhanced Activity Against Resistant Plasmodium falciparum

Chemistry Africa ◽

10.1007/s42250-020-00199-4 ◽

2020 ◽

Author(s):

Zakari Ya’u Ibrahim ◽

Adamu Uzairu ◽

Gideon Shallangwa ◽

Stephen Abechi

Keyword(s):

In Silico ◽

Density Functional ◽

Antimalarial Activity ◽

Qsar Model ◽

Quantitative Structure Activity Relationship ◽

Randomization Test ◽

Basis Set ◽

Coefficient Of Determination ◽

Statistical Parameters ◽

Leave One Out

Abstract A blend of genetic algorithm with multiple linear regression (GA-MLR) method was utilized in generating a quantitative structure–activity relationship (QSAR) model on the antimalarial activity of aryl and aralkyl amine-based triazolopyrimidine derivatives. The structures of derivatives were optimized using density functional theory (DFT) DFT/B3LYP/6–31 + G* basis set to generate their molecular descriptors, where two (2) predictive models were developed with the aid of these descriptors. The model with an excellent statistical parameters; high coefficient of determination (R2) = 0.8884, cross-validated R2 (Q2cv) = 0.8317 and highest external validated R2 (R2pred) = 0.7019 was selected as the best model. The model generated was validated through internal (leave-one-out (LOO) cross-validation), external test set, and Y-randomization test. These parameters are indicators of robustness, excellent prediction, and validity of the selected model. The most relevant descriptor to the antimalarial activity in the model was found to be GATS6p (Geary autocorrelation—lag 6/weighted by polarizabilities), in the model due to its highest mean effect. The descriptor (GATS6p) was significant in the in-silico design of sixteen (16) derivatives of aryl and aralkyl amine-based triazolopyrimidine adopting compound DSM191 with the highest activity (pEC50 = 7.1805) as the design template. The design compound D8 was found to be the most active compound due to its superior hypothetical activity (pEC50 = 8.9545).

Download Full-text

QSAR Models for Human Carcinogenicity: An Assessment Based on Oral and Inhalation Slope Factors

Molecules ◽

10.3390/molecules26010127 ◽

2020 ◽

Vol 26 (1) ◽

pp. 127

Author(s):

Cosimo Toma ◽

Alberto Manganaro ◽

Giuseppa Raitano ◽

Marco Marzo ◽

Domenico Gadaleta ◽

...

Keyword(s):

Risk Assessment ◽

Regression Models ◽

External Validation ◽

Inhalation Exposure ◽

Quantitative Structure Activity Relationship ◽

Learning Approaches ◽

Chemical Safety ◽

Qsar Models ◽

Using Data

Carcinogenicity is a crucial endpoint for the safety assessment of chemicals and products. During the last few decades, the development of quantitative structure–activity relationship ((Q)SAR) models has gained importance for regulatory use, in combination with in vitro testing or expert-based reasoning. Several classification models can now predict both human and rat carcinogenicity, but there are few models to quantitatively assess carcinogenicity in humans. To our knowledge, slope factor (SF), a parameter describing carcinogenicity potential used especially for human risk assessment of contaminated sites, has never been modeled for both inhalation and oral exposures. In this study, we developed classification and regression models for inhalation and oral SFs using data from the Risk Assessment Information System (RAIS) and different machine learning approaches. The models performed well in classification, with accuracies for the external set of 0.76 and 0.74 for oral and inhalation exposure, respectively, and r2 values of 0.57 and 0.65 in the regression models for oral and inhalation SFs in external validation. These models might therefore support regulators in (de)prioritizing substances for regulatory action and in weighing evidence in the context of chemical safety assessments. Moreover, these models are implemented on the VEGA platform and are now freely downloadable online.

Download Full-text

Multilayer Perceptron Model for Predicting Acute Toxicity of Fungicides on Rats

International Journal of Quantitative Structure-Property Relationships ◽

10.4018/ijqspr.2018010106 ◽

2018 ◽

Vol 3 (1) ◽

pp. 100-118 ◽

Cited By ~ 1

Author(s):

Mabrouk Hamadache ◽

Abdeltif Amrane ◽

Salah Hanini ◽

Othmane Benkortbi

Keyword(s):

Molecular Descriptors ◽

External Validation ◽

Qsar Model ◽

Quantitative Structure Activity Relationship ◽

Acute Oral Toxicity ◽

Oral Toxicity ◽

Structure Activity ◽

Marquardt Algorithm ◽

Qsar Models ◽

Validation Set

Quantitative Structure Activity Relationship (QSAR) models are expected to play an important role in the risk assessment of chemicals on humans and the environment. In this study, a QSAR model based on 10 molecular descriptors to predict acute oral toxicity of 91 fungicides to rats was developed and validated. Good results (PRESS/SSY = 0.085 and VIF < 5) were obtained, showing the validation of descriptors in the obtained model. The best results were obtained with a 10/11/1 Artificial Neural Network model trained with the Levenberg-Marquardt algorithm. The prediction accuracy for the external validation set was estimated by the Q2ext which was equal to 0.960. Accordingly, the model developed in this study provided excellent predictions and can be used to predict the acute oral toxicity of fungicides, particularly for those that have not been tested as well as new fungicides.

Download Full-text

Predicting AAK1/GAK Dual-Target Inhibitor against SARS-CoV-2 Viral Entry into Host Cells: An in silico Approach

Journal of Smart Science and Technology ◽

10.24191/jsst.v1i1.14 ◽

2021 ◽

Vol 1 (1) ◽

pp. 48-67

Author(s):

Xavier Chee Wezen ◽

Clement Sim Jun Wen ◽

Lilian Siaw Yung Ping ◽

Yeong Kah Ho ◽

Kong Hao Qing ◽

...

Keyword(s):

Viral Entry ◽

Performance Metrics ◽

Mean Squared Error ◽

Quantitative Structure Activity Relationship ◽

Randomization Test ◽

Host Cells ◽

Qsar Models ◽

Leave One Out ◽

External Test ◽

Dual Target

Clathrin-mediated endocytosis (CME) is a normal biological process where cellular contents are transported into the cells. However, this process is often hijacked by different viruses to enter host cells and cause infections. Recently, two proteins that regulate CME – AAK1 and GAK – have been proposed as potential therapeutic targets for designing broad-spectrum antiviral drugs. In this work, we curated two compound datasets containing 83 AAK1 inhibitors and 196 GAK inhibitors each. Subsequently, machine learning methods, namely Random Forest, Elastic Net and Sequential Minimal Optimization, were used to construct Quantitative Structure Activity Relationship (QSAR) models to predict small molecule inhibitors of AAK1 and GAK. To ensure predictivity, these models were evaluated by using Leave-One-Out (LOO) cross validation and with an external test set. In all cases, our QSAR models achieved a q2LOO in range of 0.64 to 0.84 (Root Mean Squared Error; RMSE = 0.41 to 0.52) and a q2ext in range of 0.57 to 0.92 (RMSE = 0.36 to 0.61). Besides, our QSAR models were evaluated by using additional QSAR performance metrics and y-randomization test. Finally, by using a concensus scoring approach, nine chemical compounds from the Drugbank compound library were predicted as AAK1/GAK dual-target inhibitors. The electrostatic potential maps for the nine compounds were generated and compared against two known dual-target inhibitors, sunitinib and baricitinib. Our work provides the rationale to validate these nine compounds experimentally against the protein targets AAK1 and GAK.

Download Full-text