QSAR Models to Predict Effect of Concentration on the Adsorption of Phenolic Compounds onto XAD-4 and ZH-01

2011 ◽  
Vol 356-360 ◽  
pp. 340-344
Author(s):  
Yun Lan Gu ◽  
Zhen Xing Li ◽  
Zheng Hao Fei ◽  
Gen Cheng Zhang

It is assumed that the experimental adsorption capacity of phenolic compounds onto resin depends upon the molecular properties as well as background concentration of the aquatic system. The utility of this concept has been demonstrated by incorporating concentration as a parameter in quantitative structure-activity relationship (QSAR). DFT-B3LYP method, with the basis set 6-311G **, was employed to calculate quantum mechanical and physicochemical descriptors of phenolic compounds. The logarithm of the adsorption capacity of phenolic compounds on XAD-4 and ZH-01 investigated from the static experiment along with the descriptors mentioned above were used to establish QSAR models. The variables were reduced using stepwise multiple regression method, and the statistical results indicated that the correlation coefficient in the multiple linear regression (MLR) and cross-validation using leave-one-out(LOO) were 0.966, 0.920, 0.905 and 0.797, respectively. To validate the predictive power of resulting models, external validation was performed with Qext2 values of 0.927 and 0.849, respectively. The developed models suggest that the adsorption mechanism of phenolic compounds onto XAD-4 and ZH-01 is different. Concentration, hydrophobic parameter are dominant factors governing the adsorption capacity of phenolic compounds onto XAD-4, while concentration and energy of the highest occupied molecular orbital are dominant factors controlling that of phenolic compounds on ZH-01. The consistency between experimental and predicted values indicates that the developed models can be used for estimating adsorption capacity of phenolic compounds onto XAD-4 and ZH-01.

2014 ◽  
Vol 79 (9) ◽  
pp. 1111-1125 ◽  
Author(s):  
Dan-Dan Wang ◽  
Lin-Lin Feng ◽  
Guang-Yu He ◽  
Hai-Qun Chen

Quantitative structure-activity relationship (QSAR) models play a key role in finding the relationship between molecular structures and the toxicity of nitrobenzenes to Tetrahymena pyriformis. In this work, genetic algorithm, along with partial least square (GA-PLS) was employed to select optimal subset of descriptors that have significant contribution to the toxicity of nitrobenzenes to Tetrahymena pyriformis. A set of five descriptors, namely G2, HOMT, G(Cl?Cl), Mor03v and MAXDP, was used for the prediction of the toxicity of 45 nitrobenzene derivatives and then were used to build the model by multiple linear regression (MLR) method. It turned out that the built model, whose stability was confirmed using the leave-one-out validation and external validation test, showed high statistical significance (R2=0.963, Q2LOO=0.944). Moreover, Y-scrambling test indicated there was no chance correlation in this model.


2018 ◽  
Vol 21 (5) ◽  
pp. 381-387 ◽  
Author(s):  
Hossein Atabati ◽  
Kobra Zarei ◽  
Hamid Reza Zare-Mehrjardi

Aim and Objective: Human dihydroorotate dehydrogenase (DHODH) catalyzes the fourth stage of the biosynthesis of pyrimidines in cells. Hence it is important to identify suitable inhibitors of DHODH to prevent virus replication. In this study, a quantitative structure-activity relationship was performed to predict the activity of one group of newly synthesized halogenated pyrimidine derivatives as inhibitors of DHODH. Materials and Methods: Molecular structures of halogenated pyrimidine derivatives were drawn in the HyperChem and then molecular descriptors were calculated by DRAGON software. Finally, the most effective descriptors for 32 halogenated pyrimidine derivatives were selected using bee algorithm. Results: The selected descriptors using bee algorithm were applied for modeling. The mean relative error and correlation coefficient were obtained as 2.86% and 0.9627, respectively, while these amounts for the leave one out−cross validation method were calculated as 4.18% and 0.9297, respectively. The external validation was also conducted using two training and test sets. The correlation coefficients for the training and test sets were obtained as 0.9596 and 0.9185, respectively. Conclusion: The results of modeling of present work showed that bee algorithm has good performance for variable selection in QSAR studies and its results were better than the constructed model with the selected descriptors using the genetic algorithm method.


2018 ◽  
Vol 19 (11) ◽  
pp. 3423 ◽  
Author(s):  
Ting Wang ◽  
Lili Tang ◽  
Feng Luan ◽  
M. Natália D. S. Cordeiro

Organic compounds are often exposed to the environment, and have an adverse effect on the environment and human health in the form of mixtures, rather than as single chemicals. In this paper, we try to establish reliable and developed classical quantitative structure–activity relationship (QSAR) models to evaluate the toxicity of 99 binary mixtures. The derived QSAR models were built by forward stepwise multiple linear regression (MLR) and nonlinear radial basis function neural networks (RBFNNs) using the hypothetical descriptors, respectively. The statistical parameters of the MLR model provided were N (number of compounds in training set) = 79, R2 (the correlation coefficient between the predicted and observed activities)= 0.869, LOOq2 (leave-one-out correlation coefficient) = 0.864, F (Fisher’s test) = 165.494, and RMS (root mean square) = 0.599 for the training set, and Next (number of compounds in external test set) = 20, R2 = 0.853, qext2 (leave-one-out correlation coefficient for test set)= 0.825, F = 30.861, and RMS = 0.691 for the external test set. The RBFNN model gave the statistical results, namely N = 79, R2 = 0.925, LOOq2 = 0.924, F = 950.686, RMS = 0.447 for the training set, and Next = 20, R2 = 0.896, qext2 = 0.890, F = 155.424, RMS = 0.547 for the external test set. Both of the MLR and RBFNN models were evaluated by some statistical parameters and methods. The results confirm that the built models are acceptable, and can be used to predict the toxicity of the binary mixtures.


Author(s):  
Shinjita Ghosh ◽  
Supratik Kar ◽  
Jerzy Leszczynski

Birds or avians have been imperative species in the ecology, having been evaluated in an effort to understand the toxic effects of endocrine disruption. The ecotoxicity of 56 industrial chemicals classified as endocrine disruptors were modeled employing classification and regression-based quantitative structure-activity relationship (QSAR) models to an important avian species, Anas platyrhynchos. The classification- and regression-based QSAR models were developed using linear discriminant analysis (LDA) and partial least squares (PLS) tools, respectively. All models were validated meticulously by employing internal and external validation metrics followed by randomization test, applicability domain (AD) study, and intelligent consensus prediction of all individual models. Features like topological distance of 1, 3, and 5 between atoms O-P, C-P, and N-S, correspondingly, along with the CR3X fragment, can be responsible for an increase in toxicity. On the contrary, the presence of S-Cl with topological distance 6 is accountable for lowering the toxicity of towards A. platyrhynchos. The developed chemometric models can offer significant evidence and guidance in the framework of virtual screening as well as a toxicity prediction of new and/or untested chemical libraries towards this specific avian species.


2017 ◽  
Vol 2017 ◽  
pp. 1-10 ◽  
Author(s):  
Li Wen ◽  
Qing Li ◽  
Wei Li ◽  
Qiao Cai ◽  
Yong-Ming Cai

Hydroxyl benzoic esters are preservative, being widely used in food, medicine, and cosmetics. To explore the relationship between the molecular structure and antibacterial activity of these compounds and predict the compounds with similar structures, Quantitative Structure-Activity Relationship (QSAR) models of 25 kinds of hydroxyl benzoic esters with the quantum chemical parameters and molecular connectivity indexes are built based on support vector machine (SVM) by using R language. The External Standard Deviation Error of Prediction (SDEPext), fitting correlation coefficient (R2), and leave-one-out cross-validation (Q2LOO) are used to value the reliability, stability, and predictive ability of models. The results show that R2 and Q2LOO of 4 kinds of nonlinear models are more than 0.6 and SDEPext is 0.213, 0.222, 0.189, and 0.218, respectively. Compared with the multiple linear regression (MLR) model (R2=0.421, RSD = 0.260), the correlation coefficient and the standard deviation are both better than MLR. The reliability, stability, robustness, and external predictive ability of models are good, particularly of the model of linear kernel function and eps-regression type. This model can predict the antimicrobial activity of the compounds with similar structure in the applicability domain.


INDIAN DRUGS ◽  
2017 ◽  
Vol 54 (04) ◽  
pp. 22-31
Author(s):  
M. C Sharma ◽  

A quantitative structure–activity relationship (QSAR) of a series of substituted pyrazoline derivatives, in regard to their anti-tuberculosis activity, has been studied using the partial least square (PLS) analysis method. QSAR model development of 64 pyrazoline derivatives was carried out to predict anti-tubercular activity. Partial least square analysis was applied to derive QSAR models, which were further evaluated for statistical significance and predictive power by internal and external validation. The best QSAR model with good external and internal predictivity for the training and test set has shown cross validation (q2) and external validation (pred_r2) values of 0.7426 and 0.7903, respectively. Two-dimensional QSAR analyses of such pyrazoline derivatives provide important structural insights for designing potent antituberculosis drugs.


2020 ◽  
Author(s):  
Zakari Ya’u Ibrahim ◽  
Adamu Uzairu ◽  
Gideon Shallangwa ◽  
Stephen Abechi

Abstract A blend of genetic algorithm with multiple linear regression (GA-MLR) method was utilized in generating a quantitative structure–activity relationship (QSAR) model on the antimalarial activity of aryl and aralkyl amine-based triazolopyrimidine derivatives. The structures of derivatives were optimized using density functional theory (DFT) DFT/B3LYP/6–31 + G* basis set to generate their molecular descriptors, where two (2) predictive models were developed with the aid of these descriptors. The model with an excellent statistical parameters; high coefficient of determination (R2) = 0.8884, cross-validated R2 (Q2cv) = 0.8317 and highest external validated R2 (R2pred) = 0.7019 was selected as the best model. The model generated was validated through internal (leave-one-out (LOO) cross-validation), external test set, and Y-randomization test. These parameters are indicators of robustness, excellent prediction, and validity of the selected model. The most relevant descriptor to the antimalarial activity in the model was found to be GATS6p (Geary autocorrelation—lag 6/weighted by polarizabilities), in the model due to its highest mean effect. The descriptor (GATS6p) was significant in the in-silico design of sixteen (16) derivatives of aryl and aralkyl amine-based triazolopyrimidine adopting compound DSM191 with the highest activity (pEC50 = 7.1805) as the design template. The design compound D8 was found to be the most active compound due to its superior hypothetical activity (pEC50 = 8.9545).


Molecules ◽  
2020 ◽  
Vol 26 (1) ◽  
pp. 127
Author(s):  
Cosimo Toma ◽  
Alberto Manganaro ◽  
Giuseppa Raitano ◽  
Marco Marzo ◽  
Domenico Gadaleta ◽  
...  

Carcinogenicity is a crucial endpoint for the safety assessment of chemicals and products. During the last few decades, the development of quantitative structure–activity relationship ((Q)SAR) models has gained importance for regulatory use, in combination with in vitro testing or expert-based reasoning. Several classification models can now predict both human and rat carcinogenicity, but there are few models to quantitatively assess carcinogenicity in humans. To our knowledge, slope factor (SF), a parameter describing carcinogenicity potential used especially for human risk assessment of contaminated sites, has never been modeled for both inhalation and oral exposures. In this study, we developed classification and regression models for inhalation and oral SFs using data from the Risk Assessment Information System (RAIS) and different machine learning approaches. The models performed well in classification, with accuracies for the external set of 0.76 and 0.74 for oral and inhalation exposure, respectively, and r2 values of 0.57 and 0.65 in the regression models for oral and inhalation SFs in external validation. These models might therefore support regulators in (de)prioritizing substances for regulatory action and in weighing evidence in the context of chemical safety assessments. Moreover, these models are implemented on the VEGA platform and are now freely downloadable online.


Author(s):  
Mabrouk Hamadache ◽  
Abdeltif Amrane ◽  
Salah Hanini ◽  
Othmane Benkortbi

Quantitative Structure Activity Relationship (QSAR) models are expected to play an important role in the risk assessment of chemicals on humans and the environment. In this study, a QSAR model based on 10 molecular descriptors to predict acute oral toxicity of 91 fungicides to rats was developed and validated. Good results (PRESS/SSY = 0.085 and VIF < 5) were obtained, showing the validation of descriptors in the obtained model. The best results were obtained with a 10/11/1 Artificial Neural Network model trained with the Levenberg-Marquardt algorithm. The prediction accuracy for the external validation set was estimated by the Q2ext which was equal to 0.960. Accordingly, the model developed in this study provided excellent predictions and can be used to predict the acute oral toxicity of fungicides, particularly for those that have not been tested as well as new fungicides.


2021 ◽  
Vol 1 (1) ◽  
pp. 48-67
Author(s):  
Xavier Chee Wezen ◽  
Clement Sim Jun Wen ◽  
Lilian Siaw Yung Ping ◽  
Yeong Kah Ho ◽  
Kong Hao Qing ◽  
...  

Clathrin-mediated endocytosis (CME) is a normal biological process where cellular contents are transported into the cells. However, this process is often hijacked by different viruses to enter host cells and cause infections. Recently, two proteins that regulate CME – AAK1 and GAK – have been proposed as potential therapeutic targets for designing broad-spectrum antiviral drugs. In this work, we curated two compound datasets containing 83 AAK1 inhibitors and 196 GAK inhibitors each. Subsequently, machine learning methods, namely Random Forest, Elastic Net and Sequential Minimal Optimization, were used to construct Quantitative Structure Activity Relationship (QSAR) models to predict small molecule inhibitors of AAK1 and GAK. To ensure predictivity, these models were evaluated by using Leave-One-Out (LOO) cross validation and with an external test set. In all cases, our QSAR models achieved a q2LOO in range of 0.64 to 0.84 (Root Mean Squared Error; RMSE = 0.41 to 0.52) and a q2ext in range of 0.57 to 0.92 (RMSE = 0.36 to 0.61). Besides, our QSAR models were evaluated by using additional QSAR performance metrics and y-randomization test. Finally, by using a concensus scoring approach, nine chemical compounds from the Drugbank compound library were predicted as AAK1/GAK dual-target inhibitors. The electrostatic potential maps for the nine compounds were generated and compared against two known dual-target inhibitors, sunitinib and baricitinib. Our work provides the rationale to validate these nine compounds experimentally against the protein targets AAK1 and GAK.


Sign in / Sign up

Export Citation Format

Share Document