A segmented principal component analysis–regression approach to quantitative structure–activity relationship modeling

2009 ◽  
Vol 646 (1-2) ◽  
pp. 30-38 ◽  
Author(s):  
Bahram Hemmateenejad ◽  
Maryam Elyasi
Author(s):  
Meysam Shirmohammadi ◽  
Zakiyeh Bayat ◽  
Esmat Mohammadinasab

: Quantitative structure activity relationship (QSAR) was used to study the partition coefficient of some quinolones and their derivatives. These molecules are broad-spectrum antibiotic pharmaceutics. First, data were divided into two categories of train and test (validation) sets using random selection method. Second, three approaches including stepwise selection (STS) (forward), genetic algorithm (GA), and simulated annealing (SA) were used to select the descriptors, with the aim of examining the effect feature selection methods. To find the relation between descriptors and partition coefficient, multiple linear regression (MLR), principal component regression (PCR) and partial least squares (PLS) were used. QSAR study showed that the both regression and descriptor selection methods have vital role in the results. Different statistical metrics showed that the MLR-SA approach with (r2=0.96, q2=0.91, pred_r2=0.95) gives the best outcome. The proposed expression by MLR-SA approach can be used in the better design of novel quinolones and their derivatives.


Quantitative structure-activity relationship (QSAR), gives useful information for drug design and medicinal chemistry. QSAR is a method used to anticipate the organic reaction of a molecule by developing equations which use descriptors calculated from its compounds. The molecular descriptors vary in complexity. A time consuming and expensive process for pharmaceutical industries is drug discovery. An inspiration driving these QSAR models is to help revive the revelation of molecular drug candidates through minimized test work and to bring a drug to market faster. To obtain sorted features principal component analysis is used. The biological activities of the test set are determined by training the neural network using training set. By predicting the activities it can be known whether the drug is close to the target or not.


Sign in / Sign up

Export Citation Format

Share Document