Quantitative structure–retention relationship for the Kovats retention indices of a large set of terpenes: A combined data splitting-feature selection strategy

2007 ◽  
Vol 592 (1) ◽  
pp. 72-81 ◽  
Author(s):  
Bahram Hemmateenejad ◽  
Katayoun Javadnia ◽  
Maryam Elyasi
2021 ◽  
Vol 65 (4) ◽  
Author(s):  
Ivana Čabarkapa ◽  
Milica Aćimović ◽  
Lato Pezo ◽  
Vanja Tadić

Abstract. This work aimed to obtain a validated model for the prediction of retention times of compounds isolated from Origanum heracleoticum, Origanum vulgare, Thymus vulgaris, and Thymus serpyllum essential oils. In total 68 experimentally obtained retention times of compounds, which were separated and detected by GC-MS were further used to build the prediction models. The quantitative structure–retention relationship was employed to foresee the Kovats retention indices of compounds acquired by GC-MS analysis, using eight molecular descriptors selected by a genetic algorithm. The chosen descriptors were used as inputs for the four artificial neural networks, to construct a Kovats retention indices predictive quantitative structure–retention relationship model. The coefficients of determination in the training cycle were 0.830; 0.852; 0.922 and 0.815 (for compounds found in O. heracleoticum, O. vulgare, T. vulgaris and T. serpyllum essential oils, respectively), demonstrating that these models could be used for prediction of Kovats retention indices, due to low prediction error and high r2.   Resumen. El objetivo de este trabajo es la obtención de modelos validados para la predicción del tiempo de retención de los compuestos aislados de aceites esenciales de Origanum heracleoticum, Origanum vulgare, Thymus vulgaris y Thymus serpyllum. Se han obtenido un total de 68 tiempos de retención de compuestos, separándose y detectándose por cromatografía de gases con detección por espectrometría de masas (GC-MS) con posterior desarrollo de modelos de predicción.  La relación cuantitativa estructura-retención ha sido utilizada para predecir el índice de retención Kovats de los compuestos obtenidos por análisis de GC-MS, utilizando ocho descriptores moleculares seleccionados mediante algoritmo genético. Los descriptores seleccionados han sido utilizados como entrada para las cuatro redes neuronales artificiales y así elaborar los índices predictivos del modelo de relación cuantitativa estructura-retención.  Los coeficientes de determinación en el ciclo de entrenamiento fueron de 0.830; 0.852; 0.922 y 0.815 (para los compuestos identificados en los aceites esenciales del O. heracleoticum, O. vulgare, T. vulgaris y T. serpyllum respectivamente) demostrando así que estos modelos son útiles en la predicción de los índices de retención de Kovats con un error de bajo predicción y alta r2.


2011 ◽  
Vol 65 (2) ◽  
Author(s):  
Patryk Bielecki ◽  
Wiesław Wasiak

AbstractThe quantitative structure-retention relationship (QSRR) was used to predict Kováts retention indices of forty-three volatile olefins on the chemically bonded stationary phase, containing 1,4,8,11-tetraazacycloteradecane (cyclam) complexes of copper(II) chloride. Retention indices were correlated with eleven descriptors derived from structures of olefins optimised using the molecular mechanics force field calculations (MM2). Descriptors were generated with the use of quantitative structure-activity relationships (QSAR), semi-empirical Austin Model 1 methods (AM1), and obtained from physicochemical databases. Five well-correlated models were built, with predictive coefficients of determination (R 2) values of 0.993 and 0.995. The dielectric energy (DE) descriptor was identified as being as important as the polarizability (P) descriptor in the process of separation of unsaturated olefins on stationary phases containing metal complexes. The DE index proved to be decisive in distinguishing between the geometric cis and trans isomers of the tested compounds.


2011 ◽  
Vol 76 (12) ◽  
pp. 1627-1637 ◽  
Author(s):  
Aberoomand Azar ◽  
Mehdi Nekoei ◽  
Kambiz Larijani ◽  
Sakineh Bahraminasab

The chemical composition of the volatile fraction obtained by head-space solid phase microextraction (HS-SPME), single drop microextraction (SDME) and the essential oil obtained by cold-press from the peels of C. sinensis cv. valencia were analyzed employing gas chromatography-flame ionization detector (GC-FID) and gas chromatography-mass spectrometry (GC-MS). The main components were limonene (61.34 %, 68.27 %, 90.50 %), myrcene (17.55 %, 12.35 %, 2.50 %), sabinene (6.50 %, 7.62 %, 0.5 %) and ?-pinene (0 %, 6.65 %, 1.4 %) respectively obtained by HS-SPME, SDME and cold-press. Then a quantitative structure-retention relationship (QSRR) study for the prediction of retention indices (RI) of the compounds was developed by application of structural descriptors and the multiple linear regression (MLR) method. Principal components analysis was used to select the training set. A simple model with low standard errors and high correlation coefficients was obtained. The results illustrated that linear techniques such as MLR combined with a successful variable selection procedure are capable of generating an efficient QSRR model for prediction of the retention indices of different compounds. This model, with high statistical significance (R2 train = 0.983, R2 test = 0.970, Q2 LOO = 0.962, Q2 LGO = 0.936, REP(%) = 3.00), could be used adequately for the prediction and description of the retention indices of the volatile compounds.


2011 ◽  
Vol 76 (6) ◽  
pp. 891-902 ◽  
Author(s):  
Aberomand Azar ◽  
Mehdi Nekoei ◽  
Siavash Riahi ◽  
Mohamad Ganjali ◽  
Karim Zare

A simple, descriptive and interpretable model, based on a quantitative structure-retention relationship (QSRR), was developed using the genetic algorithm-multiple linear regression (GA-MLR) approach for the prediction of the retention indices (RI) of essential oil components. By molecular modeling, three significant descriptors related to the RI values of the essential oils were identified. A data set was selected consisting of the retention indices for 32 essential oil molecules with a range of more than 931 compounds. Then, a suitable set of the molecular descriptors was calculated and the important descriptors were selected with the aid of the genetic algorithm and multiple regression method. A model with a low prediction error and a good correlation coefficient was obtained. This model was used for the prediction of the RI values of some essential oil components which were not used in the modeling procedure.


2019 ◽  
Vol 84 (4) ◽  
pp. 405-416 ◽  
Author(s):  
Youssouf Driouche ◽  
Djelloul Messadi

In this paper, a quantitative structure?retention relationship (QSRR) model was developed for predicting the retention indices (log RI) of 36 constituents of essential oils. First, the chemical structure of each compound was sketched using HyperChem software. Then, molecular descriptors covering different information of molecular structures were calculated by Dragon software. The results illustrated that linear techniques, such as multiple linear regression (MLR), combined with a successful variable selection procedure are capable of generating an efficient QSRR model for predicting the retention indices of different compounds. This model, with high statistical significance (R2 = 0.9781, Q2 LOO = 0.9691, Q2 ext = 0.9546, Q2 L(5)O = 0.9667, F = 245.27), could be used adequately for the prediction and description of the retention indices of other essential oil compounds. The reliability of the proposed model was further illustrated using various evaluation techniques: leave-5-out cross-validation, bootstrap, randomization test and validation through the test set.


Sign in / Sign up

Export Citation Format

Share Document