An Innovative Classification Model for CAD Dataset Using SVM Based Iterative Linear Discriminant Analysis

Background: Bioinformatics and statistical analysis have been employed to develop a classification model to distinguish toxic and non-toxic molecules. Aims: The primary objective of this study is to enumerate the cut-off values of various physico-chemical (ligand-centric) and target interaction (receptor-centric) descriptors which forms the basis for classifying cardiotoxic and non-toxic molecules. We also sought correlation of molecular docking, absorption, distribution, metabolism, excretion, and toxicology (ADMET) parameters, Lipinski rules, physico-chemical parameters, etc. of human cardiotoxicity drugs. Methods: A training and test set of 91 compounds were applied to linear discriminant analysis (LDA) using 2D and 3D descriptors as discriminating variables representing various molecular modeling parameters to identify which function of descriptor type is responsible for cardiotoxicity. Internal validation was performed using the leave-one-out cross-validation methodology ensuing in good results, assuring the stability of the discriminant function (DF). Results: The values of the statistical parameters Fisher Discriminant Analysis (FDA) and Wilk’s λ for the DF showed reliable statistical significance, as long as the success rate in the prediction for both the training and the test set attained more than 93% accuracy, 87.50% sensitivity and 94.74% specificity. Conclusion: The predictive model was built using a hybrid approach using organ-specific targets for docking and ADMET properties for the FDA (Food and Drug Administration) approved and withdrawn drugs. Classifiers were developed by linear discriminant analysis and the cut-off was enumerated by receiver operating characteristic curve (ROC) analysis to achieve reliable specificity and sensitivity.

Download Full-text

Classification of Diesel Engine Health Using Sparse Linear Discriminant Analysis (SLDA)

ASME 2009 Dynamic Systems and Control Conference, Volume 1 ◽

10.1115/dscc2009-2790 ◽

2009 ◽

Author(s):

Neha Chandrachud ◽

Ravindra Kakade ◽

Peter H. Meckl ◽

Galen B. King ◽

Kristofer Jennings

Keyword(s):

Steady State ◽

Diesel Engine ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Optimal Number ◽

Classification Model ◽

Misclassification Rate ◽

Linear Discriminant ◽

State Data ◽

Input Variables

With requirements for on-board diagnostics on diesel engines becoming more stringent for the coming model years, diesel engine manufacturers must improve their ability to identify fault conditions that lead to increased exhaust emissions. This paper proposes a statistical classifier model to identify the state of the engine, i.e. healthy or faulty, using an optimal number of sensors based on the data acquired from the engine. The classification model proposed in this paper is based on Sparse Linear Discriminant Analysis. This technique performs Linear Discriminant Analysis with a sparseness criterion imposed such that classification, dimension reduction and feature selection are merged into one step. It was concluded that the analysis technique could produce 0% misclassification rate for the steady-state data acquired from the diesel engine using five input variables. The classifier model was also extended to transient operation of the engine. The misclassification rate in the case of transient data was reduced from 31% to 26% by using the steady-state data trained classifier using thirteen variables.

Download Full-text

Summary for PTML Chemoinformatics Linear Discriminant Analysis classification model for enantioselective reactions

10.3390/mol2net-07-11203 ◽

2021 ◽

Author(s):

Shan He

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Classification Model ◽

Linear Discriminant ◽

Enantioselective Reactions

Download Full-text

ECG Signal Classification using Support Vector Machine and Linear Discriminant Analysis

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i5.17201725 ◽

2019 ◽

Vol 7 (5) ◽

pp. 1720-1725

Author(s):

S. Grover ◽

Shailja .

Keyword(s):

Support Vector Machine ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Signal Classification ◽

Support Vector ◽

Ecg Signal ◽

Linear Discriminant

Download Full-text

Feature selection based on linear discriminant analysis

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.02781 ◽

2009 ◽

Vol 29 (10) ◽

pp. 2781-2785

Author(s):

Zi-feng CUI ◽

Xiao-hua JI

Keyword(s):

Feature Selection ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Linear Discriminant

Download Full-text

SAR target recognition method based on weighted two-directional and two-dimensional linear discriminant analysis

Journal of Computer Applications ◽

10.3724/sp.j.1087.2013.00534 ◽

2013 ◽

Vol 33 (2) ◽

pp. 534-538

Author(s):

Zhen LIU ◽

Hui JIANG ◽

Libin WANG

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Target Recognition ◽

Two Dimensional ◽

Recognition Method ◽

Linear Discriminant ◽

Sar Target Recognition

Download Full-text

A Self-Calibrated Direct Approach to Precision Matrix Estimation and Linear Discriminant Analysis in High Dimensions

SSRN Electronic Journal ◽

10.2139/ssrn.3422590 ◽

2019 ◽

Author(s):

Chi Seng Pun ◽

Matthew Zakharia Hadimaja

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Direct Approach ◽

High Dimensions ◽

Precision Matrix ◽

Linear Discriminant ◽

Matrix Estimation

Download Full-text

Effect of Topography on Maize Grains Elemental Profile: A Chemometric Approach

Current Analytical Chemistry ◽

10.2174/1573411016666200319095312 ◽

2020 ◽

Vol 16 (8) ◽

pp. 1079-1087

Author(s):

Jorgelina Z. Heredia ◽

Carlos A. Moldes ◽

Raúl A. Gil ◽

José M. Camiña

Keyword(s):

Cluster Analysis ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Microwave Plasma ◽

Emission Spectrometry ◽

Linear Discriminant ◽

Maize Seeds ◽

Elemental Profile ◽

Mineral Profile ◽

Topographic Characteristics

Background: The elemental composition of maize grains depends on the soil, land and environment characteristics where the crop grows. These effects are important to evaluate the availability of nutrients with complex dynamics, such as the concentration of macro and micronutrients in soils, which can vary according to different topographies. There is available scarce information about the influence of topographic characteristics (upland and lowland) where culture is developed with the mineral composition of crop products, in the present case, maize seeds. On the other hand, the study of the topographic effect on crops using multivariate analysis tools has not been reported. Objective: This paper assesses the effect of topographic conditions on plants, analyzing the mineral profiles in maize seeds obtained in two land conditions: uplands and lowlands. Materials and Methods: The mineral profile was studied by microwave plasma atomic emission spectrometry. Samples were collected from lowlands and uplands of cultivable lands of the north-east of La Pampa province, Argentina. Results: Differentiation of maize seeds collected from both topographical areas was achieved by principal components analysis (PCA), cluster analysis (CA) and linear discriminant analysis (LDA). PCA model based on mineral profile allowed to differentiate seeds from upland and lowlands by the influence of Cr and Mg variables. A significant accumulation of Cr and Mg in seeds from lowlands was observed. Cluster analysis confirmed such grouping but also, linear discriminant analysis achieved a correct classification of both the crops, showing the effect of topography on elemental profile. Conclusions: Multi-elemental analysis combined with chemometric tools proved useful to assess the effect of topographic characteristics on crops.

Download Full-text

Colorectal Cancer Classification and Survival Analysis Based on an Integrated RNA and DNA molecular signature

Current Bioinformatics ◽

10.2174/1574893615999200711170445 ◽

2020 ◽

Vol 15 ◽

Author(s):

Mohanad Mohammed ◽

Henry Mwambi ◽

Bernard Omolo

Keyword(s):

Colorectal Cancer ◽

Survival Analysis ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Negative Binomial ◽

Sub Saharan Africa ◽

Support Vector ◽

Mutation Status ◽

Linear Discriminant ◽

Rnaseq Data

Background: Colorectal cancer (CRC) is the third most common cancer among women and men in the USA, and recent studies have shown an increasing incidence in less developed regions, including Sub-Saharan Africa (SSA). We developed a hybrid (DNA mutation and RNA expression) signature and assessed its predictive properties for the mutation status and survival of CRC patients. Methods: Publicly-available microarray and RNASeq data from 54 matched formalin-fixed paraffin-embedded (FFPE) samples from the Affymetrix GeneChip and RNASeq platforms, were used to obtain differentially expressed genes between mutant and wild-type samples. We applied the support-vector machines, artificial neural networks, random forests, k-nearest neighbor, naïve Bayes, negative binomial linear discriminant analysis, and the Poisson linear discriminant analysis algorithms for classification. Cox proportional hazards model was used for survival analysis. Results: Compared to the genelist from each of the individual platforms, the hybrid genelist had the highest accuracy, sensitivity, specificity, and AUC for mutation status, across all the classifiers and is prognostic for survival in patients with CRC. NBLDA method was the best performer on the RNASeq data while the SVM method was the most suitable classifier for CRC across the two data types. Nine genes were found to be predictive of survival. Conclusion: This signature could be useful in clinical practice, especially for colorectal cancer diagnosis and therapy. Future studies should determine the effectiveness of integration in cancer survival analysis and the application on unbalanced data, where the classes are of different sizes, as well as on data with multiple classes.

Download Full-text