Feature clustering based support vector machine recursive feature elimination for gene selection

The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use oft-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM.

Download Full-text

Gene Selection Using Gaussian Kernel Support Vector Machine Based Recursive Feature Elimination with Adaptive Kernel Width Strategy

Rough Sets and Knowledge Technology - Lecture Notes in Computer Science ◽

10.1007/11795131_116 ◽

2006 ◽

pp. 799-806 ◽

Cited By ~ 3

Author(s):

Yong Mao ◽

Xiaobo Zhou ◽

Zheng Yin ◽

Daoying Pi ◽

Youxian Sun ◽

...

Keyword(s):

Support Vector Machine ◽

Gene Selection ◽

Gaussian Kernel ◽

Recursive Feature Elimination ◽

Support Vector ◽

Kernel Width ◽

Kernel Support Vector Machine ◽

Adaptive Kernel

Download Full-text

Multiclass Cancer Classification by Using Fuzzy Support Vector Machine and Binary Decision Tree With Gene Selection

Journal of Biomedicine and Biotechnology ◽

10.1155/jbb.2005.160 ◽

2005 ◽

Vol 2005 (2) ◽

pp. 160-171 ◽

Cited By ~ 44

Author(s):

Yong Mao ◽

Xiaobo Zhou ◽

Daoying Pi ◽

Youxian Sun ◽

Stephen T. C. Wong

Keyword(s):

Support Vector Machine ◽

Gene Selection ◽

Binary Classification ◽

Classification Tree ◽

Cancer Classification ◽

Recursive Feature Elimination ◽

Support Vector ◽

Fuzzy Support Vector Machine ◽

F Test ◽

Leukemia Data

We investigate the problems of multiclass cancer classification with gene selection from gene expression data. Two different constructed multiclass classifiers with gene selection are proposed, which are fuzzy support vector machine (FSVM) with gene selection and binary classification tree based on SVM with gene selection. Using F test and recursive feature elimination based on SVM as gene selection methods, binary classification tree based on SVM with F test, binary classification tree based on SVM with recursive feature elimination based on SVM, and FSVM with recursive feature elimination based on SVM are tested in our experiments. To accelerate computation, preselecting the strongest genes is also used. The proposed techniques are applied to analyze breast cancer data, small round blue-cell tumors, and acute leukemia data. Compared to existing multiclass cancer classifiers and binary classification tree based on SVM with F test or binary classification tree based on SVM with recursive feature elimination based on SVM mentioned in this paper, FSVM based on recursive feature elimination based on SVM can find most important genes that affect certain types of cancer with high recognition accuracy.

Download Full-text

SVM-BT-RFE: An improved gene selection framework using Bayesian T-test embedded in support vector machine (recursive feature elimination) algorithm

Karbala International Journal of Modern Science ◽

10.1016/j.kijoms.2015.10.002 ◽

2015 ◽

Vol 1 (2) ◽

pp. 86-96 ◽

Cited By ~ 17

Author(s):

Shruti Mishra ◽

Debahuti Mishra

Keyword(s):

Support Vector Machine ◽

Gene Selection ◽

T Test ◽

Recursive Feature Elimination ◽

Support Vector ◽

Elimination Algorithm ◽

Selection Framework

Download Full-text

Feature gene selection for Chinese hamster classification based on support vector machine

Journal of Computer Applications ◽

10.3724/sp.j.1087.2011.00584 ◽

2011 ◽

Vol 31 (2) ◽

pp. 584-586

Author(s):

Jun-li YANG ◽

Tian-fu LIU

Keyword(s):

Support Vector Machine ◽

Gene Selection ◽

Chinese Hamster ◽

Support Vector ◽

Selection For

Download Full-text

Realizing an Integrated Multistage Support Vector Machine Model for Augmented Recognition of Unipolar Depression

Electronics ◽

10.3390/electronics9040647 ◽

2020 ◽

Vol 9 (4) ◽

pp. 647

Author(s):

Kathiravan Srinivasan ◽

Nivedhitha Mahendran ◽

Durai Raj Vincent ◽

Chuan-Yu Chang ◽

Shabbir Syed-Abdul

Keyword(s):

Support Vector Machine ◽

Support Vector Machine Model ◽

Sampling Technique ◽

Unipolar Depression ◽

Clinical Depression ◽

Majority Voting ◽

Recursive Feature Elimination ◽

Support Vector ◽

Daily Routine ◽

Machine Model

Unipolar depression (UD), also referred to as clinical depression, appears to be a widespread mental disorder around the world. Further, this is a vital state related to a person’s health that influences his/her daily routine. Besides, this state also influences the person’s frame of mind, behavior, and several body functionalities like sleep, appetite, and also it can cause a scenario where a person could harm himself/herself or others. In several cases, it becomes an arduous task to detect UD, since, it is a state of comorbidity. For that reason, this research proposes a more convenient approach for the physicians to detect the state of clinical depression at an initial phase using an integrated multistage support vector machine model. Initially, the dataset is preprocessed using multiple imputation by chained equations (MICE) technique. Then, for selecting the appropriate features, the support vector machine-based recursive feature elimination (SVM RFE) is deployed. Subsequently, the integrated multistage support vector machine classifier is built by employing the bagging random sampling technique. Finally, the experimental outcomes indicate that the proposed integrated multistage support vector machine model surpasses methods such as logistic regression, multilayer perceptron, random forest, and bagging SVM (majority voting), in terms of overall performance.

Download Full-text

Support Vector Machine - Recursive Feature Elimination (SVM - RFE) for Selection of MicroRNA Expression Features of Breast Cancer

2018 2nd International Conference on Informatics and Computational Sciences (ICICoS) ◽

10.1109/icicos.2018.8621708 ◽

2018 ◽

Author(s):

Amazona Adorada ◽

Ratih Permatasari ◽

Panji Wisnu Wirawan ◽

Adi Wibowo ◽

Adi Sujiwo

Keyword(s):

Breast Cancer ◽

Support Vector Machine ◽

Microrna Expression ◽

Recursive Feature Elimination ◽

Support Vector ◽

Selection Of

Download Full-text

Gene selection in cancer classification using hybrid method based on Particle Swarm Optimization (PSO), Artificial Bee Colony (ABC) feature selection and support vector machine

10.1063/1.5132474 ◽

2019 ◽

Cited By ~ 1

Author(s):

D. A. Utami ◽

Z. Rustam

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Particle Swarm Optimization ◽

Hybrid Method ◽

Artificial Bee Colony ◽

Gene Selection ◽

Cancer Classification ◽

Support Vector ◽

Swarm Optimization ◽

Bee Colony

Download Full-text

Hybrid adapted fast correlation FCBF-support vector machine recursive feature elimination for feature selection

Intelligent Decision Technologies ◽

10.3233/idt-190014 ◽

2020 ◽

Vol 14 (3) ◽

pp. 269-279

Author(s):

Hayet Djellali ◽

Nacira Ghoualmi-Zine ◽

Souad Guessoum

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Recursive Feature Elimination ◽

Support Vector ◽

Svm Classifier ◽

Hybrid Architecture ◽

Features Selection ◽

K Nearest Neighbors ◽

Correlation Based Feature Selection ◽

Embedded Method

This paper investigates feature selection methods based on hybrid architecture using feature selection algorithm called Adapted Fast Correlation Based Feature selection and Support Vector Machine Recursive Feature Elimination (AFCBF-SVMRFE). The AFCBF-SVMRFE has three stages and composed of SVMRFE embedded method with Correlation based Features Selection. The first stage is the relevance analysis, the second one is a redundancy analysis, and the third stage is a performance evaluation and features restoration stage. Experiments show that the proposed method tested on different classifiers: Support Vector Machine SVM and K nearest neighbors KNN provide a best accuracy on various dataset. The SVM classifier outperforms KNN classifier on these data. The AFCBF-SVMRFE outperforms FCBF multivariate filter, SVMRFE, Particle swarm optimization PSO and Artificial bees colony ABC.

Download Full-text

ITERATIVE FEATURE PERTURBATION AS A GENE SELECTOR FOR MICROARRAY DATA

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001412600038 ◽

2012 ◽

Vol 26 (05) ◽

pp. 1260003 ◽

Cited By ~ 16

Author(s):

JUANA CANUL-REICH ◽

LAWRENCE O. HALL ◽

DMITRY B. GOLDGOF ◽

JOHN N. KORECKI ◽

STEVEN ESCHRICH

Keyword(s):

Gene Expression ◽

Colon Cancer ◽

Support Vector Machine ◽

Performance Improvement ◽

Microarray Data ◽

T Test ◽

Recursive Feature Elimination ◽

Support Vector ◽

Gene Sets ◽

Microarray Datasets

Gene-expression microarray datasets often consist of a limited number of samples with a large number of gene-expression measurements, usually on the order of thousands. Therefore, dimensionality reduction is critical prior to any classification task. In this work, the iterative feature perturbation method (IFP), an embedded gene selector, is introduced and applied to four microarray cancer datasets: colon cancer, leukemia, Moffitt colon cancer, and lung cancer. We compare results obtained by IFP to those of support vector machine-recursive feature elimination (SVM-RFE) and the t-test as a feature filter using a linear support vector machine as the base classifier. Analysis of the intersection of gene sets selected by the three methods across the four datasets was done. Additional experiments included an initial pre-selection of the top 200 genes based on their p values. IFP and SVM-RFE were then applied on the reduced feature sets. These results showed up to 3.32% average performance improvement for IFP across the four datasets. A statistical analysis (using the Friedman/Holm test) for both scenarios showed the highest accuracies came from the t-test as a filter on experiments without gene pre-selection. IFP and SVM-RFE had greater classification accuracy after gene pre-selection. Analysis showed the t-test is a good gene selector for microarray data. IFP and SVM-RFE showed performance improvement on a reduced by t-test dataset. The IFP approach resulted in comparable or superior average class accuracy when compared to SVM-RFE on three of the four datasets. The same or similar accuracies can be obtained with different sets of genes.

Download Full-text