Partial maximum correlation information: A new feature selection method for microarray data classification

Classification of cancers based on gene expression yields better accuracy than classification based on clinical markers. Feature selection improves the accuracy of these classification algorithms by reducing the chance of overfitting caused by the large number of features. We develop a new feature selection method called Biological Pathway-based Feature Selection (BPFS) for microarray data. Unlike most existing methods, our method integrates signaling and gene regulatory pathways with gene expression data to minimize the chance of overfitting and to improve the test accuracy. Thus, BPFS selects a biologically meaningful feature set that is minimally redundant. Our experiments on published breast cancer datasets demonstrate that all of the top 20 genes found by our method are associated with cancer. Furthermore, the classification accuracy of our signature is up to 18% better than that of van't Veer's 70-gene signature, and up to 8% better than that of the best published feature selection method, I-RELIEF.

General Learning Equilibrium Optimizer: A New Feature Selection Method for Biological Data Classification

Applied Artificial Intelligence ◽ 10.1080/08839514.2020.1861407 ◽ 2020 ◽ pp. 1-17

Author(s): Jingwei Too ◽ Seyedali Mirjalili

Keyword(s): Feature Selection ◽ Feature Selection Method ◽ Data Classification ◽ Selection Method ◽ Biological Data ◽ New Feature

A novel feature selection method for microarray data classification based on hidden Markov model

Journal of Biomedical Informatics ◽ 10.1016/j.jbi.2019.103213 ◽ 2019 ◽ Vol 95 ◽ pp. 103213 ◽ Cited By ~ 2

Author(s): Mohammadreza Momenzadeh ◽ Mohammadreza Sehhati ◽ Hossein Rabbani

Keyword(s): Feature Selection ◽ Markov Model ◽ Hidden Markov Model ◽ Microarray Data ◽ Hidden Markov ◽ Feature Selection Method ◽ Data Classification ◽ Selection Method

A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification

PLoS ONE ◽ 10.1371/journal.pone.0102541 ◽ 2014 ◽ Vol 9 (7) ◽ pp. e102541 ◽ Cited By ~ 32

Author(s): Shiquan Sun ◽ Qinke Peng ◽ Adnan Shakoor

Keyword(s): Feature Selection ◽ Microarray Data ◽ Feature Selection Method ◽ Data Classification ◽ Selection Method

Leukemia and colon tumor detection based on microarray data classification using momentum backpropagation and genetic algorithm as a feature selection method

Journal of Physics Conference Series ◽ 10.1088/1742-6596/971/1/012018 ◽ 2018 ◽ Vol 971 ◽ pp. 012018 ◽ Cited By ~ 1

Author(s): Untari N Wisesty ◽ Riris S Warastri ◽ Shinta Y Puspitasari

Keyword(s): Genetic Algorithm ◽ Feature Selection ◽ Microarray Data ◽ Feature Selection Method ◽ Data Classification ◽ Tumor Detection ◽ Selection Method ◽ Colon Tumor

A lazy feature selection method for multi-label classification

Intelligent Data Analysis ◽ 10.3233/ida-194878 ◽ 2021 ◽ Vol 25 (1) ◽ pp. 21-34

Author(s): Rafael B. Pereira ◽ Alexandre Plastino ◽ Bianca Zadrozny ◽ Luiz H.C. Merschmann

Keyword(s): Feature Selection ◽ Text Categorization ◽ Feature Selection Method ◽ Selection Method ◽ Video Classification ◽ Classification Problems ◽ Class Label ◽ New Feature ◽ Feature Selection Techniques ◽ Biomolecular Analysis

In many important application domains, such as text categorization, biomolecular analysis, scene or video classification and medical diagnosis, instances are naturally associated with more than one class label, giving rise to multi-label classification problems. This has led, in recent years, to a substantial amount of research in multi-label classification. More specifically, feature selection methods have been developed to identify relevant and informative features for multi-label classification. This work presents a new feature selection method based on the lazy feature selection paradigm and specific to the multi-label context. Experimental results show that the proposed technique is competitive with multi-label feature selection techniques currently used in the literature, and is clearly more scalable as the amount of data increases.

A fuzzy gaussian rank aggregation ensemble feature selection method for microarray data

International Journal of Knowledge-based and Intelligent Engineering Systems ◽ 10.3233/kes-190134 ◽ 2021 ◽ Vol 24 (4) ◽ pp. 289-301

Author(s): B. Venkatesh ◽ J. Anuradha

Keyword(s): Feature Selection ◽ Microarray Data ◽ Classification Accuracy ◽ Performance Metrics ◽ Feature Selection Method ◽ Selection Method ◽ Support Vector ◽ Svm Classifier ◽ Binary Particle Swarm Optimization ◽ Selection Methods

In microarray data, high classification accuracy is difficult to achieve because of high dimensionality and irrelevant, noisy data. Such datasets also contain many gene expression features but few samples. To increase the classification accuracy and the processing speed of the model, an optimal subset of features needs to be extracted, which can be achieved by applying a feature selection method. In this paper, we propose a hybrid ensemble feature selection method. The proposed method has two phases: a filter phase and a wrapper phase. In the filter phase, an ensemble technique aggregates the feature ranks produced by the Relief, minimum Redundancy Maximum Relevance (mRMR), and Feature Correlation (FC) filter feature selection methods. This paper uses Fuzzy Gaussian membership function ordering to aggregate the ranks. In the wrapper phase, Improved Binary Particle Swarm Optimization (IBPSO) is used to select the optimal features, and an RBF kernel-based Support Vector Machine (SVM) classifier is used as the evaluator. The performance of the proposed model is compared with state-of-the-art feature selection methods on five benchmark datasets. For evaluation, performance metrics such as Accuracy, Recall, Precision, and F1-Score are used. The experimental results show that the proposed method outperforms the other feature selection methods.
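
The filter-phase rank aggregation described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact membership function or aggregation rule, so the Gaussian centered on the best rank, the `sigma` value, the averaging across filters, and the example ranks are all assumptions made for illustration. The function names `gaussian_membership` and `aggregate_ranks` are hypothetical.

```python
import math


def gaussian_membership(rank, center=1.0, sigma=2.0):
    """Gaussian membership of a rank: 1.0 at the best rank, decaying as the rank worsens.

    Assumption: the membership is centered on rank 1 with an arbitrary sigma.
    """
    return math.exp(-((rank - center) ** 2) / (2.0 * sigma ** 2))


def aggregate_ranks(rank_lists, sigma=2.0):
    """Average each feature's membership across filters and order features best-first.

    rank_lists: one list per filter; rank_lists[f][j] is the rank (1 = best)
    that filter f assigns to feature j.
    Assumption: a plain mean is used to combine the filters.
    """
    n_features = len(rank_lists[0])
    scores = []
    for j in range(n_features):
        memberships = [gaussian_membership(ranks[j], sigma=sigma) for ranks in rank_lists]
        scores.append(sum(memberships) / len(memberships))
    # Higher aggregated membership means the feature is ranked well by more filters.
    order = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return order, scores


# Hypothetical ranks for four features from three filters (e.g. Relief, mRMR, FC).
relief_ranks = [1, 2, 3, 4]
mrmr_ranks = [2, 1, 4, 3]
fc_ranks = [1, 3, 2, 4]

order, scores = aggregate_ranks([relief_ranks, mrmr_ranks, fc_ranks])
print(order)
```

In a full pipeline of the kind the abstract outlines, the top-ranked features from `order` would then be passed to the wrapper phase, where a search procedure scores candidate subsets with a classifier; that step is omitted here.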
