Stability of feature selection in classification issues for high-dimensional correlated data

Émeline Perthame; Chloé Friguet; David Causeur

doi:10.1007/s11222-015-9569-2

Stability of feature selection in classification issues for high-dimensional correlated data

Statistics and Computing ◽

10.1007/s11222-015-9569-2 ◽

2015 ◽

Vol 26 (4) ◽

pp. 783-796 ◽

Cited By ~ 6

Author(s):

Émeline Perthame ◽

Chloé Friguet ◽

David Causeur

Keyword(s):

Feature Selection ◽

Correlated Data ◽

High Dimensional

Download Full-text

Combination of Ensembles of Regularized Regression Models with Resampling-Based Lasso Feature Selection in High Dimensional Data

Mathematics ◽

10.3390/math8010110 ◽

2020 ◽

Vol 8 (1) ◽

pp. 110 ◽

Cited By ~ 1

Author(s):

Abhijeet R Patil ◽

Sangjin Kim

Keyword(s):

Feature Selection ◽

Geometric Mean ◽

High Dimensional Data ◽

Correlated Data ◽

Rank Aggregation ◽

Adaptive Lasso ◽

Superior Performance ◽

High Dimensional ◽

Regularized Regression ◽

The Individual

In high-dimensional data, the performances of various classifiers are largely dependent on the selection of important features. Most of the individual classifiers with the existing feature selection (FS) methods do not perform well for highly correlated data. Obtaining important features using the FS method and selecting the best performing classifier is a challenging task in high throughput data. In this article, we propose a combination of resampling-based least absolute shrinkage and selection operator (LASSO) feature selection (RLFS) and ensembles of regularized regression (ERRM) capable of dealing data with the high correlation structures. The ERRM boosts the prediction accuracy with the top-ranked features obtained from RLFS. The RLFS utilizes the lasso penalty with sure independence screening (SIS) condition to select the top k ranked features. The ERRM includes five individual penalty based classifiers: LASSO, adaptive LASSO (ALASSO), elastic net (ENET), smoothly clipped absolute deviations (SCAD), and minimax concave penalty (MCP). It was built on the idea of bagging and rank aggregation. Upon performing simulation studies and applying to smokers’ cancer gene expression data, we demonstrated that the proposed combination of ERRM with RLFS achieved superior performance of accuracy and geometric mean.

Download Full-text

RHDSI: A Novel Dimensionality Reduction Based Algorithm on High Dimensional Feature Selection with Interactions

Information Sciences ◽

10.1016/j.ins.2021.06.096 ◽

2021 ◽

Author(s):

Rahi Jain ◽

Wei Xu

Keyword(s):

Feature Selection ◽

Dimensionality Reduction ◽

High Dimensional

Download Full-text

BagMeLiF: stable boosting-based hybrid-ensemble feature selection algorithm for high-dimensional data

2020 International Conference on Control, Robotics and Intelligent System ◽

10.1145/3437802.3437835 ◽

2020 ◽

Author(s):

Nikita Pilnenskiy ◽

Ivan Smetannikov

Keyword(s):

Feature Selection ◽

High Dimensional Data ◽

High Dimensional ◽

Selection Algorithm ◽

Feature Selection Algorithm

Download Full-text

Scatter search for high-dimensional feature selection using feature grouping

Proceedings of the Genetic and Evolutionary Computation Conference Companion ◽

10.1145/3449726.3459481 ◽

2021 ◽

Author(s):

Miguel García-Torres ◽

Francisco Gómez-Vela ◽

Federico Divina ◽

Diego P. Pinto-Roa ◽

José Luis Vázquez Noguera ◽

...

Keyword(s):

Feature Selection ◽

Scatter Search ◽

High Dimensional ◽

Feature Grouping

Download Full-text

Data-driven Feature Selection for Long Longitudinal Breadth and High Dimensional Dataset

Proceedings of the 2020 12th International Conference on Machine Learning and Computing ◽

10.1145/3383972.3383992 ◽

2020 ◽

Author(s):

Ji-Han Liu ◽

Cheng-Tse Wu ◽

Ta-Wei Chu ◽

and Jyh-Shing Roger Jang

Keyword(s):

Feature Selection ◽

Data Driven ◽

High Dimensional ◽

Selection For

Download Full-text

Research of Medical High-Dimensional Imbalanced Data Classification Ensemble Feature Selection Algorithm with Random Forest

2017 International Conference on Smart Grid and Electrical Automation (ICSGEA) ◽

10.1109/icsgea.2017.158 ◽

2017 ◽

Cited By ~ 2

Author(s):

Min Zhu ◽

Bo Su ◽

Gangmin Ning

Keyword(s):

Feature Selection ◽

Random Forest ◽

Imbalanced Data ◽

Data Classification ◽

High Dimensional ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Imbalanced Data Classification

Download Full-text

CLASSIFICATION OF HIGH-DIMENSIONAL MICROARRAY DATA WITH A TWO-STEP PROCEDURE VIA A WILCOXON CRITERION AND MULTILAYER PERCEPTRON

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026811002969 ◽

2011 ◽

Vol 10 (01) ◽

pp. 1-14

Author(s):

VLADIMIR NIKULIN ◽

TIAN-HSIANG HUANG ◽

GEOFFREY J. MCLACHLAN

Keyword(s):

Data Mining ◽

Feature Selection ◽

High Dimensional ◽

Second Step ◽

Support Vector ◽

Step Procedure ◽

Leave One Out ◽

Natural Combination ◽

Feature Selection Techniques

The method presented in this paper is novel as a natural combination of two mutually dependent steps. Feature selection is a key element (first step) in our classification system, which was employed during the 2010 International RSCTC data mining (bioinformatics) Challenge. The second step may be implemented using any suitable classifier such as linear regression, support vector machine or neural networks. We conducted leave-one-out (LOO) experiments with several feature selection techniques and classifiers. Based on the LOO evaluations, we decided to use feature selection with the separation type Wilcoxon-based criterion for all final submissions. The method presented in this paper was tested successfully during the RSCTC data mining Challenge, where we achieved the top score in the Basic track.

Download Full-text

Optimal Feature Selection in High-Dimensional Discriminant Analysis

IEEE Transactions on Information Theory ◽

10.1109/tit.2014.2381241 ◽

2015 ◽

Vol 61 (2) ◽

pp. 1063-1083 ◽

Cited By ~ 9

Author(s):

Mladen Kolar ◽

Han Liu

Keyword(s):

Feature Selection ◽

Discriminant Analysis ◽

High Dimensional ◽

Optimal Feature Selection ◽

Optimal Feature

Download Full-text

On fuzzy feature selection in designing fuzzy classifiers for high-dimensional data

Evolving Systems ◽

10.1007/s12530-015-9142-4 ◽

2015 ◽

Vol 7 (4) ◽

pp. 255-265 ◽

Cited By ~ 6

Author(s):

Eghbal G. Mansoori ◽

Khadijeh S. Shafiee

Keyword(s):

Feature Selection ◽

High Dimensional Data ◽

High Dimensional ◽

Fuzzy Classifiers ◽

Fuzzy Feature Selection

Download Full-text

An Evolutionary Multitasking-Based Feature Selection Method for High-Dimensional Classification

IEEE Transactions on Cybernetics ◽

10.1109/tcyb.2020.3042243 ◽

2021 ◽

pp. 1-15

Author(s):

Ke Chen ◽

Bing Xue ◽

Mengjie Zhang ◽

Fengyu Zhou

Keyword(s):

Feature Selection ◽

Feature Selection Method ◽

Selection Method ◽

High Dimensional ◽

Dimensional Classification ◽

Evolutionary Multitasking

Download Full-text