On the Feature Selection Criterion Based on an Approximation of Multidimensional Mutual Information

2010 · Vol 32 (7) · pp. 1342-1343 · Author(s): Kiran S. Balagani, Vir V. Phoha

2016 · Vol 6 (1) · pp. 11-24 · Author(s): Muhammad A. Sulaiman, Jane Labadin

Mutual information (MI) is an information-theoretic quantity frequently used in recent years as a criterion in feature selection methods, owing to its ability to capture both linear and non-linear dependencies between two variables. In theory, mutual information is defined in terms of the probability density functions (pdfs), or equivalently the entropies, of the two variables. In most machine learning applications, mutual information estimation is formulated for classification problems (that is, data with labeled outputs). This study investigates mutual information estimation as a feature selection criterion for regression tasks and introduces an enhancement for selecting an optimal feature subset. Specifically, focusing on regression tasks, it builds on previous work that proposed a scientifically sound stopping criterion for greedy feature selection algorithms. Four real-world regression datasets were used in this study: three are public datasets obtained from the UCI Machine Learning Repository, and the remaining one is a private well-log dataset. Two machine learning models, multiple regression and artificial neural networks (ANN), were used to test the performance of the proposed method, IFSMIR. The results obtained demonstrate the effectiveness of the proposed method.
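The abstract does not reproduce the authors' IFSMIR algorithm, so the sketch below is only an illustration of the general idea: greedy forward feature selection for regression driven by a mutual information estimator, where MI can be written in entropy terms as I(X;Y) = H(X) + H(Y) - H(X,Y). The mRMR-style score (relevance minus mean redundancy) and the "stop at zero net gain" rule are assumptions standing in for the paper's criterion, not the published method.

```python
# Minimal sketch of greedy, MI-based forward feature selection for a
# regression task. NOT the paper's IFSMIR algorithm: the mRMR-style score
# and the stopping rule below are illustrative assumptions.
import numpy as np
from sklearn.feature_selection import mutual_info_regression


def greedy_mi_selection(X, y, max_features=None, random_state=0):
    """At each step add the feature maximizing MI(feature; target) minus the
    mean MI with already-selected features; stop when no candidate helps."""
    n_features = X.shape[1]
    max_features = max_features or n_features
    # Relevance: k-NN based MI estimate between each feature and the target,
    # which captures non-linear as well as linear dependence.
    relevance = mutual_info_regression(X, y, random_state=random_state)
    selected, remaining = [], list(range(n_features))
    while remaining and len(selected) < max_features:
        best_j, best_score = None, -np.inf
        for j in remaining:
            # Redundancy: mean MI between candidate j and selected features.
            redundancy = np.mean([
                mutual_info_regression(X[:, [j]], X[:, s],
                                       random_state=random_state)[0]
                for s in selected
            ]) if selected else 0.0
            if relevance[j] - redundancy > best_score:
                best_j, best_score = j, relevance[j] - redundancy
        if best_score <= 0:  # assumed stopping rule: no net information gain
            break
        selected.append(best_j)
        remaining.remove(best_j)
    return selected


if __name__ == "__main__":
    # Synthetic regression data: x1 drives y non-linearly, x3 is a near copy
    # of x1 (redundant), x4 is pure noise.
    rng = np.random.default_rng(0)
    n = 500
    x1, x2, x4 = rng.normal(size=(3, n))
    x3 = x1 + 0.05 * rng.normal(size=n)
    X = np.column_stack([x1, x2, x3, x4])
    y = np.sin(x1) + 0.5 * x2 + 0.1 * rng.normal(size=n)
    print("Selected feature indices:", greedy_mi_selection(X, y))
```

On this synthetic data the greedy loop should pick x1 and x2 and then stop, since x3 adds almost no information beyond x1 and x4 carries none.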


2022 · Author(s): Krzysztof Gajowniczek, Jialin Wu, Soumyajit Gupta, Chandrajit Bajaj

Author(s): Gang Liu, Chunlei Yang, Sen Liu, Chunbao Xiao, Bin Song

A feature selection method based on mutual information and a support vector machine (SVM) is proposed in order to eliminate redundant features and improve classification accuracy. First, the local correlation between features and the overall correlation are calculated using mutual information. This correlation reflects the information-inclusion relationship between features, so features are evaluated and redundant ones are eliminated by analyzing it. Subsequently, the concept of mean impact value (MIV) is defined, and the degree of influence of each input variable on the output of the SVM network is calculated based on MIV. The importance weights of the features, described by their MIVs, are sorted in descending order. Finally, the SVM classifier performs feature selection according to the classification accuracy of feature combinations, using the MIV ordering of the features as a reference. Simulation experiments carried out on three standard UCI data sets show that this method not only effectively reduces the feature dimensionality while maintaining high classification accuracy, but also ensures good robustness.
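The abstract names the three stages but not their exact formulas, so the following is a minimal sketch of such a pipeline under stated assumptions: pairwise MI above an assumed threshold flags a redundant pair, MIV is approximated in the common way by perturbing each input by ±10% and averaging the change in the SVM's decision values, and MIV-ordered subsets are compared by cross-validated accuracy. The threshold, perturbation size, and the load_wine example data are illustrative choices, not values from the paper.

```python
# Minimal sketch of the described pipeline: (1) drop redundant features via
# pairwise mutual information, (2) rank survivors by mean impact value (MIV),
# (3) pick the best MIV-ordered subset by cross-validated accuracy.
import numpy as np
from sklearn.datasets import load_wine
from sklearn.feature_selection import mutual_info_classif, mutual_info_regression
from sklearn.model_selection import cross_val_score
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def drop_redundant(X, y, threshold=1.0):
    """From each highly redundant feature pair (pairwise MI above an assumed
    threshold, in nats), drop the feature less relevant to the class."""
    relevance = mutual_info_classif(X, y, random_state=0)
    removed = set()
    for i in range(X.shape[1]):
        for j in range(i + 1, X.shape[1]):
            if i in removed or j in removed:
                continue
            pair_mi = mutual_info_regression(X[:, [i]], X[:, j], random_state=0)[0]
            if pair_mi > threshold:
                removed.add(i if relevance[i] < relevance[j] else j)
    return [f for f in range(X.shape[1]) if f not in removed]


def mean_impact_values(model, X, delta=0.10):
    """Approximate MIV: perturb each feature by +/-delta (relative) and
    average the absolute change in the SVM decision values."""
    mivs = []
    for k in range(X.shape[1]):
        X_hi, X_lo = X.copy(), X.copy()
        X_hi[:, k] *= 1.0 + delta
        X_lo[:, k] *= 1.0 - delta
        diff = model.decision_function(X_hi) - model.decision_function(X_lo)
        mivs.append(np.abs(diff).mean())
    return np.array(mivs)


if __name__ == "__main__":
    data = load_wine()  # a standard UCI data set bundled with scikit-learn
    X = StandardScaler().fit_transform(data.data)
    y = data.target

    kept = drop_redundant(X, y)
    svm = SVC(kernel="rbf").fit(X[:, kept], y)
    order = [kept[i] for i in np.argsort(mean_impact_values(svm, X[:, kept]))[::-1]]

    # Evaluate feature combinations in descending MIV order.
    best_subset, best_acc = None, 0.0
    for m in range(1, len(order) + 1):
        acc = cross_val_score(SVC(kernel="rbf"), X[:, order[:m]], y, cv=5).mean()
        if acc > best_acc:
            best_subset, best_acc = order[:m], acc
    print(f"Best subset {best_subset} with CV accuracy {best_acc:.3f}")
```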

