Improved Relief Weight Feature Selection Algorithm Based on Relief and Mutual Information

As the classic feature selection algorithm, the Relief algorithm has the advantages of simple computation and high efficiency, but the algorithm itself is limited to only dealing with binary classification problems, and the comprehensive distinguishing ability of the feature subsets composed of the former K features selected by the Relief algorithm is often redundant, as the algorithm cannot select the ideal feature subset. When calculating the correlation and redundancy between characteristics by mutual information, the computation speed is slow because of the high computational complexity and the method’s need to calculate the probability density function of the corresponding features. Aiming to solve the above problems, we first improve the weight of the Relief algorithm, so that it can be used to evaluate a set of candidate feature sets. Then we use the improved joint mutual information evaluation function to replace the basic mutual information computation and solve the problem of computation speed and correlation, and redundancy between features. Finally, a compound correlation feature selection algorithm based on Relief and joint mutual information is proposed using the evaluation function and the heuristic sequential forward search strategy. This algorithm can effectively select feature subsets with small redundancy and strong classification characteristics, and has the excellent characteristics of faster calculation speed.

Download Full-text

A Feature Selection Algorithm Based on Approximate Markov Blanket and Dynamic Mutual Information

Intelligent Science and Intelligent Data Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-642-31919-8_29 ◽

2012 ◽

pp. 226-233

Author(s):

Xiaodan Wang ◽

Xu Yao ◽

Yuxi Zhang ◽

Lei Lei

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Markov Blanket

Download Full-text

Research on Spam Filtering Technology Based on New Mutual Information Feature Selection Algorithm

Journal of Physics Conference Series ◽

10.1088/1742-6596/1673/1/012028 ◽

2020 ◽

Vol 1673 ◽

pp. 012028

Author(s):

Kunfu Wang ◽

Wanfeng Mao ◽

Wei Feng ◽

Hui Wang

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Spam Filtering ◽

Selection Algorithm ◽

Feature Selection Algorithm

Download Full-text

Modulation Recognition of Digital Multimedia Signal Based on Data Feature Selection

International Journal of Mobile Computing and Multimedia Communications ◽

10.4018/ijmcmc.2017070107 ◽

2017 ◽

Vol 8 (3) ◽

pp. 90-111 ◽

Cited By ~ 2

Author(s):

Hui Wang ◽

Li Li Guo ◽

Yun Lin

Keyword(s):

Feature Selection ◽

Information Entropy ◽

Feature Subset ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Modulation Recognition ◽

Signal Modulation ◽

Digital Multimedia ◽

Optimal Feature Subset ◽

Optimal Feature

Automatic modulation recognition is very important for the receiver design in the broadband multimedia communication system, and the reasonable signal feature extraction and selection algorithm is the key technology of Digital multimedia signal recognition. In this paper, the information entropy is used to extract the single feature, which are power spectrum entropy, wavelet energy spectrum entropy, singular spectrum entropy and Renyi entropy. And then, the feature selection algorithm of distance measurement and Sequential Feature Selection(SFS) are presented to select the optimal feature subset. Finally, the BP neural network is used to classify the signal modulation. The simulation result shows that the four-different information entropy can be used to classify different signal modulation, and the feature selection algorithm is successfully used to choose the optimal feature subset and get the best performance.

Download Full-text

An Improved Mutual Information-Based Feature Selection Algorithm for Text Classification

2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics ◽

10.1109/ihmsc.2013.37 ◽

2013 ◽

Author(s):

Xiao-Yu Jiang ◽

Jin Shui

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Text Classification ◽

Selection Algorithm ◽

Feature Selection Algorithm

Download Full-text

A NOVEL FEATURE SELECTION ALGORITHM WITH SUPERVISED MUTUAL INFORMATION FOR CLASSIFICATION

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213013500279 ◽

2013 ◽

Vol 22 (04) ◽

pp. 1350027

Author(s):

JAGANATHAN PALANICHAMY ◽

KUPPUCHAMY RAMASAMY

Keyword(s):

Machine Learning ◽

Data Mining ◽

Feature Selection ◽

Mutual Information ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Class A ◽

Selection Algorithms ◽

The Relationship ◽

Class Variable

Feature selection is essential in data mining and pattern recognition, especially for database classification. During past years, several feature selection algorithms have been proposed to measure the relevance of various features to each class. A suitable feature selection algorithm normally maximizes the relevancy and minimizes the redundancy of the selected features. The mutual information measure can successfully estimate the dependency of features on the entire sampling space, but it cannot exactly represent the redundancies among features. In this paper, a novel feature selection algorithm is proposed based on maximum relevance and minimum redundancy criterion. The mutual information is used to measure the relevancy of each feature with class variable and calculate the redundancy by utilizing the relationship between candidate features, selected features and class variables. The effectiveness is tested with ten benchmarked datasets available in UCI Machine Learning Repository. The experimental results show better performance when compared with some existing algorithms.

Download Full-text

WJMI: A New Feature Selection Algorithm Based on Weighted Joint Mutual Information

Proceedings of the 3rd International Conference on Mechatronics and Industrial Informatics ◽

10.2991/icmii-15.2015.108 ◽

2015 ◽

Cited By ~ 1

Author(s):

Xiuli Qi ◽

Chengxiang Yin ◽

Kai Cheng ◽

Xianglin Liao ◽

Xingdang Kang

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

New Feature

Download Full-text

Hospital readmission prediction based on improved feature selection using grey relational analysis and LASSO

Grey Systems Theory and Application ◽

10.1108/gs-12-2020-0168 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Nor Hamizah Miswan ◽

Chee Seng Chan ◽

Chong Guan Ng

Keyword(s):

Feature Selection ◽

Grey Relational Analysis ◽

Hospital Readmission ◽

Grey System Theory ◽

Feature Subset ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Content Type ◽

Relational Analysis ◽

Grey Relational

PurposeThis paper develops a robust hospital readmission prediction framework by combining the feature selection algorithm and machine learning (ML) classifiers. The improved feature selection is proposed by considering the uncertainty in patient's attributes that leads to the output variable.Design/methodology/approachFirst, data preprocessing is conducted which includes how raw data is managed. Second, the impactful features are selected through feature selection process. It started with calculating the relational grade of each patient towards readmission using grey relational analysis (GRA) and the grade is used as the target values for feature selection. Then, the influenced features are selected using the Least Absolute Shrinkage and Selection Operator (LASSO) method. This proposed method is termed as Grey-LASSO feature selection. The final task is the readmission prediction using ML classifiers.FindingsThe proposed method offered good performances with a minimum feature subset up to 54–65% discarded features. Multi-Layer Perceptron with Grey-LASSO gave the best performance.Research limitations/implicationsThe performance of Grey-LASSO is justified in two readmission datasets. Further research is required to examine the generalisability to other datasets.Originality/valueIn designing the feature selection algorithm, the selection on influenced input variables was based on the integration of GRA and LASSO. Specifically, GRA is a part of the grey system theory, which was employed to analyse the relation between systems under uncertain conditions. The LASSO approach was adopted due to its ability for sparse data representation.

Download Full-text

A Feature Selection Algorithm based on Hoeffding Inequality and Mutual Information

International Journal of Signal Processing Image Processing and Pattern Recognition ◽

10.14257/ijsip.2015.8.11.39 ◽

2015 ◽

Vol 8 (11) ◽

pp. 433-444 ◽

Cited By ~ 1

Author(s):

Chunyong Yin ◽

Lu Feng ◽

Luyu Ma ◽

Zhichao Yin ◽

Jin Wang

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Hoeffding Inequality

Download Full-text

MRF-RFS: A Modified Random Forest Recursive Feature Selection Algorithm for Nasopharyngeal Carcinoma Segmentation

Methods of Information in Medicine ◽

10.1055/s-0040-1721791 ◽

2020 ◽

Vol 59 (04/05) ◽

pp. 151-161

Author(s):

Yuchen Fei ◽

Fengyu Zhang ◽

Chen Zu ◽

Mei Hong ◽

Xingchen Peng ◽

...

Keyword(s):

Feature Selection ◽

Random Forest ◽

Nasopharyngeal Carcinoma ◽

Soft Tissues ◽

Feature Selection Method ◽

Selection Method ◽

Feature Subset ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Tumor Margins

Abstract Background An accurate and reproducible method to delineate tumor margins is of great importance in clinical diagnosis and treatment. In nasopharyngeal carcinoma (NPC), due to limitations such as high variability, low contrast, and discontinuous boundaries in presenting soft tissues, tumor margin can be extremely difficult to identify in magnetic resonance imaging (MRI), increasing the challenge of NPC segmentation task. Objectives The purpose of this work is to develop a semiautomatic algorithm for NPC image segmentation with minimal human intervention, while it is also capable of delineating tumor margins with high accuracy and reproducibility. Methods In this paper, we propose a novel feature selection algorithm for the identification of the margin of NPC image, named as modified random forest recursive feature selection (MRF-RFS). Specifically, to obtain a more discriminative feature subset for segmentation, a modified recursive feature selection method is applied to the original handcrafted feature set. Moreover, we combine the proposed feature selection method with the classical random forest (RF) in the training stage to take full advantage of its intrinsic property (i.e., feature importance measure). Results To evaluate the segmentation performance, we verify our method on the T1-weighted MRI images of 18 NPC patients. The experimental results demonstrate that the proposed MRF-RFS method outperforms the baseline methods and deep learning methods on the task of segmenting NPC images. Conclusion The proposed method could be effective in NPC diagnosis and useful for guiding radiation therapy.

Download Full-text

Revealing False Positive Features in Epileptic EEG Identification

International Journal of Neural Systems ◽

10.1142/s0129065720500173 ◽

2020 ◽

Vol 30 (11) ◽

pp. 2050017 ◽

Cited By ~ 1

Author(s):

Jian Lian ◽

Yunfeng Shi ◽

Yan Zhang ◽

Weikuan Jia ◽

Xiaojun Fan ◽

...

Keyword(s):

Feature Selection ◽

Nearest Neighbor ◽

State Of The Art ◽

The State ◽

Support Vector ◽

Feature Subset ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Eeg Signals ◽

Eeg Classification

Feature selection plays a vital role in the detection and discrimination of epileptic seizures in electroencephalogram (EEG) signals. The state-of-the-art EEG classification techniques commonly entail the extraction of the multiple features that would be fed into classifiers. For some techniques, the feature selection strategies have been used to reduce the dimensionality of the entire feature space. However, most of these approaches focus on the performance of classifiers while neglecting the association between the feature and the EEG activity itself. To enhance the inner relationship between the feature subset and the epileptic EEG task with a promising classification accuracy, we propose a machine learning-based pipeline using a novel feature selection algorithm built upon a knockoff filter. First, a number of temporal, spectral, and spatial features are extracted from the raw EEG signals. Second, the proposed feature selection algorithm is exploited to obtain the optimal subgroup of features. Afterwards, three classifiers including [Formula: see text]-nearest neighbor (KNN), random forest (RF) and support vector machine (SVM) are used. The experimental results on the Bonn dataset demonstrate that the proposed approach outperforms the state-of-the-art techniques, with accuracy as high as 99.93% for normal and interictal EEG discrimination and 98.95% for interictal and ictal EEG classification. Meanwhile, it has achieved satisfactory sensitivity (95.67% in average), specificity (98.83% in average), and accuracy (98.89% in average) over the Freiburg dataset.

Download Full-text