Feature Selection Combining Information Theory View and Algebraic View in the Neighborhood Decision System

Feature selection is a core topic in rough set theory and its applications. Because many feature selection algorithms based on rough set theory and its extensions fall short in reduction ability and classification performance, this paper proposes a feature selection algorithm that combines the information theory view and the algebraic view in the neighborhood decision system. First, the neighborhood relation of the neighborhood rough set model is used to retain the classification information of continuous data, and several uncertainty measures based on neighborhood information entropy are studied. Second, to fully reflect the decision ability and classification performance of the neighborhood system, neighborhood credibility and neighborhood coverage are defined and introduced into the neighborhood joint entropy. Third, a feature selection algorithm based on neighborhood joint entropy is designed, which addresses the limitation that most feature selection algorithms consider only the information-theoretic definition or only the algebraic definition. Finally, experiments and statistical analyses on nine data sets show that the algorithm can effectively select the optimal feature subset, and that the selected subset maintains or improves the classification performance on the data set.
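To make the neighborhood idea concrete, here is a minimal Python sketch of one commonly used form of neighborhood entropy and neighborhood joint entropy for continuous data. It is an illustration under stated assumptions, not the algorithm from this paper: the Euclidean distance, the radius parameter `delta`, and the function names are all hypothetical choices, and the credibility and coverage terms mentioned in the abstract are not reproduced because the abstract does not define them.

```python
import math

def neighborhoods(X, delta):
    """For each sample, return the indices of samples within distance delta.

    Assumption: Euclidean distance; the abstract does not name a metric.
    """
    n = len(X)
    return [
        {j for j in range(n) if math.dist(X[i], X[j]) <= delta}
        for i in range(n)
    ]

def neighborhood_entropy(X, delta):
    """Average of -log2(|neighborhood| / |U|) over all samples."""
    n = len(X)
    return -sum(math.log2(len(nb) / n) for nb in neighborhoods(X, delta)) / n

def neighborhood_joint_entropy(X, y, delta):
    """Same average, but each neighborhood is intersected with the
    sample's own decision class before its size is measured."""
    n = len(X)
    total = 0.0
    for i, nb in enumerate(neighborhoods(X, delta)):
        same_class = {j for j in nb if y[j] == y[i]}  # always contains i
        total += math.log2(len(same_class) / n)
    return -total / n

# Hypothetical toy data: one continuous feature, two tight clusters.
X = [(0.0,), (0.1,), (1.0,), (1.1,)]
print(neighborhood_entropy(X, 0.2))
print(neighborhood_joint_entropy(X, [0, 0, 1, 1], 0.2))
```

In this sketch, a feature subset whose neighborhoods agree with the decision classes gives a joint entropy close to the plain neighborhood entropy, while mixed neighborhoods push the joint entropy up.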


An exact feature selection algorithm based on rough set theory

Complexity ◽ 10.1002/cplx.21526 ◽ 2014 ◽ Vol 20 (5) ◽ pp. 50-62 ◽ Cited By ~ 4

Author(s): Mohammad Taghi Rezvan ◽ Ali Zeinal Hamadani ◽ Seyed Reza Hejazi

Keyword(s): Feature Selection ◽ Set Theory ◽ Rough Set ◽ Rough Set Theory ◽ Selection Algorithm ◽ Feature Selection Algorithm


An effective supervised filter based feature selection algorithm using rough set theory

2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS) ◽ 10.1109/icecds.2017.8389865 ◽ 2017

Author(s): Rubul Kumar Bania

Keyword(s): Feature Selection ◽ Set Theory ◽ Rough Set ◽ Rough Set Theory ◽ Selection Algorithm ◽ Feature Selection Algorithm


Classification Performance Improvement Using Random Subset Feature Selection Algorithm for Data Mining

Big Data Research ◽ 10.1016/j.bdr.2018.02.007 ◽ 2018 ◽ Vol 12 ◽ pp. 1-12 ◽ Cited By ~ 7

Author(s): Lakshmipadmaja D ◽ B. Vishnuvardhan

Keyword(s): Data Mining ◽ Feature Selection ◽ Performance Improvement ◽ Classification Performance ◽ Selection Algorithm ◽ Feature Selection Algorithm ◽ Random Subset


Rough Set-hypergraph-based Feature Selection Approach for Intrusion Detection Systems

Defence Science Journal ◽ 10.14429/dsj.66.10802 ◽ 2016 ◽ Vol 66 (6) ◽ pp. 612 ◽ Cited By ~ 14

Author(s): M.R. Gauthama Raman ◽ K. Kannan ◽ S.K. Pal ◽ V. S. Shankar Sriram

Keyword(s): Feature Selection ◽ Intrusion Detection ◽ Rough Set ◽ Network Architecture ◽ Cyber Attacks ◽ Intrusion Detection Systems ◽ Selection Algorithm ◽ Feature Selection Algorithm ◽ Detection Systems ◽ Helly Property

Immense growth in network-based services has resulted in an upsurge of internet users, security threats and cyber-attacks. Intrusion detection systems (IDSs) have become an essential component of any network architecture, securing an IT infrastructure against the malicious activities of intruders. An efficient IDS should be able to detect, identify and track the malicious attempts made by intruders. With many IDSs available in the literature, the most common challenge posed by voluminous network traffic patterns is the curse of dimensionality. This scenario emphasizes the importance of feature selection algorithms, which can identify the relevant features and ignore the rest without any information loss. In this paper, a novel rough set κ-Helly property technique (RSKHT) feature selection algorithm is proposed to identify the key features for network IDSs. Experiments carried out on the benchmark KDD Cup 1999 dataset were found to be promising when compared with existing feature selection algorithms with respect to reduct size, classifier performance and time complexity. RSKHT was found to be computationally attractive and flexible for massive datasets.


Hybrid Rough Set with Black Hole Optimization Based Feature Selection Algorithm for Protein Structure Prediction

International Journal of Advanced Intelligence Paradigms ◽ 10.1504/ijaip.2018.10023064 ◽ 2018 ◽ Vol 10 (1) ◽ pp. 1

Author(s): M. Bagyamathi ◽ Ahmad Taher Azar ◽ H. Hannah Inbarani

Keyword(s): Black Hole ◽ Feature Selection ◽ Protein Structure ◽ Protein Structure Prediction ◽ Rough Set ◽ Structure Prediction ◽ Selection Algorithm ◽ Feature Selection Algorithm


A NOVEL FEATURE SELECTION ALGORITHM WITH SUPERVISED MUTUAL INFORMATION FOR CLASSIFICATION

International Journal of Artificial Intelligence Tools ◽ 10.1142/s0218213013500279 ◽ 2013 ◽ Vol 22 (04) ◽ pp. 1350027

Author(s): JAGANATHAN PALANICHAMY ◽ KUPPUCHAMY RAMASAMY

Keyword(s): Machine Learning ◽ Data Mining ◽ Feature Selection ◽ Mutual Information ◽ Selection Algorithm ◽ Feature Selection Algorithm ◽ Class A ◽ Selection Algorithms ◽ The Relationship ◽ Class Variable

Feature selection is essential in data mining and pattern recognition, especially for database classification. Over the past years, several feature selection algorithms have been proposed to measure the relevance of various features to each class. A suitable feature selection algorithm normally maximizes the relevance and minimizes the redundancy of the selected features. The mutual information measure can successfully estimate the dependency of features on the entire sampling space, but it cannot exactly represent the redundancies among features. In this paper, a novel feature selection algorithm is proposed based on the maximum relevance and minimum redundancy criterion. Mutual information is used to measure the relevance of each feature to the class variable and to calculate redundancy from the relationship between candidate features, selected features and class variables. The effectiveness is tested on ten benchmark datasets from the UCI Machine Learning Repository. The experimental results show better performance than some existing algorithms.
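The maximum relevance and minimum redundancy criterion described above can be illustrated with a short Python sketch. This is the classic greedy form of the criterion on discrete features, not the supervised variant proposed in this paper, whose exact redundancy term is not given in the abstract. The function names, the dictionary input format and the toy data are hypothetical.

```python
import math
from collections import Counter

def mutual_information(a, b):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(a)
    pa, pb, pab = Counter(a), Counter(b), Counter(zip(a, b))
    return sum(
        (c / n) * math.log2(c * n / (pa[x] * pb[y]))
        for (x, y), c in pab.items()
    )

def mrmr_select(features, labels, k):
    """Greedily pick up to k feature names.

    features: dict mapping a feature name to its list of discrete values.
    Score = relevance to the labels minus mean redundancy with the
    features already selected (assumed classic difference form).
    """
    selected = []
    remaining = list(features)

    def score(name):
        relevance = mutual_information(features[name], labels)
        if not selected:
            return relevance
        redundancy = sum(
            mutual_information(features[name], features[s]) for s in selected
        ) / len(selected)
        return relevance - redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=score)  # ties go to the earliest name
        selected.append(best)
        remaining.remove(best)
    return selected

# Hypothetical toy data: the label is determined by x1 and x2 together;
# x1_copy duplicates x1, so it is relevant but fully redundant.
features = {
    "x1": [0, 0, 1, 1],
    "x1_copy": [0, 0, 1, 1],
    "x2": [0, 1, 0, 1],
}
labels = [0, 1, 2, 3]
print(mrmr_select(features, labels, 2))
```

On this toy input the duplicate feature is passed over in the second round, because its redundancy with `x1` cancels its relevance, while `x2` adds information that `x1` does not carry.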


Artificial Bee Colony–Based Feature Selection Algorithm for Cyberbullying

The Computer Journal ◽ 10.1093/comjnl/bxaa066 ◽ 2020

Author(s): Esra Sarac Essiz ◽ Murat Oturakci

Keyword(s): Feature Selection ◽ Artificial Bee Colony ◽ Feature Selection Method ◽ Classification Performance ◽ Selection Method ◽ Selection Algorithm ◽ Feature Selection Algorithm ◽ Traditional Methods ◽ Bee Colony ◽ Nature Inspired Algorithm

As a nature-inspired algorithm, artificial bee colony (ABC) is an optimization algorithm inspired by the search behaviour of honey bees. The main aim of this study is to examine the effects of an ABC-based feature selection algorithm on classification performance for cyberbullying, which has become a significant worldwide social issue in recent years. To this end, the classification performance of the proposed ABC-based feature selection method is compared with three traditional methods: information gain, ReliefF and chi-square. Experimental results show that the ABC-based feature selection method outperforms the three traditional methods for the detection of cyberbullying. The macro-averaged F-measure on the data set increases from 0.659 to 0.8 with the proposed ABC-based feature selection method.
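For readers unfamiliar with the reported metric, the following Python sketch shows how a macro-averaged F-measure is typically computed: an F1 score per class, then an unweighted mean over classes. It assumes the balanced F1 form of the F-measure, which the abstract does not state explicitly, and the function name and toy labels are hypothetical.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (assumed F-measure form)."""
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == c and p == c for t, p in pairs)  # true positives
        fp = sum(t != c and p == c for t, p in pairs)  # false positives
        fn = sum(t == c and p != c for t, p in pairs)  # false negatives
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Hypothetical toy labels: 1 = cyberbullying, 0 = not cyberbullying.
print(macro_f1([1, 1, 0, 0], [1, 0, 0, 0]))
```

Because every class contributes equally to the mean, a minority class such as the cyberbullying class weighs as much as the majority class, which is one reason macro averaging is often preferred on imbalanced data.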
