An Enhancing Grasshopper Optimization for Efficient Feature Selection

Author(s):  
Trong-The Nguyen ◽  
Shi-Jie Jiang ◽  
Thi-Kien Dao ◽  
Truong-Giang Ngo ◽  
Thi-Thanh-Tan Nguyen ◽  
...  
2018 ◽  
Vol 145 ◽  
pp. 25-45 ◽  
Author(s):  
Majdi Mafarja ◽  
Ibrahim Aljarah ◽  
Ali Asghar Heidari ◽  
Abdelaziz I. Hammouri ◽  
Hossam Faris ◽  
...  

2018 ◽  
Vol 10 (3) ◽  
pp. 478-495 ◽  
Author(s):  
Ibrahim Aljarah ◽  
Ala’ M. Al-Zoubi ◽  
Hossam Faris ◽  
Mohammad A. Hassonah ◽  
Seyedali Mirjalili ◽  
...  

Genes ◽  
2020 ◽  
Vol 11 (7) ◽  
pp. 717
Author(s):  
Garba Abdulrauf Sharifai ◽  
Zurinahni Zainol

Training a machine learning algorithm on an imbalanced data set is an inherently challenging task. It becomes more demanding when samples are limited but the number of features is massive (high dimensionality). High-dimensional, imbalanced data sets pose severe challenges in many real-world applications, such as biomedical data sets. Numerous researchers have investigated either class imbalance or high dimensionality and proposed various methods. Nonetheless, few approaches reported in the literature address the intersection of the two problems, because of their complicated interactions. Lately, feature selection has become a well-known technique for overcoming this problem by selecting discriminative features that represent both the minority and majority classes. This paper proposes a new method called Robust Correlation Based Redundancy and Binary Grasshopper Optimization Algorithm (rCBR-BGOA), which employs an ensemble of multi-filters coupled with the Correlation-Based Redundancy method to select optimal feature subsets. A binary Grasshopper Optimisation Algorithm (BGOA) is used to cast feature selection as an optimisation problem and to select the best (near-optimal) combination of features from the majority and minority classes. The obtained results, supported by statistical analysis, indicate that rCBR-BGOA can improve classification performance on high-dimensional, imbalanced data sets in terms of the G-mean and Area Under the Curve (AUC) performance metrics.
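The G-mean named in the abstract above is commonly defined as the geometric mean of sensitivity and specificity. The following is a minimal sketch under that assumption, not code from the paper; the function names and the confusion-matrix counts are hypothetical and serve only to show why G-mean is preferred over plain accuracy on imbalanced data.

```python
import math


def g_mean(tp: int, fn: int, tn: int, fp: int) -> float:
    """Geometric mean of sensitivity and specificity (binary case)."""
    # Sensitivity: fraction of positive (minority) samples recovered.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: fraction of negative (majority) samples recovered.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


def accuracy(tp: int, fn: int, tn: int, fp: int) -> float:
    """Plain accuracy, shown only for comparison."""
    total = tp + fn + tn + fp
    return (tp + tn) / total if total else 0.0


# Hypothetical counts: a classifier that labels everything as the
# majority class on a 10-vs-990 data set.
always_majority = dict(tp=0, fn=10, tn=990, fp=0)
acc = accuracy(**always_majority)   # high, despite missing every minority sample
gm = g_mean(**always_majority)      # zero, because sensitivity is zero
```

Because the product collapses to zero when either class is entirely missed, G-mean penalizes classifiers that ignore the minority class, which accuracy alone does not.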


Author(s):  
M. Jeyakarthic ◽  
A. Thirumalairaj

Background: With the rapid advancement of internet and network technologies, a significant number of intrusions and attacks take place. An intrusion detection system (IDS) is employed to prevent distinct attacks. Several machine learning approaches have been presented for classification in IDS. However, IDS suffer from the curse of dimensionality, which results in increased complexity and decreased resource exploitation. Consequently, the significant features of the data must be identified in order to reduce the dimensionality. Aim: In this article, a new feature selection (FS) based classification system is presented, which carries out both the FS and classification processes. Methods: Here, a binary variant of the Grasshopper Optimization Algorithm, called BGOA, is applied as the FS model. The significant features are integrated using an effective model to extract the useful ones and discard the useless features. The chosen features are given to a feed-forward neural network (FFNN) model, which is trained and tested on the KDD99 dataset. Results: The presented model is validated on the benchmark KDD Cup 1999 dataset. With the FS process included, the classifier results improve, attaining an FPR of 0.43, FNR of 0.45, sensitivity of 99.55, specificity of 99.57, accuracy of 99.56, F-score of 99.59 and kappa value of 99.11. Conclusion: The experimental outcome confirmed the superior performance of the presented model compared to diverse models under several aspects, and the model is found to be an appropriate tool for detecting intrusions.
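The measures reported in the abstract above are related through the confusion matrix: for example, FNR is the complement of sensitivity (100 - 99.55 = 0.45) and FPR is the complement of specificity (100 - 99.57 = 0.43). The sketch below uses the standard textbook definitions, expressed as percentages; it is not the authors' evaluation code, and the function name and counts are hypothetical.

```python
def ids_metrics(tp: int, fn: int, tn: int, fp: int) -> dict:
    """Percent-scaled binary metrics; assumes both classes are present."""
    pos, neg = tp + fn, tn + fp
    total = pos + neg
    # Observed agreement and chance agreement for Cohen's kappa.
    p_obs = (tp + tn) / total
    p_chance = ((tp + fp) * pos + (tn + fn) * neg) / total**2
    return {
        "sensitivity": 100 * tp / pos,
        "specificity": 100 * tn / neg,
        "fnr": 100 * fn / pos,          # equals 100 - sensitivity
        "fpr": 100 * fp / neg,          # equals 100 - specificity
        "accuracy": 100 * p_obs,
        "kappa": 100 * (p_obs - p_chance) / (1 - p_chance),
    }


# Hypothetical counts, chosen only to keep the arithmetic easy to follow.
m = ids_metrics(tp=90, fn=10, tn=80, fp=20)
```

With these illustrative counts, sensitivity and FNR sum to 100, as do specificity and FPR, which mirrors the relationship between the figures quoted in the abstract.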


2020 ◽  
Vol 127 ◽  
pp. 33-53 ◽  
Author(s):  
Dong Wang ◽  
Hongmei Chen ◽  
Tianrui Li ◽  
Jihong Wan ◽  
Yanyong Huang

Author(s):  
Songwei Zhao ◽  
Pengjun Wang ◽  
Ali Asghar Heidari ◽  
Xuehua Zhao ◽  
Chao Ma ◽  
...  
