Robust multiobjective evolutionary feature subset selection algorithm for binary classification using machine learning techniques

The past 10 years have seen the prediction of software defects proposed by many researchers using various metrics based on measurable aspects of source code entities (e.g. methods, classes, files or modules) and the social structure of software project in an effort to predict the software defects. However, these metrics could not predict very high accuracies in terms of sensitivity, specificity and accuracy. In this chapter, we propose the use of machine learning techniques to predict software defects. The effectiveness of all these techniques is demonstrated on ten datasets taken from literature. Based on an experiment, it is observed that PNN outperformed all other techniques in terms of accuracy and sensitivity in all the software defects datasets followed by CART and Group Method of data handling. We also performed feature selection by t-statistics based approach for selecting feature subsets across different folds for a given technique and followed by the feature subset selection. By taking the most important variables, we invoked the classifiers again and observed that PNN outperformed other classifiers in terms of sensitivity and accuracy. Moreover, the set of ‘if- then rules yielded by J48 and CART can be used as an expert system for prediction of software defects.

Download Full-text

SVM and KNN Based SGO Feature Selection Algorithm for Breast Cancer Diagnosis

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.d4428.038620 ◽

2020 ◽

Vol 8 (2S7) ◽

pp. 2237-2240

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Feature Selection ◽

Learning Algorithms ◽

Subset Selection ◽

Machine Learning Algorithms ◽

Feature Subset Selection ◽

Feature Subset ◽

Selection Algorithm ◽

Feature Selection Algorithm

In diagnosis and prediction systems, algorithms working on datasets with a high number of dimensions tend to take more time than those with fewer dimensions. Feature subset selection algorithms enhance the efficiency of Machine Learning algorithms in prediction problems by selecting a subset of the total features and thus pruning redundancy and noise. In this article, such a feature subset selection method is proposed and implemented to diagnose breast cancer using Support Vector Machine (SVM) and K-Nearest Neighbor (KNN) algorithms. This feature selection algorithm is based on Social Group Optimization (SGO) an evolutionary algorithm. Higher accuracy in diagnosing breast cancer is achieved using our proposed model when compared to other feature selection-based Machine Learning algorithms

Download Full-text

Interaction between feature subset selection techniques and machine learning classifiers for detecting unsolicited emails

ACM SIGAPP Applied Computing Review ◽

10.1145/2600617.2600622 ◽

2014 ◽

Vol 14 (1) ◽

pp. 53-61 ◽

Cited By ~ 15

Author(s):

Shrawan Kumar Trivedi ◽

Shubhamoy Dey

Keyword(s):

Machine Learning ◽

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Machine Learning Classifiers ◽

Learning Classifiers

Download Full-text

A conservative feature subset selection algorithm with missing data

Neurocomputing ◽

10.1016/j.neucom.2009.05.019 ◽

2010 ◽

Vol 73 (4-6) ◽

pp. 585-590 ◽

Cited By ~ 12

Author(s):

Alex Aussem ◽

Sergio Rodrigues de Morais

Keyword(s):

Missing Data ◽

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Selection Algorithm

Download Full-text

Improved Intrusion Detection Algorithm based on TLBO and GA Algorithms

The International Arab Journal of Information Technology ◽

10.34028/iajit/18/2/5 ◽

2021 ◽

Vol 18 (2) ◽

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Optimization Algorithm ◽

Feature Subset Selection ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Feature Subset ◽

Teaching Learning Based Optimization ◽

Teaching Learning

Optimization algorithms are widely used for the identification of intrusion. This is attributable to the increasing number of audit data features and the decreasing performance of human-based smart Intrusion Detection Systems (IDS) regarding classification accuracy and training time. In this paper, an improved method for intrusion detection for binary classification was presented and discussed in detail. The proposed method combined the New Teaching-Learning-Based Optimization Algorithm (NTLBO), Support Vector Machine (SVM), Extreme Learning Machine (ELM), and Logistic Regression (LR) (feature selection and weighting) NTLBO algorithm with supervised machine learning techniques for Feature Subset Selection (FSS). The process of selecting the least number of features without any effect on the result accuracy in FSS was considered a multi-objective optimization problem. The NTLBO was proposed in this paper as an FSS mechanism; its algorithm-specific, parameter-less concept (which requires no parameter tuning during an optimization) was explored. The experiments were performed on the prominent intrusion machine-learning datasets (KDDCUP’99 and CICIDS 2017), where significant enhancements were observed with the suggested NTLBO algorithm as compared to the classical Teaching-Learning-Based Optimization algorithm (TLBO), NTLBO presented better results than TLBO and many existing works. The results showed that NTLBO reached 100% accuracy for KDDCUP’99 dataset and 97% for CICIDS dataset

Download Full-text

A novel Markov boundary based feature subset selection algorithm

Neurocomputing ◽

10.1016/j.neucom.2009.05.018 ◽

2010 ◽

Vol 73 (4-6) ◽

pp. 578-584 ◽

Cited By ~ 17

Author(s):

Sérgio Rodrigues de Morais ◽

Alex Aussem

Keyword(s):

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Selection Algorithm

Download Full-text

Feature Subset Selection Algorithm for High-Minded Dimensional Data by Using Fast Cluster

International Journal of Engineering Trends and Technology ◽

10.14445/22315381/ijett-v14p246 ◽

2014 ◽

Vol 14 (5) ◽

pp. 232-237

Author(s):

B.Swarna Kumari ◽

◽

M.Doorvasulu Naidu

Keyword(s):

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Selection Algorithm

Download Full-text

The feature subset selection algorithm

Journal of Electronics (China) ◽

10.1007/s11767-003-0088-5 ◽

2003 ◽

Vol 20 (1) ◽

pp. 57-61

Author(s):

Yongguo Liu ◽

Xueming Li ◽

Zhongfu Wu

Keyword(s):

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Selection Algorithm

Download Full-text

Predicting Takeover Success Using Machine Learning Techniques

Journal of Business & Economics Research (JBER) ◽

10.19030/jber.v10i10.7264 ◽

2012 ◽

Vol 10 (10) ◽

pp. 547

Author(s):

Mei Zhang ◽

Gregory Johnson ◽

Jia Wang

Keyword(s):

Machine Learning ◽

Learning Community ◽

Binary Classification ◽

Classification Problem ◽

Machine Learning Techniques ◽

Success Prediction ◽

Support Vector ◽

Font Size ◽

Network Support ◽

Learning Techniques

A takeover success prediction model aims at predicting the probability that a takeover attempt will succeed by using publicly available information at the time of the announcement. We perform a thorough study using machine learning techniques to predict takeover success. Specifically, we model takeover success prediction as a binary classification problem, which has been widely studied in the machine learning community. Motivated by the recent advance in machine learning, we empirically evaluate and analyze many state-of-the-art classifiers, including logistic regression, artificial neural network, support vector machines with different kernels, decision trees, random forest, and Adaboost. The experiments validate the effectiveness of applying machine learning in takeover success prediction, and we found that the support vector machine with linear kernel and the Adaboost with stump weak classifiers perform the best for the task. The result is consistent with the general observations of these two approaches.

Download Full-text

An Implementation of Novel Feature Subset Selection Algorithm for IDS in Mobile Networks

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2019/43852019 ◽

2019 ◽

Vol 8 (5) ◽

pp. 2132-2141

Author(s):

N Chandra Sekhar Reddy ◽

Keyword(s):

Mobile Networks ◽

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Selection Algorithm

Download Full-text