Efficient Classification of DDoS Attacks Using an Ensemble Feature Selection Algorithm

Abstract In the current cyber world, one of the most severe cyber threats are distributed denial of service (DDoS) attacks, which make websites and other online resources unavailable to legitimate clients. It is different from other cyber threats that breach security parameters; however, DDoS is a short-term attack that brings down the server temporarily. Appropriate selection of features plays a crucial role for effective detection of DDoS attacks. Too many irrelevant features not only produce unrelated class categories but also increase computation overhead. In this article, we propose an ensemble feature selection algorithm to determine which attribute in the given training datasets is efficient in categorizing the classes. The result of the ensemble algorithm when compared to a threshold value will enable us to decide the features. The selected features are deployed as training inputs for various classifiers to select a classifier that yields maximum accuracy. We use a multilayer perceptron classifier as the final classifier, as it provides better accuracy when compared to other conventional classification models. The proposed method classifies the new datasets into either attack or normal classes with an efficiency of 98.3% and also reduces the overall computation time. We use the CAIDA 2007 dataset to evaluate the performance of the proposed method using MATLAB and Weka 3.6 simulators.

Download Full-text

Feature selection algorithm for classification of multispectral MR images using constrained energy minimization

2010 10th International Conference on Hybrid Intelligent Systems ◽

10.1109/his.2010.5604768 ◽

2010 ◽

Cited By ~ 3

Author(s):

Geng-Cheng Lin ◽

Wen-June Wang ◽

Chuin-Mu Wang

Keyword(s):

Feature Selection ◽

Energy Minimization ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Mr Images

Download Full-text

Classification of ECG waveform using feature selection algorithm

2012 IEEE International Conference on Advanced Communication Control and Computing Technologies (ICACCCT) ◽

10.1109/icaccct.2012.6320762 ◽

2012 ◽

Cited By ~ 6

Author(s):

S. Muthulakshmi ◽

K. Latha

Keyword(s):

Feature Selection ◽

Selection Algorithm ◽

Feature Selection Algorithm

Download Full-text

Classification of ECG beats by using a fast least square support vector machines with a dynamic programming feature selection algorithm

Neural Computing and Applications ◽

10.1007/s00521-005-0466-z ◽

2005 ◽

Vol 14 (4) ◽

pp. 299-309 ◽

Cited By ~ 55

Author(s):

Nurettin Acır

Keyword(s):

Dynamic Programming ◽

Feature Selection ◽

Support Vector Machines ◽

Least Square ◽

Support Vector ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Vector Machines

Download Full-text

Multiclass classification of Parkinson’s disease using different classifiers and LLBFS feature selection algorithm

International Journal of Speech Technology ◽

10.1007/s10772-017-9401-9 ◽

2017 ◽

Vol 20 (1) ◽

pp. 179-184 ◽

Cited By ~ 6

Author(s):

Elmehdi Benmalek ◽

Jamal Elmhamdi ◽

Abdelilah Jilbab

Keyword(s):

Parkinson’S Disease ◽

Parkinson's Disease ◽

Feature Selection ◽

Multiclass Classification ◽

Selection Algorithm ◽

Feature Selection Algorithm

Download Full-text

A new optimal feature selection algorithm for classification of power quality disturbances using discrete wavelet transform and probabilistic neural network

Measurement ◽

10.1016/j.measurement.2016.10.013 ◽

2017 ◽

Vol 95 ◽

pp. 246-259 ◽

Cited By ~ 88

Author(s):

Suhail Khokhar ◽

Abdullah Asuhaimi Mohd Zin ◽

Aslam Pervez Memon ◽

Ahmad Safawi Mokhtar

Keyword(s):

Neural Network ◽

Feature Selection ◽

Probabilistic Neural Network ◽

Discrete Wavelet ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Optimal Feature Selection ◽

Power Quality Disturbances ◽

Optimal Feature

Download Full-text

Classification of Diabetes using Random Forest with Feature Selection Algorithm

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3595.119119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 1295-1300 ◽

Cited By ~ 1

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Random Forest ◽

Electronic Health Records ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Health Records

Diabetes has become a serious problem now a day. So there is a need to take serious precautions to eradicate this. To eradicate, we should know the level of occurrence. In this project we predict the level of occurrence of diabetes. We predict the level of occurrence of diabetes using Random Forest, a Machine Learning Algorithm. Using the patient’s Electronic Health Records (EHR) we can build accurate models that predict the presence of diabetes.

Download Full-text

Automatic classification of auditory brainstem responses using SVM-based feature selection algorithm for threshold detection

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2005.08.004 ◽

2006 ◽

Vol 19 (2) ◽

pp. 209-218 ◽

Cited By ~ 30

Author(s):

Nurettin Acır ◽

Özcan Özdamar ◽

Cüneyt Güzeliş

Keyword(s):

Feature Selection ◽

Automatic Classification ◽

Auditory Brainstem ◽

Auditory Brainstem Responses ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Threshold Detection

Download Full-text

Determining Threshold Value on Information Gain Feature Selection to Increase Speed and Prediction Accuracy of Random Forest

10.21203/rs.3.rs-132775/v1 ◽

2020 ◽

Author(s):

Maria Irmina Prasetiyowati ◽

Nur Ulfa Maulidevi ◽

Kridanto Surendro

Keyword(s):

Feature Selection ◽

Random Forest ◽

Information Gain ◽

Threshold Value ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Random Forest Classification ◽

Average Value ◽

Time Required

Abstract Feature selection is a preprocessing technique aims to remove the unnecessary features and speed up the algorithm's work process. One of the feature selection techniques is by calculating the information gain value of each feature in a dataset. From the information gain value obtained, then the determined threshold value will be used to make feature selection. Generally, the threshold value is used freely, or using a value of 0.05. This study proposed the determination of the threshold value using the standard deviation of the information gain value generated by each feature in the dataset. The determination of this threshold value was tested on ten original datasets and datasets that had been transformed by FFT and IFFT, then classified using Random Forest. The results of the average value of accuracy and the average time required from the Random Forest classification using the proposed threshold value are better compared to the results of feature selection with a threshold value of 0.05 and the Correlation-Base Feature Selection algorithm. Likewise, the result of the average accuracy value of the proposed threshold using a transformed dataset in terms are better than the threshold value of 0.05 and the Correlation-Base Feature Selection algorithm. However, the calculation results for the average time required are higher (slower).

Download Full-text