Feature selection and intrusion classification in NSL-KDD cup 99 dataset employing SVMs

Author(s):  
Muhammad Shakil Pervez ◽  
Dewan Md. Farid
Keyword(s):  
2011 ◽  
Vol 38 (5) ◽  
pp. 5947-5957 ◽  
Author(s):  
V. Bolón-Canedo ◽  
N. Sánchez-Maroño ◽  
A. Alonso-Betanzos

Author(s):  
Hai Thanh Nguyen ◽  
Katrin Franke ◽  
Slobodan Petrovic

In this paper, the authors propose a new feature selection procedure for intrusion detection, which is based on filter method used in machine learning. They focus on Correlation Feature Selection (CFS) and transform the problem of feature selection by means of CFS measure into a mixed 0-1 linear programming problem with a number of constraints and variables that is linear in the number of full set features. The mixed 0-1 linear programming problem can then be solved by using branch-and-bound algorithm. This feature selection algorithm was compared experimentally with the best-first-CFS and the genetic-algorithm-CFS methods regarding the feature selection capabilities. Classification accuracies obtained after the feature selection by means of the C4.5 and the BayesNet over the KDD CUP’99 dataset were also tested. Experiments show that the authors’ method outperforms the best-first-CFS and the genetic-algorithm-CFS methods by removing much more redundant features while keeping the classification accuracies or getting better performances.


2019 ◽  
Vol 13 (3) ◽  
pp. 31-47 ◽  
Author(s):  
Alok Kumar Shukla ◽  
Pradeep Singh

The duplicate and insignificant features present in the data set to cause a long-term problem in the classification of network or web traffic. The insignificant features not only decrease the classification performance but also prevent a classifier from making accurate decisions, exclusively when substantial volumes of data are managed. In this article, the author introduced an ensemble feature selection (EFS) technique, where multiple homogeneous feature selection (FS) methods are combined to choose the optimal subset of relevant and non-redundant features. An intrusion detection system, named support vector machine-based IDS (SVM-IDS), is prompted using the feature selected by the proposed method. The SVM-IDS performance is evaluated using two benchmark datasets of intrusion detection, including KDD Cup 99 and NSL-KDD. Our proposed method provided more significant features for SVM-IDS and compared with the other state-of-the-art methods. The experimental results demonstrate that proposed method achieves a maximum accuracy as 98.95% in KDD Cup 99 data set and 98.12% in the NSL-KDD data set.


Author(s):  
Iago Porto-Díaz ◽  
David Martínez-Rego ◽  
Amparo Alonso-Betanzos ◽  
Oscar Fontenla-Romero
Keyword(s):  

Author(s):  
V. Bolón-Canedo ◽  
N. Sánchez-Maroño ◽  
A. Alonso-Betanzos ◽  
E. Hernández-Pereira
Keyword(s):  

Author(s):  
Lindsey M. Kitchell ◽  
Francisco J. Parada ◽  
Brandi L. Emerick ◽  
Tom A. Busey

2012 ◽  
Vol 19 (2) ◽  
pp. 97-111 ◽  
Author(s):  
Muhammad Ahmad ◽  
Syungyoung Lee ◽  
Ihsan Ul Haq ◽  
Qaisar Mushtaq

Sign in / Sign up

Export Citation Format

Share Document