Evaluation of K-Means Clustering for Effective Intrusion Detection and Prevention in Massive Network Traffic Data

In this chapter we will focus on examining computer network traffic and data. A computer network combines a set of computers and physically and logically connects them together to exchange information. Network traffic acquired from a network system provides information on data communications within the network and between networks or individual computers. The most common data types are log data, such as Kerberos logs, transmission control protocol/Internet protocol (TCP/IP) logs, Central processing unit (CPU) usage data, event logs, user command data, Internet visit data, operating system audit trail data, intrusion detection and prevention service (IDS/IPS) logs, Netflow1 data, and the simple network management protocol (SNMP) reporting data. Such information is unique and valuable for network security, specifically for intrusion detection and prevention. Although we have already presented some essential challenges in collecting such data in Chapter I, we will discuss traffic data, as well as other related data, in greater detail in this chapter. Specifically, we will describe system-specific and user-specific data types in Sections System- Specific Data and User-Specific Data, respectively, and provide detailed information on publicly available data in Section Publicly Available Data.

Download Full-text

MOVICAB-IDS: Visual Analysis of Network Traffic Data Streams for Intrusion Detection

Intelligent Data Engineering and Automated Learning – IDEAL 2006 - Lecture Notes in Computer Science ◽

10.1007/11875581_169 ◽

2006 ◽

pp. 1424-1433 ◽

Cited By ~ 6

Author(s):

Álvaro Herrero ◽

Emilio Corchado ◽

José Manuel Sáiz

Keyword(s):

Intrusion Detection ◽

Data Streams ◽

Network Traffic ◽

Visual Analysis ◽

Traffic Data

Download Full-text

THE STATISTICAL ANALYSIS OF A NETWORK TRAFFIC FOR THE INTRUSION DETECTION AND PREVENTION SYSTEMS

Telecommunications and Radio Engineering ◽

10.1615/telecomradeng.v74.i1.60 ◽

2015 ◽

Vol 74 (1) ◽

pp. 61-78 ◽

Cited By ~ 32

Author(s):

A.A. Kuznetsov ◽

A.A. Smirnov ◽

D.A. Danilenko ◽

A. Berezovsky

Keyword(s):

Statistical Analysis ◽

Intrusion Detection ◽

Network Traffic ◽

Intrusion Detection And Prevention

Download Full-text

Processing and Analytics of Big Network Traffic Data for Intrusion Detection

10.1109/telsiks52058.2021.9606353 ◽

2021 ◽

Author(s):

Nikola Ilic ◽

Jana Zdravkovic ◽

Dragan Stojanovic

Keyword(s):

Intrusion Detection ◽

Network Traffic ◽

Traffic Data

Download Full-text

Implementation of an Intrusion Detection and Prevention System Module for Corporate Network Traffic Management

2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE) ◽

10.1109/apeie.2018.8545042 ◽

2018 ◽

Author(s):

Evgeny A. Basinya ◽

Yuliya K. Ravtovich

Keyword(s):

Intrusion Detection ◽

Network Traffic ◽

Traffic Management ◽

Prevention System ◽

Corporate Network ◽

System Module ◽

Intrusion Detection And Prevention

Download Full-text

Neural visualization of network traffic data for intrusion detection

Applied Soft Computing ◽

10.1016/j.asoc.2010.07.002 ◽

2011 ◽

Vol 11 (2) ◽

pp. 2042-2056 ◽

Cited By ~ 120

Author(s):

Emilio Corchado ◽

Álvaro Herrero

Keyword(s):

Intrusion Detection ◽

Network Traffic ◽

Traffic Data

Download Full-text

A Network Intrusion Detection System for Concept Drifting Network Traffic Data

10.1007/978-3-030-88942-5_9 ◽

2021 ◽

pp. 111-121

Author(s):

Giuseppina Andresini ◽

Annalisa Appice ◽

Corrado Loglisci ◽

Vincenzo Belvedere ◽

Domenico Redavid ◽

...

Keyword(s):

Intrusion Detection ◽

Network Traffic ◽

Intrusion Detection System ◽

Detection System ◽

Network Intrusion Detection ◽

Traffic Data ◽

Network Intrusion ◽

Network Intrusion Detection System

Download Full-text

A comparative simulation of normalization methods for machine learning-based intrusion detection systems using KDD Cup’99 dataset

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211191 ◽

2021 ◽

pp. 1-18

Author(s):

Satish Kumar ◽

Sunanda Gupta ◽

Sakshi Arora

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Network Traffic ◽

Intrusion Detection Systems ◽

High Dimensional ◽

Traffic Data ◽

Detection Systems ◽

Dimensional Network ◽

Normalization Methods ◽

Kdd Cup 99

Network Intrusion detection systems (NIDS) detect malicious and intrusive information in computer networks. Presently, commercial NIDS is based on machine learning approaches that have complex algorithms and increase intrusion detection efficiency and efficacy. These machine learning-based NIDS use high dimensional network traffic data from which intrusive information is to be detected. This high-dimensional network traffic data in NIDS needs to be preprocessed and normalized to make it suitable for machine learning tools. A machine learning approach with appropriate normalization and prepossessing increases NIDS performance. This paper presents an empirical study on various normalization methods implemented on a benchmark network traffic dataset, KDD Cup’99, that has been used to evaluate the NIDS model. The present study shows decimal normalization has a better prediction performance than non-normalized traffic data categorized into ‘normal’ or ‘intrusive’ classes.

Download Full-text

Evaluation

Statistical Techniques for Network Security ◽

10.4018/978-1-59904-708-9.ch012 ◽

2011 ◽

pp. 427-457

Author(s):

Yu Wang

Keyword(s):

Intrusion Detection ◽

Network Traffic ◽

Goodness Of Fit ◽

False Negative ◽

Data Consistency ◽

Misclassification Rate ◽

Traffic Data ◽

Research Areas ◽

Need To Evaluate ◽

Or Economics

Increasing the accuracy of classification has been a constant challenge in the network security area. While expansively increasing in the volume of network traffic and advantage in network bandwidth, many classification algorithms used for intrusion detection and prevention face high false positive and false negative rates. A stream of network traffic data with many positive predictors might not necessary represent a true attack, and a seemingly anomaly-free stream could represent a novel attack. Depending on the infrastructure of a network system, traffic data can become very large. As a result of such large volumes of data, a very low misclassification rate can yield a large number of alarms; for example, a system with 22 million hourly traffics with a 1% misclassification rate could have approximately 75 alarms within a second (excluding repeated connections). Validating every such case for review is not practical. To address this challenge we can improve the data collection process and develop more robust algorithms. Unlike other research areas, such as the life sciences, healthcare, or economics, where an analysis can be achieved based on a single statistical approach, a robust intrusion detection scheme need to be constructed hierarchically with multiple algorithms. For example, profiling and classifying user behavior hierarchically, using hybrid algorithms (e.g., combining statistics and AI). On the other hand, we can improve the precision of classification by carefully evaluating the results. There are several key elements that are important for statistical evaluation in classification and prediction, such as reliability, sensitivity, specificity, misclassification, and goodness-of-fit. We also need to evaluate the goodness of the data (consistency and repeatability), goodness of the classification, and goodness of the model. We will discuss these topics in this chapter.

Download Full-text

Intrusion Detection System Modeling Based on Learning from Network Traffic Data

KSII Transactions on Internet and Information Systems ◽

10.3837/tiis.2018.11.022 ◽

2018 ◽

Vol 12 (11) ◽

Keyword(s):

Intrusion Detection ◽

Network Traffic ◽

Intrusion Detection System ◽

Detection System ◽

System Modeling ◽

Traffic Data

Download Full-text