Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data

Nathan Martindale; Muhammad Ismail; Douglas A. Talbert

doi:10.3390/info11060315

Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data

Information ◽

10.3390/info11060315 ◽

2020 ◽

Vol 11 (6) ◽

pp. 315

Author(s):

Nathan Martindale ◽

Muhammad Ismail ◽

Douglas A. Talbert

Keyword(s):

Machine Learning ◽

Random Forest ◽

Intrusion Detection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

As new cyberattacks are launched against systems and networks on a daily basis, the ability for network intrusion detection systems to operate efficiently in the big data era has become critically important, particularly as more low-power Internet-of-Things (IoT) devices enter the market. This has motivated research in applying machine learning algorithms that can operate on streams of data, trained online or “live” on only a small amount of data kept in memory at a time, as opposed to the more classical approaches that are trained solely offline on all of the data at once. In this context, one important concept from machine learning for improving detection performance is the idea of “ensembles”, where a collection of machine learning algorithms are combined to compensate for their individual limitations and produce an overall superior algorithm. Unfortunately, existing research lacks proper performance comparison between homogeneous and heterogeneous online ensembles. Hence, this paper investigates several homogeneous and heterogeneous ensembles, proposes three novel online heterogeneous ensembles for intrusion detection, and compares their performance accuracy, run-time complexity, and response to concept drifts. Out of the proposed novel online ensembles, the heterogeneous ensemble consisting of an adaptive random forest of Hoeffding Trees combined with a Hoeffding Adaptive Tree performed the best, by dealing with concept drift in the most effective way. While this scheme is less accurate than a larger size adaptive random forest, it offered a marginally better run-time, which is beneficial for online training.

Get full-text (via PubEx)

Analyzing the Performance of Machine Learning Algorithms in Anomaly Network Intrusion Detection Systems

2018 4th International Conference on Science and Technology (ICST) ◽

10.1109/icstc.2018.8528645 ◽

2018 ◽

Cited By ~ 3

Author(s):

Pascal Maniriho ◽

Tohari Ahmad

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Get full-text (via PubEx)

Evaluating the performance of machine learning algorithms for network intrusion detection systems in the internet of things infrastructure

Journal of Advanced Computer Science & Technology ◽

10.14419/jacst.v9i1.30992 ◽

2020 ◽

Vol 9 (1) ◽

pp. 14

Author(s):

Yaser M. Banadaki

Keyword(s):

Machine Learning ◽

Internet Of Things ◽

Intrusion Detection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

As numerous Internet-of-Things (IoT) devices are deploying on a daily basis, network intrusion detection systems (NIDS) are among the most critical tools to ensure the protection and security of networks against malicious cyberattacks. This paper employs four machine learning algorithms: XGBoost, random forest, decision tree, and gradient boosting, and evaluates their performance in NIDS, considering the accuracy, precision, recall, and F-score. The comparative analysis conducted using the CICIDS2017 dataset reveals that the XGBoost performs better than the other algorithms reaching the predicted accuracy of 99.6% in detecting cyberattacks. XGBoost-based attack detectors also have the largest weighted metrics of F1-score, precision, and recall. The paper also studies the effect of class imbalance and the size of the normal and attack classes. The small numbers of some attacks in training datasets mislead the classifier to bias towards the majority classes resulting in a bottleneck to improving macro recall and macro F1 score. The results assist the network engineers in choosing the most effective machine learning-based NIDS to ensure network security for today’s growing IoT network traffic.

Get full-text (via PubEx)

Modeling Realistic Adversarial Attacks against Network Intrusion Detection Systems

Digital Threats: Research and Practice ◽

10.1145/3469659 ◽

2021 ◽

Author(s):

Giovanni Apruzzese ◽

Mauro Andreolini ◽

Luca Ferretti ◽

Mirco Marchetti ◽

Michele Colajanni

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Machine Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Learning Methods ◽

Detection Systems ◽

Machine Learning Methods ◽

Network Intrusion ◽

Network Intrusion Detection Systems

The incremental diffusion of machine learning algorithms in supporting cybersecurity is creating novel defensive opportunities but also new types of risks. Multiple researches have shown that machine learning methods are vulnerable to adversarial attacks that create tiny perturbations aimed at decreasing the effectiveness of detecting threats. We observe that existing literature assumes threat models that are inappropriate for realistic cybersecurity scenarios because they consider opponents with complete knowledge about the cyber detector or that can freely interact with the target systems. By focusing on Network Intrusion Detection Systems based on machine learning methods, we identify and model the real capabilities and circumstances that are necessary for an attacker to carry out a feasible and successful adversarial attack. We then apply our model to several adversarial attacks proposed in literature and highlight the limits and merits that can result in actual adversarial attacks. The contributions of this paper can help hardening defensive systems by letting cyber defenders address the most critical and real issues, and can benefit researchers by allowing them to devise novel forms of adversarial attacks based on realistic threat models.

Get full-text (via PubEx)

Effect of Activation Functions on the Performance of Deep Learning Algorithms for Network Intrusion Detection Systems

Proceedings of ICETIT 2019 - Lecture Notes in Electrical Engineering ◽

10.1007/978-3-030-30577-2_84 ◽

2019 ◽

pp. 949-960

Author(s):

Neha Gupta ◽

Punam Bedi ◽

Vinita Jindal

Keyword(s):

Deep Learning ◽

Intrusion Detection ◽

Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Activation Functions ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Get full-text (via PubEx)

Poisoning Attacks and Data Sanitization Mitigations for Machine Learning Models in Network Intrusion Detection Systems

10.1109/milcom52596.2021.9652916 ◽

2021 ◽

Author(s):

Sridhar Venkatesan ◽

Harshvardhan Sikka ◽

Rauf Izmailov ◽

Ritu Chadha ◽

Alina Oprea ◽

...

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Learning Models ◽

Detection Systems ◽

Data Sanitization ◽

Network Intrusion ◽

Network Intrusion Detection Systems ◽

Machine Learning Models

Get full-text (via PubEx)

On evaluation of Network Intrusion Detection Systems: Statistical analysis of CIDDS-001 dataset using Machine Learning Techniques

10.36227/techrxiv.11454276.v1 ◽

2019 ◽

Author(s):

Abhishek Verma ◽

Virender Ranga

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Machine Learning Techniques ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Learning Techniques ◽

Network Intrusion Detection Systems ◽

Secure Networks

In the era of digital revolution, a huge amount of data is being generated from different networks on a daily basis. Security of this data is of utmost importance. Intrusion Detection Systems are found to be one the best solutions towards detecting intrusions. Network Intrusion Detection Systems are employed as a defence system to secure networks. Various techniques for the effective development of these defence systems have been proposed in the literature. However, the research on the development of datasets used for training and testing purpose of such defence systems is equally concerned. Better datasets improve the online and offline intrusion detection capability of detection model. Benchmark datasets like KDD 99 and NSL-KDD cup 99 obsolete and do not contain network traces of modern attacks like Denial of Service, hence are unsuitable for the evaluation purpose. In this work, a detailed analysis of CIDDS-001 dataset has been done and presented. We have used different well-known machine learning techniques for analysing the complexity of the dataset. Eminent evaluation metrics including Detection Rate, Accuracy, False Positive Rate, Kappa statistics, Root mean squared error have been used to show the performance of employed machine learning techniques.

Get full-text (via PubEx)

Statistical analysis of CIDDS-001 dataset for Network Intrusion Detection Systems using Distance-based Machine Learning

Procedia Computer Science ◽

10.1016/j.procs.2017.12.091 ◽

2018 ◽

Vol 125 ◽

pp. 709-716 ◽

Cited By ~ 27

Author(s):

Abhishek Verma ◽

Virender Ranga

Keyword(s):

Machine Learning ◽

Statistical Analysis ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Get full-text (via PubEx)

Network intrusion detection using oversampling technique and machine learning algorithms

PeerJ Computer Science ◽

10.7717/peerj-cs.820 ◽

2022 ◽

Vol 8 ◽

pp. e820

Author(s):

Hafiza Anisa Ahmed ◽

Anum Hameed ◽

Narmeen Zakaria Bawany

Keyword(s):

Machine Learning ◽

Random Forest ◽

Network Security ◽

Intrusion Detection ◽

Network Traffic ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Network Intrusion Detection ◽

Classification Models ◽

Network Intrusion

The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrusion Detection System (NIDS) is used to effectively detect various attacks, thus providing timely protection to network resources from these attacks. To implement NIDS, a stream of supervised and unsupervised machine learning approaches is applied to detect irregularities in network traffic and to address network security issues. Such NIDSs are trained using various datasets that include attack traces. However, due to the advancement in modern-day attacks, these systems are unable to detect the emerging threats. Therefore, NIDS needs to be trained and developed with a modern comprehensive dataset which contains contemporary common and attack activities. This paper presents a framework in which different machine learning classification schemes are employed to detect various types of network attack categories. Five machine learning algorithms: Random Forest, Decision Tree, Logistic Regression, K-Nearest Neighbors and Artificial Neural Networks, are used for attack detection. This study uses a dataset published by the University of New South Wales (UNSW-NB15), a relatively new dataset that contains a large amount of network traffic data with nine categories of network attacks. The results show that the classification models achieved the highest accuracy of 89.29% by applying the Random Forest algorithm. Further improvement in the accuracy of classification models is observed when Synthetic Minority Oversampling Technique (SMOTE) is applied to address the class imbalance problem. After applying the SMOTE, the Random Forest classifier showed an accuracy of 95.1% with 24 selected features from the Principal Component Analysis method.

Get full-text (via PubEx)

On evaluation of Network Intrusion Detection Systems: Statistical analysis of CIDDS-001 dataset using Machine Learning Techniques

10.36227/techrxiv.11454276 ◽

2019 ◽

Author(s):

Abhishek Verma ◽

Virender Ranga

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Machine Learning Techniques ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Learning Techniques ◽

Network Intrusion Detection Systems ◽

Secure Networks

Get full-text (via PubEx)

Evading Machine Learning Based Network Intrusion Detection Systems with GANs

10.1002/9781119723950.ch17 ◽

2021 ◽

pp. 335-356

Author(s):

Bolor‐Erdene Zolbayar ◽

Ryan Sheatsley ◽

Patrick McDaniel ◽

Mike Weisman

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Get full-text (via PubEx)