Feature Selection for Machine Learning-Based Early Detection of Distributed Cyber Attacks

Abstract. The climate models are extremely complex pieces of software. They reflect best knowledge on physical components of the climate, nevertheless, they contain several parameters, which are too weakly constrained by observations, and can potentially lead to a crash of simulation. Recently a study by Lucas et al. (2013) has shown that machine learning methods can be used for predicting which combinations of parameters can lead to crash of simulation, and hence which processes described by these parameters need refined analyses. In the current study we reanalyse the dataset used in this research using different methodology. We confirm the main conclusion of the original study concerning suitability of machine learning for prediction of crashes. We show, that only three of the eight parameters indicated in the original study as relevant for prediction of the crash are indeed strongly relevant, three other are relevant but redundant, and two are not relevant at all. We also show that the variance due to split of data between training and validation sets has large influence both on accuracy of predictions and relative importance of variables, hence only cross-validated approach can deliver robust prediction of performance and relevance of variables.

Download Full-text

Early Detection of the Alzheimer’s Disease: A Novel Cognitive Feature Selection Approach Using Machine Learning

Advances in Information, Communication and Cybersecurity - Lecture Notes in Networks and Systems ◽

10.1007/978-3-030-91738-8_35 ◽

2022 ◽

pp. 383-392

Author(s):

Muhammad Irfan ◽

Seyed Shahrestani ◽

Mahmoud Elkhodr

Keyword(s):

Machine Learning ◽

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Feature Selection ◽

Early Detection ◽

Selection Approach ◽

Feature Selection Approach ◽

Cognitive Feature

Download Full-text

A novel machine learning based feature selection for motor imagery EEG signal classification in Internet of medical things environment

Future Generation Computer Systems ◽

10.1016/j.future.2019.01.048 ◽

2019 ◽

Vol 98 ◽

pp. 419-434 ◽

Cited By ~ 15

Author(s):

Rajdeep Chatterjee ◽

Tanmoy Maitra ◽

SK Hafizul Islam ◽

Mohammad Mehedi Hassan ◽

Atif Alamri ◽

...

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Motor Imagery ◽

Signal Classification ◽

Eeg Signal ◽

Internet Of Medical Things ◽

Eeg Signal Classification ◽

Selection For

Download Full-text

Towards a Lightweight Detection System for Cyber Attacks in the IoT Environment Using Corresponding Features

Electronics ◽

10.3390/electronics9010144 ◽

2020 ◽

Vol 9 (1) ◽

pp. 144 ◽

Cited By ~ 6

Author(s):

Yan Naung Soe ◽

Yaokai Feng ◽

Paulus Insap Santosa ◽

Rudy Hartanto ◽

Kouichi Sakurai

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Detection System ◽

Detection Performance ◽

Cyber Attacks ◽

Raspberry Pi ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

New Feature ◽

Iot Devices

The application of a large number of Internet of Things (IoT) devices makes our life more convenient and industries more efficient. However, it also makes cyber-attacks much easier to occur because so many IoT devices are deployed and most of them do not have enough resources (i.e., computation and storage capacity) to carry out ordinary intrusion detection systems (IDSs). In this study, a lightweight machine learning-based IDS using a new feature selection algorithm is designed and implemented on Raspberry Pi, and its performance is verified using a public dataset collected from an IoT environment. To make the system lightweight, we propose a new algorithm for feature selection, called the correlated-set thresholding on gain-ratio (CST-GR) algorithm, to select really necessary features. Because the feature selection is conducted on three specific kinds of cyber-attacks, the number of selected features can be significantly reduced, which makes the classifiers very small and fast. Thus, our detection system is lightweight enough to be implemented and carried out in a Raspberry Pi system. More importantly, as the really necessary features corresponding to each kind of attack are exploited, good detection performance can be expected. The performance of our proposal is examined in detail with different machine learning algorithms, in order to learn which of them is the best option for our system. The experiment results indicate that the new feature selection algorithm can select only very few features for each kind of attack. Thus, the detection system is lightweight enough to be implemented in the Raspberry Pi environment with almost no sacrifice on detection performance.

Download Full-text

Assessment of feature selection for student academic performance through machine learning classification

Journal of Statistics and Management Systems ◽

10.1080/09720510.2019.1609729 ◽

2019 ◽

Vol 22 (4) ◽

pp. 729-739 ◽

Cited By ~ 1

Author(s):

R. Suguna ◽

M. Shyamala Devi ◽

Rupali Amit Bagate ◽

Aparna Shashikant Joshi

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Academic Performance ◽

Machine Learning Classification ◽

Student Academic Performance ◽

Selection For

Download Full-text

Feature selection for an automated ancient Tamil script classification system using machine learning techniques

2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET) ◽

10.1109/icammaet.2017.8186731 ◽

2017 ◽

Cited By ~ 2

Author(s):

T S Suganya ◽

S Murugavalli

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Classification System ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Selection For ◽

Tamil Script

Download Full-text

Machine learning methods in the computational biology of cancer

Proceedings of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rspa.2014.0081 ◽

2014 ◽

Vol 470 (2167) ◽

pp. 20140081 ◽

Cited By ~ 15

Author(s):

M. Vidyasagar

Keyword(s):

Machine Learning ◽

Ovarian Cancer ◽

Feature Selection ◽

Classification Problems ◽

Open Problems ◽

Machine Learning Methods ◽

Selection For ◽

Personalized Cancer Therapy ◽

Personalized Cancer ◽

Sparse Feature Selection

The objectives of this Perspective paper are to review some recent advances in sparse feature selection for regression and classification, as well as compressed sensing, and to discuss how these might be used to develop tools to advance personalized cancer therapy. As an illustration of the possibilities, a new algorithm for sparse regression is presented and is applied to predict the time to tumour recurrence in ovarian cancer. A new algorithm for sparse feature selection in classification problems is presented, and its validation in endometrial cancer is briefly discussed. Some open problems are also presented.

Download Full-text