scholarly journals Evaluation of Selected Stacked Ensemble Models for the Optimal Multi-class Cyber-Attacks Detection

2021 ◽  
Vol 5 (1) ◽  
pp. 26-48
Author(s):  
Olasehinde Olayemi Oladimeji ◽  
Alese Boniface Kayode ◽  
Adetunmbi Adebayo Olusola ◽  
Aladesote Olomi Isaiah

The significant rise in the frequency and sophistication of cyber-attacks and their diversity necessitated various researchers to develop strong and effective approaches to address recurring cyber threat challenges. This study evaluated the performance of three selected meta-learning models for optimal multi-class detection of cyber-attacks using the University of New South Wales 2015 Network benchmark (UNSW-NB15) Intrusion Dataset. The results of this study show and confirm the ability of the three base models; Naive Bayes, C4.5 Decision Tree, and K-Nearest Neighbor for solving multi-class problems. It further affirms the knack of the duo of feature selection techniques and stacked ensemble learning to optimize ML models' performances. The stacking of the predictions of the information gain base models with Model Decision Tree meta-algorithm recorded the most improved and optimal cyber-attacks detection accuracy and Mattew's correlation Coefficient than the stacking with the Multiple Model Trees (MMT) and Multi Response Linear regression (MLR) Meta algorithms.

2020 ◽  
Vol 17 (1) ◽  
pp. 319-328
Author(s):  
Ade Muchlis Maulana Anwar ◽  
Prihastuti Harsani ◽  
Aries Maesya

Population Data is individual data or aggregate data that is structured as a result of Population Registration and Civil Registration activities. Birth Certificate is a Civil Registration Deed as a result of recording the birth event of a baby whose birth is reported to be registered on the Family Card and given a Population Identification Number (NIK) as a basis for obtaining other community services. From the total number of integrated birth certificate reporting for the 2018 Population Administration Information System (SIAK) totaling 570,637 there were 503,946 reported late and only 66,691 were reported publicly. Clustering is a method used to classify data that is similar to others in one group or similar data to other groups. K-Nearest Neighbor is a method for classifying objects based on learning data that is the closest distance to the test data. k-means is a method used to divide a number of objects into groups based on existing categories by looking at the midpoint. In data mining preprocesses, data is cleaned by filling in the blank data with the most dominating data, and selecting attributes using the information gain method. Based on the k-nearest neighbor method to predict delays in reporting and the k-means method to classify priority areas of service with 10,000 birth certificate data on birth certificates in 2019 that have good enough performance to produce predictions with an accuracy of 74.00% and with K = 2 on k-means produces a index davies bouldin of 1,179.


Author(s):  
M. Ilayaraja ◽  
S. Hemalatha ◽  
P. Manickam ◽  
K. Sathesh Kumar ◽  
K. Shankar

Cloud computing is characterized as the arrangement of assets or administrations accessible through the web to the clients on their request by cloud providers. It communicates everything as administrations over the web in view of the client request, for example operating system, organize equipment, storage, assets, and software. Nowadays, Intrusion Detection System (IDS) plays a powerful system, which deals with the influence of experts to get actions when the system is hacked under some intrusions. Most intrusion detection frameworks are created in light of machine learning strategies. Since the datasets, this utilized as a part of intrusion detection is Knowledge Discovery in Database (KDD). In this paper detect or classify the intruded data utilizing Machine Learning (ML) with the MapReduce model. The primary face considers Hadoop MapReduce model to reduce the extent of database ideal weight decided for reducer model and second stage utilizing Decision Tree (DT) classifier to detect the data. This DT classifier comprises utilizing an appropriate classifier to decide the class labels for the non-homogeneous leaf nodes. The decision tree fragment gives a coarse section profile while the leaf level classifier can give data about the qualities that influence the label inside a portion. From the proposed result accuracy for detection is 96.21% contrasted with existing classifiers, for example, Neural Network (NN), Naive Bayes (NB) and K Nearest Neighbor (KNN).


2020 ◽  
Author(s):  
Hoda Heidari ◽  
Zahra Einalou ◽  
Mehrdad Dadgostar ◽  
Hamidreza Hosseinzadeh

Abstract Most of the studies in the field of Brain-Computer Interface (BCI) based on electroencephalography have a wide range of applications. Extracting Steady State Visual Evoked Potential (SSVEP) is regarded as one of the most useful tools in BCI systems. In this study, different methods such as feature extraction with different spectral methods (Shannon entropy, skewness, kurtosis, mean, variance) (bank of filters, narrow-bank IIR filters, and wavelet transform magnitude), feature selection performed by various methods (decision tree, principle component analysis (PCA), t-test, Wilcoxon, Receiver operating characteristic (ROC)), and classification step applying k nearest neighbor (k-NN), perceptron, support vector machines (SVM), Bayesian, multiple layer perceptron (MLP) were compared from the whole stream of signal processing. Through combining such methods, the effective overview of the study indicated the accuracy of classical methods. In addition, the present study relied on a rather new feature selection described by decision tree and PCA, which is used for the BCI-SSVEP systems. Finally, the obtained accuracies were calculated based on the four recorded frequencies representing four directions including right, left, up, and down.


Author(s):  
Abdaoui Noura ◽  
Ismahène Hadj Khalifa ◽  
Sami Faiz

In the concept of internet of things (IOT), physical position of smart object is very useful for relevant function over sensor networks. However, the invalid information of indoor geo-localization systems relative to these wireless sensor compromises the intelligence of IOT network. Therefore, this chapter produces the recent progress in the indoor geo-localization systems and the IOTs area. It defines the best indoor geo-localization technologies that meet their needs while respecting the constraints related to sensor networks. This framework combines between simplicity of Bluetooth low energy (BLE), popular wi-fi infrastructure, and the k-nearest neighbor (KNN) algorithm (in order to filter the initial fingerprint dataset). This new conception increases real-time detection accuracy and guarantees the low energy consumption.


2018 ◽  
Vol 246 ◽  
pp. 03007
Author(s):  
Fei He ◽  
Geyi Zhou ◽  
Xinyi He ◽  
Heng Yin ◽  
Ling He

Pharyngeal fricative occurs during the production of consonants, which makes the consonants lose or weaken in cleft palate speech. In clinical application, the automatic detection of pharyngeal fricative in cleft palate speech could provide objective and effective assistant aids for speech language pathologists. In this paper, a novel acoustic parameter is proposed to detect the existence of pharyngeal fricative in cleft palate speech. This proposed acoustic feature ICPD (Independent Consonant Prominent Distribution) reflects the movement of mouth and tongue. The experimental results show that normal fricative has the higher ICPD. The extracted ICPD feature is combined with k-nearest neighbor classifier to achieve the automatic detection of pharyngeal fricative. The proposed system is tested on 127 speech samples recorded by cleft palate patients and 94 by normal speakers of controls. The overall pharyngeal fricative detection accuracy is around 90%.


Author(s):  
Kiran Marri ◽  
Ramakrishnan Swaminathan

Muscle fatigue is a neuromuscular condition experienced during daily activities. This phenomenon is generally characterized using surface electromyography (sEMG) signals and has gained a lot of interest in the fields of clinical rehabilitation, prosthetics control, and sports medicine. sEMG signals are complex, nonstationary and also exhibit self-similarity fractal characteristics. In this work, an attempt has been made to differentiate sEMG signals in nonfatigue and fatigue conditions during dynamic contraction using multifractal analysis. sEMG signals are recorded from biceps brachii muscles of 42 healthy adult volunteers while performing curl exercise. The signals are preprocessed and segmented into nonfatigue and fatigue conditions using the first and last curls, respectively. The multifractal detrended moving average algorithm (MFDMA) is applied to both segments, and multifractal singularity spectrum (SSM) function is derived. Five conventional features are extracted from the singularity spectrum. Twenty-five new features are proposed for analyzing muscle fatigue from the multifractal spectrum. These proposed features are adopted from analysis of sEMG signals and muscle fatigue studies performed in time and frequency domain. These proposed 25 feature sets are compared with conventional five features using feature selection methods such as Wilcoxon rank sum, information gain (IG) and genetic algorithm (GA) techniques. Two classification algorithms, namely, k-nearest neighbor (k-NN) and logistic regression (LR), are explored for differentiating muscle fatigue. The results show that about 60% of the proposed features are statistically highly significant and suitable for muscle fatigue analysis. The results also show that eight proposed features ranked among the top 10 features. The classification accuracy with conventional features in dynamic contraction is 75%. This accuracy improved to 88% with k-NN-GA combination with proposed new feature set. Based on the results, it appears that the multifractal spectrum analysis with new singularity features can be used for clinical evaluation in varied neuromuscular conditions, and the proposed features can also be useful in analyzing other physiological time series.


Sign in / Sign up

Export Citation Format

Share Document