Enhancing Malicious Activity Classification of IoT Network Traffic Characteristics using Stacked Ensemble Learning

Abstract Background Eligibility criteria are the primary strategy for screening the target participants of a clinical trial. Automated classification of clinical trial eligibility criteria text by using machine learning methods improves recruitment efficiency to reduce the cost of clinical research. However, existing methods suffer from poor classification performance due to the complexity and imbalance of eligibility criteria text data. Methods An ensemble learning-based model with metric learning is proposed for eligibility criteria classification. The model integrates a set of pre-trained models including Bidirectional Encoder Representations from Transformers (BERT), A Robustly Optimized BERT Pretraining Approach (RoBERTa), XLNet, Pre-training Text Encoders as Discriminators Rather Than Generators (ELECTRA), and Enhanced Representation through Knowledge Integration (ERNIE). Focal Loss is used as a loss function to address the data imbalance problem. Metric learning is employed to train the embedding of each base model for feature distinguish. Soft Voting is applied to achieve final classification of the ensemble model. The dataset is from the standard evaluation task 3 of 5th China Health Information Processing Conference containing 38,341 eligibility criteria text in 44 categories. Results Our ensemble method had an accuracy of 0.8497, a precision of 0.8229, and a recall of 0.8216 on the dataset. The macro F1-score was 0.8169, outperforming state-of-the-art baseline methods by 0.84% improvement on average. In addition, the performance improvement had a p-value of 2.152e-07 with a standard t-test, indicating that our model achieved a significant improvement. Conclusions A model for classifying eligibility criteria text of clinical trials based on multi-model ensemble learning and metric learning was proposed. The experiments demonstrated that the classification performance was improved by our ensemble model significantly. In addition, metric learning was able to improve word embedding representation and the focal loss reduced the impact of data imbalance to model performance.

Download Full-text

Classification of FPGA Based Network Traffic Using Machine Learning

Journal of Physics Conference Series ◽

10.1088/1742-6596/1964/6/062008 ◽

2021 ◽

Vol 1964 (6) ◽

pp. 062008

Author(s):

K Gunasekaran ◽

Radhika Baskar ◽

R Dhanagopal ◽

K Elangovan

Keyword(s):

Machine Learning ◽

Network Traffic

Download Full-text

Improving DDoS Detection in IoT Networks Through Analysis of Network Traffic Characteristics

2020 IEEE Latin-American Conference on Communications (LATINCOM) ◽

10.1109/latincom50620.2020.9282265 ◽

2020 ◽

Author(s):

Wanderson L. Costa ◽

Matheus M. Silveira ◽

Thelmo de Araujo ◽

Rafael L. Gomes

Keyword(s):

Network Traffic ◽

Traffic Characteristics ◽

Ddos Detection

Download Full-text

Parametric Convolutional Neural Network for Radar-based Human Activity Classification Using Raw ADC Data

10.36227/techrxiv.12896108 ◽

2020 ◽

Author(s):

Thomas Stadelmayer ◽

Avik Santra

Keyword(s):

Neural Network ◽

Human Activity ◽

Recognition System ◽

Radar Data ◽

Fast Time ◽

Activity Classification ◽

Radar Sensors ◽

Equivalent State ◽

Machine Interface

Radar sensors offer a promising and effective sensing modality for human activity classification. Human activity classification enables several smart homes applications for energy saving, human-machine interface for gesture controlled appliances and elderly fall-motion recognition. Present radar-based activity recognition system exploit micro-Doppler signature by generating Doppler spectrograms or video of range-Doppler images (RDIs), followed by deep neural network or machine learning for classification. Although, deep convolutional neural networks (DCNN) have been shown to implicitly learn features from raw sensor data in other fields, such as camera and speech, yet for the case of radar DCNN preprocessing followed by feature image generation, such as video of RDI or Doppler spectrogram, is required to develop a scalable and robust classification or regression application. In this paper, we propose a parametric convolutional neural network that mimics the radar preprocessing across fast-time and slow-time radar data through 2D sinc filter or 2D wavelet filter kernels to extract features for classification of various human activities. It is demonstrated that our proposed solution shows improved results compared to equivalent state-of-art DCNN solutions that rely on Doppler spectrogram or video of RDIs as feature images.

Download Full-text

Ensemble learning based classifier to predict depression caused due to pandemic

Journal of Physics Conference Series ◽

10.1088/1742-6596/2089/1/012026 ◽

2021 ◽

Vol 2089 (1) ◽

pp. 012026

Author(s):

P Vaishali ◽

P L S Kumari

Keyword(s):

Neural Network ◽

Ensemble Learning ◽

Text Classification ◽

Virus Disease ◽

Model Accuracy ◽

Research Article ◽

The World ◽

Social Gathering ◽

Second Wave

Abstract Pandemic caused due to Corona Virus Disease 2019 (COVID-19) affected each and every person life throughout the world. First wave of COVID-19 followed by second wave made situation more panic. Government declared Lockdown imposed strict prohibition on social gathering, unnecessary outing, travelling, and education. During home quarantine, people shared opinion, expressed views, feelings on social media. Home isolation and quarantine affected mental health of people which may lead to depression. Hence in this research article depression is predicted by implementing Neural Network based model. At first level this model implements text classification of COVID-19 based Tweets. Neural network model accuracy is 86.85%. In next level, using same tweet dataset as input, Ensemble learning based model is constructed. This model uses one of the boosting techniques known as Adaboost. Model is executed by varying Train-test-validation ratio. It is observed that accuracy of the model is improved. The model showed accuracy of 99.33 % successfully in every execution. Obtained results are compared with previous work in same area.

Download Full-text

Classifying IoT Devices in Smart Environments Using Network Traffic Characteristics

IEEE Transactions on Mobile Computing ◽

10.1109/tmc.2018.2866249 ◽

2019 ◽

Vol 18 (8) ◽

pp. 1745-1759 ◽

Cited By ~ 47

Author(s):

Arunan Sivanathan ◽

Hassan Habibi Gharakheili ◽

Franco Loi ◽

Adam Radford ◽

Chamith Wijenayake ◽

...

Keyword(s):

Network Traffic ◽

Smart Environments ◽

Traffic Characteristics ◽

Iot Devices

Download Full-text

Economic activity classification of third or social economy sector institutions and volunteer work

Satellite Account on Nonprofit and Related Institutions and Volunteer Work - Studies in Methods (Ser. F) ◽

10.18356/398d07fb-en ◽

2020 ◽

pp. 65-80

Author(s):

Keyword(s):

Economic Activity ◽

Social Economy ◽

Volunteer Work ◽

Activity Classification

Download Full-text

An Ensemble Learning Approach for Automatic Emotion Classification of Sri Lankan Folk Music

Lecture Notes in Networks and Systems - Proceedings of International Conference on Sustainable Expert Systems ◽

10.1007/978-981-33-4355-9_23 ◽

2021 ◽

pp. 289-302

Author(s):

Joseph Charles ◽

Sugeeswari Lekamge

Keyword(s):

Ensemble Learning ◽

Folk Music ◽

Learning Approach ◽

Sri Lankan ◽

Emotion Classification

Download Full-text

Ensemble Deep Learning for Multilabel Binary Classification of User-Generated Content

Algorithms ◽

10.3390/a13040083 ◽

2020 ◽

Vol 13 (4) ◽

pp. 83 ◽

Cited By ~ 5

Author(s):

Giannis Haralabopoulos ◽

Ioannis Anagnostopoulos ◽

Derek McAuley

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Ensemble Learning ◽

Affective Computing ◽

Binary Classification ◽

User Generated Content ◽

Emotional Information ◽

Multiple Classifiers ◽

Different Levels

Sentiment analysis usually refers to the analysis of human-generated content via a polarity filter. Affective computing deals with the exact emotions conveyed through information. Emotional information most frequently cannot be accurately described by a single emotion class. Multilabel classifiers can categorize human-generated content in multiple emotional classes. Ensemble learning can improve the statistical, computational and representation aspects of such classifiers. We present a baseline stacked ensemble and propose a weighted ensemble. Our proposed weighted ensemble can use multiple classifiers to improve classification results without hyperparameter tuning or data overfitting. We evaluate our ensemble models with two datasets. The first dataset is from Semeval2018-Task 1 and contains almost 7000 Tweets, labeled with 11 sentiment classes. The second dataset is the Toxic Comment Dataset with more than 150,000 comments, labeled with six different levels of abuse or harassment. Our results suggest that ensemble learning improves classification results by 1.5 % to 5.4 % .

Download Full-text