Lightweight Anomaly Detection Scheme Using Incremental Principal Component Analysis and Support Vector Machine

Wireless Sensors Networks have been the focus of significant attention from research and development due to their applications of collecting data from various fields such as smart cities, power grids, transportation systems, medical sectors, military, and rural areas. Accurate and reliable measurements for insightful data analysis and decision-making are the ultimate goals of sensor networks for critical domains. However, the raw data collected by WSNs usually are not reliable and inaccurate due to the imperfect nature of WSNs. Identifying misbehaviours or anomalies in the network is important for providing reliable and secure functioning of the network. However, due to resource constraints, a lightweight detection scheme is a major design challenge in sensor networks. This paper aims at designing and developing a lightweight anomaly detection scheme to improve efficiency in terms of reducing the computational complexity and communication and improving memory utilization overhead while maintaining high accuracy. To achieve this aim, one-class learning and dimension reduction concepts were used in the design. The One-Class Support Vector Machine (OCSVM) with hyper-ellipsoid variance was used for anomaly detection due to its advantage in classifying unlabelled and multivariate data. Various One-Class Support Vector Machine formulations have been investigated and Centred-Ellipsoid has been adopted in this study due to its effectiveness. Centred-Ellipsoid is the most effective kernel among studies formulations. To decrease the computational complexity and improve memory utilization, the dimensions of the data were reduced using the Candid Covariance-Free Incremental Principal Component Analysis (CCIPCA) algorithm. Extensive experiments were conducted to evaluate the proposed lightweight anomaly detection scheme. Results in terms of detection accuracy, memory utilization, computational complexity, and communication overhead show that the proposed scheme is effective and efficient compared few existing schemes evaluated. The proposed anomaly detection scheme achieved the accuracy higher than 98%, with (𝑛𝑑) memory utilization and no communication overhead.

Download Full-text

Anomaly detection system based on principal component analysis and support vector machine

Wuhan University Journal of Natural Sciences ◽

10.1007/bf02831871 ◽

2006 ◽

Vol 11 (6) ◽

pp. 1769-1772 ◽

Cited By ~ 2

Author(s):

Li Zhanchun ◽

Li Zhitang ◽

Liu Bin

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Anomaly Detection ◽

Detection System ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Anomaly Detection System

Download Full-text

Longitudinal Crack Detection Approach Based on Principal Component Analysis and Support Vector Machine for Slab Continuous Casting

steel research international ◽

10.1002/srin.202100168 ◽

2021 ◽

Author(s):

Haiyang Duan ◽

Jingjing Wei ◽

Lin Qi ◽

Xudong Wang ◽

Yu Liu ◽

...

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Continuous Casting ◽

Crack Detection ◽

Longitudinal Crack ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Slab Continuous Casting ◽

Detection Approach

Download Full-text

Spam Detection Approach Based on C-Support Vector Machine and Kernel Principal-Component Analysis

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing ◽

10.1109/iih-msp.2014.64 ◽

2014 ◽

Author(s):

Shu Geng ◽

Liu Lv ◽

Rongjun Liu

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector ◽

Spam Detection ◽

Detection Approach

Download Full-text

Multimode Monitoring of Oxy-Gas Combustion Through Flame Imaging, Principal Component Analysis, and Kernel Support Vector Machine

Combustion Science and Technology ◽

10.1080/00102202.2016.1250749 ◽

2016 ◽

Vol 189 (5) ◽

pp. 776-792 ◽

Cited By ~ 3

Author(s):

Xiaojing Bai ◽

Gang Lu ◽

Md Moinul Hossain ◽

Yong Yan ◽

Shi Liu

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Gas Combustion ◽

Kernel Support Vector Machine ◽

Flame Imaging

Download Full-text

Multiclass classification of leukemia cancer data using Fuzzy Support Vector Machine (FSVM) with feature selection using Principal Component Analysis (PCA)

Journal of Physics Conference Series ◽

10.1088/1742-6596/1725/1/012012 ◽

2021 ◽

Vol 1725 ◽

pp. 012012

Author(s):

I R Fauzi ◽

Z Rustam ◽

A Wibowo

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Feature Selection ◽

Principal Component ◽

Component Analysis ◽

Multiclass Classification ◽

Support Vector ◽

Fuzzy Support Vector Machine ◽

Cancer Data

Download Full-text

Face Recognition Based on Principal Component Analysis and Support Vector Machine Algorithms

10.23919/ccc52363.2021.9550727 ◽

2021 ◽

Author(s):

Yanbang Zhang ◽

Fen Zhang ◽

Lei Guo

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Face Recognition ◽

Principal Component ◽

Component Analysis ◽

Support Vector

Download Full-text

Fault diagnosis of modular multilevel converter based on principal component analysis and support vector machine

Journal of Physics Conference Series ◽

10.1088/1742-6596/2030/1/012086 ◽

2021 ◽

Vol 2030 (1) ◽

pp. 012086

Author(s):

Siyu Jiang ◽

Bin Wang ◽

Wanwan Xu

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Fault Diagnosis ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Modular Multilevel Converter ◽

Multilevel Converter

Download Full-text

An Anomaly Detection Model Using Principal Component Analysis Technique for Medical Wireless Sensor Networks

10.1109/icodsa53588.2021.9617547 ◽

2021 ◽

Author(s):

Nabeel Abdulrazaq Yaseen ◽

Abbas Abd-Alhussein Hadad ◽

Mustafa Sabah Taha

Keyword(s):

Principal Component Analysis ◽

Wireless Sensor Networks ◽

Sensor Networks ◽

Anomaly Detection ◽

Principal Component ◽

Component Analysis ◽

Wireless Sensor ◽

Detection Model ◽

Analysis Technique

Download Full-text

Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression

International Journal of Engine Research ◽

10.1177/14680874211070736 ◽

2022 ◽

pp. 146808742110707

Author(s):

Aran Mohammad ◽

Reza Rezaei ◽

Christopher Hayduk ◽

Thaddaeus Delebinski ◽

Saeid Shahpouri ◽

...

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Factor Analysis ◽

Dimensionality Reduction ◽

Principal Component ◽

Component Analysis ◽

Data Driven ◽

Support Vector ◽

Emission Models ◽

Emission Modeling

The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.

Download Full-text