Combining Entropy Measures for Anomaly Detection

The combination of different sources of information is a problem that arises in several situations, for instance, when data are analysed using different similarity measures. Often, each source of information is given as a similarity, distance, or a kernel matrix. In this paper, we propose a new class of methods which consists of producing, for anomaly detection purposes, a single Mercer kernel (that acts as a similarity measure) from a set of local entropy kernels and, at the same time, avoids the task of model selection. This kernel is used to build an embedding of data in a variety that will allow the use of a (modified) one-class Support Vector Machine to detect outliers. We study several information combination schemes and their limiting behaviour when the data sample size increases within an Information Geometry context. In particular, we study the variety of the given positive definite kernel matrices to obtain the desired kernel combination as belonging to that variety. The proposed methodology has been evaluated on several real and artificial problems.

Download Full-text

Bridge health anomaly detection using deep support vector data description

Neurocomputing ◽

10.1016/j.neucom.2020.08.087 ◽

2021 ◽

Author(s):

JianXi Yang ◽

Fei Yang ◽

Likai Zhang ◽

Ren Li ◽

Shixin Jiang ◽

...

Keyword(s):

Anomaly Detection ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

Deep Support

Download Full-text

Preserving Privacy in Multimedia Social Networks Using Machine Learning Anomaly Detection

Security and Communication Networks ◽

10.1155/2020/5874935 ◽

2020 ◽

Vol 2020 ◽

pp. 1-14 ◽

Cited By ~ 1

Author(s):

Randa Aljably ◽

Yuan Tian ◽

Mznah Al-Rodhaan

Keyword(s):

Machine Learning ◽

Social Networks ◽

Access Control ◽

Anomaly Detection ◽

Privacy Preservation ◽

Support Vector ◽

Detection Techniques ◽

Access Control Models ◽

Control Models ◽

Multimedia Social Networks

Nowadays, user’s privacy is a critical matter in multimedia social networks. However, traditional machine learning anomaly detection techniques that rely on user’s log files and behavioral patterns are not sufficient to preserve it. Hence, the social network security should have multiple security measures to take into account additional information to protect user’s data. More precisely, access control models could complement machine learning algorithms in the process of privacy preservation. The models could use further information derived from the user’s profiles to detect anomalous users. In this paper, we implement a privacy preservation algorithm that incorporates supervised and unsupervised machine learning anomaly detection techniques with access control models. Due to the rich and fine-grained policies, our control model continuously updates the list of attributes used to classify users. It has been successfully tested on real datasets, with over 95% accuracy using Bayesian classifier, and 95.53% on receiver operating characteristic curve using deep neural networks and long short-term memory recurrent neural network classifiers. Experimental results show that this approach outperforms other detection techniques such as support vector machine, isolation forest, principal component analysis, and Kolmogorov–Smirnov test.

Download Full-text

Anomaly Detection for Hyperspectral Imagery Based on Incremental Support Vector Data Description

2010 International Conference on Multimedia Technology ◽

10.1109/icmult.2010.5631355 ◽

2010 ◽

Author(s):

Liyan Zhang ◽

Yonghua Sun ◽

Dan Meng ◽

Xiaojuan Li

Keyword(s):

Anomaly Detection ◽

Hyperspectral Imagery ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description

Download Full-text

Robust Template Decomposition without Weight Restriction for Cellular Neural Networks Implementing Arbitrary Boolean Functions Using Support Vector Classifiers

Mathematical Problems in Engineering ◽

10.1155/2013/614543 ◽

2013 ◽

Vol 2013 ◽

pp. 1-9

Author(s):

Yih-Lon Lin ◽

Jer-Guang Hsieh ◽

Jyh-Horng Jeng

Keyword(s):

Neural Networks ◽

Boolean Function ◽

Past Research ◽

Cellular Neural Network ◽

Decomposition Methods ◽

Cellular Neural Networks ◽

Support Vector ◽

Decomposition Algorithms ◽

Weight Restriction ◽

The Given

If the given Boolean function is linearly separable, a robust uncoupled cellular neural network can be designed as a maximal margin classifier. On the other hand, if the given Boolean function is linearly separable but has a small geometric margin or it is not linearly separable, a popular approach is to find a sequence of robust uncoupled cellular neural networks implementing the given Boolean function. In the past research works using this approach, the control template parameters and thresholds are restricted to assume only a given finite set of integers, and this is certainly unnecessary for the template design. In this study, we try to remove this restriction. Minterm- and maxterm-based decomposition algorithms utilizing the soft margin and maximal margin support vector classifiers are proposed to design a sequence of robust templates implementing an arbitrary Boolean function. Several illustrative examples are simulated to demonstrate the efficiency of the proposed method by comparing our results with those produced by other decomposition methods with restricted weights.

Download Full-text

High-dimensional and wide-scale anomaly detection using enhancing support vector machine

2018 26th Signal Processing and Communications Applications Conference (SIU) ◽

10.1109/siu.2018.8404818 ◽

2018 ◽

Author(s):

Ibrahim Gumus ◽

Yahya Sirin

Keyword(s):

Support Vector Machine ◽

Anomaly Detection ◽

High Dimensional ◽

Support Vector ◽

Wide Scale

Download Full-text

Scenario-based Generalization Bound for Anomaly Detection Support Vector Machine Ensembles

Proceedings of the 30th European Safety and Reliability Conference and 15th Probabilistic Safety Assessment and Management Conference ◽

10.3850/978-981-14-8593-0_5708-cd ◽

2020 ◽

Author(s):

Roberto Rocchetta ◽

Milan Petkovic ◽

Qi Gao

Keyword(s):

Support Vector Machine ◽

Anomaly Detection ◽

Support Vector ◽

Generalization Bound

Download Full-text

Anomaly Detection in Medical Wireless Sensor Networks using SVM and Linear Regression Models

International Journal of E-Health and Medical Communications ◽

10.4018/ijehmc.2014010102 ◽

2014 ◽

Vol 5 (1) ◽

pp. 20-45 ◽

Cited By ~ 32

Author(s):

Osman Salem ◽

Alexey Guerassimov ◽

Ahmed Mehaoua ◽

Anthony Marcus ◽

Borko Furht

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Anomaly Detection ◽

Patient Data ◽

Sensor Nodes ◽

Sensor Data ◽

Wireless Sensor ◽

Smart Devices ◽

Support Vector ◽

Processing Unit

This paper details the architecture and describes the preliminary experimentation with the proposed framework for anomaly detection in medical wireless body area networks for ubiquitous patient and healthcare monitoring. The architecture integrates novel data mining and machine learning algorithms with modern sensor fusion techniques. Knowing wireless sensor networks are prone to failures resulting from their limitations (i.e. limited energy resources and computational power), using this framework, the authors can distinguish between irregular variations in the physiological parameters of the monitored patient and faulty sensor data, to ensure reliable operations and real time global monitoring from smart devices. Sensor nodes are used to measure characteristics of the patient and the sensed data is stored on the local processing unit. Authorized users may access this patient data remotely as long as they maintain connectivity with their application enabled smart device. Anomalous or faulty measurement data resulting from damaged sensor nodes or caused by malicious external parties may lead to misdiagnosis or even death for patients. The authors' application uses a Support Vector Machine to classify abnormal instances in the incoming sensor data. If found, the authors apply a periodically rebuilt, regressive prediction model to the abnormal instance and determine if the patient is entering a critical state or if a sensor is reporting faulty readings. Using real patient data in our experiments, the results validate the robustness of our proposed framework. The authors further discuss the experimental analysis with the proposed approach which shows that it is quickly able to identify sensor anomalies and compared with several other algorithms, it maintains a higher true positive and lower false negative rate.

Download Full-text

Performance comparison of intrusion detection system based anomaly detection using artificial neural network and support vector machine

10.1063/1.4958506 ◽

2016 ◽

Cited By ~ 6

Author(s):

Aditya Nur Cahyo ◽

Risanuri Hidayat ◽

Dani Adhipta

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Support Vector Machine ◽

Intrusion Detection ◽

Anomaly Detection ◽

Intrusion Detection System ◽

Detection System ◽

Performance Comparison ◽

Support Vector ◽

Artificial Neural

Download Full-text

MACHINE LEARNING METHODS IN MONITORING OPERATING BEHAVIOUR OF MARINE TWO-STROKE DIESEL ENGINE

Transport ◽

10.3846/transport.2020.14038 ◽

2020 ◽

Vol 35 (5) ◽

pp. 462-473

Author(s):

Aleksandar Vorkapić ◽

Radoslav Radonja ◽

Karlo Babić ◽

Sanda Martinčić-Ipšić

Keyword(s):

Machine Learning ◽

Anomaly Detection ◽

Fuel Consumption ◽

Performance Monitoring ◽

Absolute Error ◽

Machine Learning Algorithms ◽

Support Vector ◽

Operating Parameters ◽

Detection Model ◽

Modelling Framework

The aim of this article is to enhance performance monitoring of a two-stroke electronically controlled ship propulsion engine on the operating envelope. This is achieved by setting up a machine learning model capable of monitoring influential operating parameters and predicting the fuel consumption. Model is tested with different machine learning algorithms, namely linear regression, multilayer perceptron, Support Vector Machines (SVM) and Random Forests (RF). Upon verification of modelling framework and analysing the results in order to improve the prediction accuracy, the best algorithm is selected based on standard evaluation metrics, i.e. Root Mean Square Error (RMSE) and Relative Absolute Error (RAE). Experimental results show that, by taking an adequate combination and processing of relevant sensory data, SVM exhibit the lowest RMSE 7.1032 and RAE 0.5313%. RF achieve the lowest RMSE 22.6137 and RAE 3.8545% in a setting when minimal number of input variables is considered, i.e. cylinder indicated pressures and propulsion engine revolutions. Further, article deals with the detection of anomalies of operating parameters, which enables the evaluation of the propulsion engine condition and the early identification of failures and deterioration. Such a time-dependent, self-adopting anomaly detection model can be used for comparison with the initial condition recorded during the test and sea run or after survey and docking. Finally, we propose a unified model structure, incorporating fuel consumption prediction and anomaly detection model with on-board decision-making process regarding navigation and maintenance.

Download Full-text

IoT Dataset Validation Using Machine Learning Techniques for Traffic Anomaly Detection

Electronics ◽

10.3390/electronics10222857 ◽

2021 ◽

Vol 10 (22) ◽

pp. 2857

Author(s):

Laura Vigoya ◽

Diego Fernandez ◽

Victor Carneiro ◽

Francisco Nóvoa

Keyword(s):

Machine Learning ◽

Anomaly Detection ◽

False Positive Rate ◽

Machine Learning Techniques ◽

Support Vector ◽

High Detection Rate ◽

Security Vulnerabilities ◽

Smart Systems ◽

Learning Techniques ◽

Positive Rate

With advancements in engineering and science, the application of smart systems is increasing, generating a faster growth of the IoT network traffic. The limitations due to IoT restricted power and computing devices also raise concerns about security vulnerabilities. Machine learning-based techniques have recently gained credibility in a successful application for the detection of network anomalies, including IoT networks. However, machine learning techniques cannot work without representative data. Given the scarcity of IoT datasets, the DAD emerged as an instrument for knowing the behavior of dedicated IoT-MQTT networks. This paper aims to validate the DAD dataset by applying Logistic Regression, Naive Bayes, Random Forest, AdaBoost, and Support Vector Machine to detect traffic anomalies in IoT. To obtain the best results, techniques for handling unbalanced data, feature selection, and grid search for hyperparameter optimization have been used. The experimental results show that the proposed dataset can achieve a high detection rate in all the experiments, providing the best mean accuracy of 0.99 for the tree-based models, with a low false-positive rate, ensuring effective anomaly detection.

Download Full-text