sBiLSAN: Stacked Bidirectional Self-attention LSTM Network for Anomaly Detection and Diagnosis from System Logs

Multi-task learning based Encoder-Decoder: A comprehensive detection and diagnosis system for multi-sensor data

Advances in Mechanical Engineering ◽

10.1177/16878140211013138 ◽

2021 ◽

Vol 13 (5) ◽

pp. 168781402110131

Author(s):

Junfeng Wu ◽

Li Yao ◽

Bin Liu ◽

Zheyuan Ding ◽

Lei Zhang

Keyword(s):

Anomaly Detection ◽

Event Detection ◽

Large Scale ◽

Multivariate Time Series ◽

Sensor Data ◽

Unified Framework ◽

Diagnosis System ◽

Learning Framework ◽

Task Learning ◽

Detection And Diagnosis

As more and more sensor data are collected, automated detection and diagnosis systems are urgently needed to lessen the growing monitoring burden and reduce the risk of system faults. Extensive research has addressed anomaly detection, event detection, and anomaly diagnosis separately. However, no current approach covers all three in one unified framework. In this work, a Multi-Task Learning based Encoder-Decoder (MTLED) is proposed that can simultaneously detect anomalies, diagnose anomalies, and detect events. In MTLED, a feature matrix is introduced so that features are extracted for each time point and point-wise anomaly detection can be realized end to end. Anomaly diagnosis and event detection share the same feature matrix with anomaly detection in the multi-task learning framework and also provide important information for system monitoring. To train such a comprehensive detection and diagnosis system, a large-scale multivariate time series dataset containing anomalies of multiple types is generated with simulation tools. Extensive experiments on the synthetic dataset verify the effectiveness of MTLED and its multi-task learning framework, and the evaluation on a real-world dataset demonstrates that MTLED can be used in other application scenarios through transfer learning.
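
The idea of several tasks reading one shared per-time-point feature matrix can be illustrated with a toy sketch. This is not the MTLED architecture: the abstract does not specify the heads, so the logistic heads, the mean pooling, the weights, and the function names below are all assumptions made for illustration.

```python
import math
from typing import List, Sequence


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def pointwise_head(features: List[List[float]], w: Sequence[float], b: float) -> List[float]:
    """Hypothetical anomaly head: one score per time point (row)."""
    return [sigmoid(dot(row, w) + b) for row in features]


def pooled_head(features: List[List[float]], w: Sequence[float], b: float) -> float:
    """Hypothetical sequence-level head: score from the mean-pooled rows."""
    n = len(features)
    pooled = [sum(col) / n for col in zip(*features)]
    return sigmoid(dot(pooled, w) + b)


# Made-up 2-step, 2-feature matrix standing in for an encoder-decoder output.
features = [[0.0, 0.0], [1.0, -1.0]]
anomaly_scores = pointwise_head(features, w=[2.0, 0.0], b=0.0)
event_score = pooled_head(features, w=[1.0, 1.0], b=0.0)
```

Both heads consume the same `features` rows, so in a trained model the gradients from every task would shape the shared representation; that sharing is the only property this sketch is meant to show.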

Real-Time Evasion Attacks against Deep Learning-Based Anomaly Detection from Distributed System Logs

Proceedings of the Eleventh ACM Conference on Data and Application Security and Privacy ◽

10.1145/3422337.3447833 ◽

2021 ◽

Author(s):

J. Dinal Herath ◽

Ping Yang ◽

Guanhua Yan

Keyword(s):

Deep Learning ◽

Anomaly Detection ◽

Real Time ◽

Distributed System ◽

System Logs

D2: Anomaly Detection and Diagnosis in Networked Embedded Systems by Program Profiling and Symptom Mining

2013 IEEE 34th Real-Time Systems Symposium ◽

10.1109/rtss.2013.28 ◽

2013 ◽

Cited By ~ 6

Author(s):

Wei Dong ◽

Chun Chen ◽

Jiajun Bu ◽

Xue Liu ◽

Yunhao Liu

Keyword(s):

Embedded Systems ◽

Anomaly Detection ◽

Networked Embedded Systems ◽

Program Profiling ◽

Detection And Diagnosis

An Efficient Network Behavior Anomaly Detection using a Hybrid DBN-LSTM Network

Computers & Security ◽

10.1016/j.cose.2021.102600 ◽

2022 ◽

pp. 102600

Author(s):

Aiguo Chen ◽

Yang Fu ◽

Xu Zheng ◽

Guoming Lu

Keyword(s):

Anomaly Detection ◽

Network Behavior ◽

Lstm Network

LogGAN: A Sequence-Based Generative Adversarial Network for Anomaly Detection Based on System Logs

Science of Cyber Security - Lecture Notes in Computer Science ◽

10.1007/978-3-030-34637-9_5 ◽

2019 ◽

pp. 61-76 ◽

Cited By ~ 1

Author(s):

Bin Xia ◽

Junjie Yin ◽

Jian Xu ◽

Yun Li

Keyword(s):

Anomaly Detection ◽

Generative Adversarial Network ◽

Adversarial Network ◽

System Logs

ConAnomaly: Content-Based Anomaly Detection for System Logs

Sensors ◽

10.3390/s21186125 ◽

2021 ◽

Vol 21 (18) ◽

pp. 6125

Author(s):

Dan Lv ◽

Nurbol Luktarhan ◽

Yiyong Chen

Keyword(s):

Anomaly Detection ◽

Semantic Information ◽

Short Term Memory ◽

Weighted Average ◽

Detection Methods ◽

Detection Model ◽

Part Of Speech Tagging ◽

Part Of Speech ◽

System Logs ◽

System Maintenance

Enterprise systems typically produce a large number of logs to record runtime states and important events. Log anomaly detection is useful for business management and system maintenance. Most existing log-based anomaly detection methods use a log parser to obtain log event indexes or event templates and then apply machine learning methods to detect anomalies. However, these methods cannot handle unknown log types and do not take advantage of the semantic information in logs. In this article, we propose ConAnomaly, a log-based anomaly detection model composed of a log sequence encoder (log2vec) and a multi-layer Long Short-Term Memory (LSTM) network. We designed log2vec based on the Word2vec model: it first vectorizes the words in the log content, then removes invalid words through part-of-speech tagging, and finally obtains the sequence vector by the weighted average method. In this way, ConAnomaly not only captures semantic information in the log but also leverages the sequential relationships between logs. We evaluate our proposed approach on two log datasets. Our experimental results show that ConAnomaly has good stability, can deal with unseen log types to a certain extent, and provides better performance than most log-based anomaly detection methods.
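
The last step of the encoder, a weighted average of word vectors, can be sketched as follows. The abstract does not say how the weights are chosen or how invalid words are identified beyond part-of-speech tagging, so the `weights` mapping, the `invalid_words` set (a stand-in for the tagging step), and the function name are assumptions.

```python
from typing import Dict, List, Sequence, Set


def weighted_average_vector(
    tokens: Sequence[str],
    word_vectors: Dict[str, List[float]],
    weights: Dict[str, float],
    invalid_words: Set[str],
    dim: int,
) -> List[float]:
    """Average the vectors of the kept tokens, weighting each token.

    Tokens in `invalid_words` or without a vector are skipped; tokens
    without an explicit weight default to 1.0 (an assumption).
    """
    total = [0.0] * dim
    weight_sum = 0.0
    for tok in tokens:
        if tok in invalid_words or tok not in word_vectors:
            continue
        w = weights.get(tok, 1.0)
        for i, value in enumerate(word_vectors[tok]):
            total[i] += w * value
        weight_sum += w
    if weight_sum == 0.0:
        return total  # no usable token: return the zero vector
    return [x / weight_sum for x in total]


# Made-up 2-dimensional vectors and weights, for illustration only.
vectors = {"disk": [1.0, 0.0], "failed": [0.0, 1.0], "the": [5.0, 5.0]}
result = weighted_average_vector(
    ["the", "disk", "failed"],
    vectors,
    weights={"disk": 1.0, "failed": 3.0},
    invalid_words={"the"},
    dim=2,
)
```

In a real pipeline the vectors would come from a trained Word2vec model and the resulting sequence vectors would be fed to the LSTM; neither part is reproduced here.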

Anomaly Detection and Diagnosis Algorithms for Discrete Symbol Sequences with Applications to Airline Safety

IEEE Transactions on Systems Man and Cybernetics Part C (Applications and Reviews) ◽

10.1109/tsmcc.2008.2007248 ◽

2009 ◽

Vol 39 (1) ◽

pp. 101-113 ◽

Cited By ~ 83

Author(s):

S. Budalakoti ◽

A.N. Srivastava ◽

M.E. Otey

Keyword(s):

Anomaly Detection ◽

Airline Safety ◽

Detection And Diagnosis ◽

Symbol Sequences

Valid Probabilistic Anomaly Detection Models for System Logs

Wireless Communications and Mobile Computing ◽

10.1155/2020/8827185 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12

Author(s):

Chunbo Liu ◽

Lanlan Pan ◽

Zhaojun Gu ◽

Jialiang Wang ◽

Yitong Ren ◽

...

Keyword(s):

Anomaly Detection ◽

Large Scale ◽

Learning Algorithm ◽

Recall Rate ◽

Support Vector ◽

Fusion Algorithm ◽

Flexible Tool ◽

System Logs ◽

Output Only ◽

Better Than

System logs record the system status and important events during system operation in detail. Detecting anomalies in system logs is a common practice for modern large-scale distributed systems. Yet the threshold-based classification models used for anomaly detection output only two values, normal or abnormal, and give no probability estimate of whether the prediction is correct. In this paper, a statistical learning algorithm, the Venn-Abers predictor, is adopted to evaluate the confidence of prediction results in system log anomaly detection. It can calculate the probability distribution of labels for a set of samples and provide a quality assessment of predicted labels to some extent. Two Venn-Abers predictors, LR-VA and SVM-VA, have been implemented based on Logistic Regression and Support Vector Machine, respectively. The differences among the algorithms are then exploited to build a multimodel fusion algorithm by Stacking, and a Venn-Abers predictor based on the Stacking algorithm, called Stacking-VA, is implemented. The performances of four types of algorithms (unimodel, Venn-Abers predictor based on unimodel, multimodel, and Venn-Abers predictor based on multimodel) are compared in terms of validity and accuracy. Experiments are carried out on a log dataset of the Hadoop Distributed File System (HDFS). For the comparative experiments on unimodels, the results show that the validities of LR-VA and SVM-VA are better than those of the two corresponding underlying models. Compared with the underlying model, the accuracy of the SVM-VA predictor is better than that of the LR-VA predictor, and more significantly, the recall rate increases from 81% to 94%. In the experiments on multiple models, the algorithm based on Stacking multimodel fusion is significantly superior to the underlying classifier. The average accuracy of Stacking-VA exceeds 0.95, and its results are more stable than those of LR-VA and SVM-VA. Experimental results show that the Venn-Abers predictor is a flexible tool that can make accurate and valid probability predictions in the field of system log anomaly detection.
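
A Venn-Abers predictor turns a classifier score into a probability interval rather than a single hard label. The sketch below assumes the inductive variant: isotonic regression is fitted on a held-out calibration set twice, once with the test score labelled 0 and once labelled 1, giving a lower and an upper probability. Whether the paper uses this variant is not stated in the abstract, and the function names and calibration data are made up for illustration.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, int]  # (classifier score, label in {0, 1})


def isotonic_fit(points: List[Point]) -> Dict[float, float]:
    """Non-decreasing isotonic regression via pool-adjacent-violators.

    Returns a mapping from each distinct score to its fitted value.
    """
    grouped: Dict[float, Tuple[float, int]] = {}
    for score, label in points:
        total, count = grouped.get(score, (0.0, 0))
        grouped[score] = (total + label, count + 1)

    blocks: List[list] = []  # each block: [label_sum, count, scores]
    for score in sorted(grouped):
        total, count = grouped[score]
        blocks.append([total, count, [score]])
        # Merge backwards while the previous block mean exceeds the last one.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count, scores = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
            blocks[-1][2].extend(scores)

    return {s: total / count for total, count, scores in blocks for s in scores}


def venn_abers(calibration: List[Point], test_score: float) -> Tuple[float, float]:
    """Return (p0, p1), the lower and upper probability of label 1."""
    p0 = isotonic_fit(calibration + [(test_score, 0)])[test_score]
    p1 = isotonic_fit(calibration + [(test_score, 1)])[test_score]
    return p0, p1


# Hypothetical calibration scores, e.g. from a logistic regression model.
calibration = [(0.1, 0), (0.2, 0), (0.4, 1), (0.6, 0), (0.8, 1), (0.9, 1)]
p0, p1 = venn_abers(calibration, 0.5)
```

The width of the interval `p1 - p0` indicates how much the calibration data constrain the estimate at that score; a narrow interval suggests a more reliable probability.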

Experimentations with OpenStack System Logs and Support Vector Machine for an Anomaly Detection Model in a Private Cloud Infrastructure

2020 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD) ◽

10.1109/icabcd49160.2020.9183878 ◽

2020 ◽

Author(s):

Matthew Akanle ◽

Emmanuel Adetiba ◽

Victor Akande ◽

Adekunle Akinrinmade ◽

Sunday Ajala ◽

...

Keyword(s):

Support Vector Machine ◽

Anomaly Detection ◽

Support Vector ◽

Cloud Infrastructure ◽

Private Cloud ◽

Detection Model ◽

System Logs

Markov Chain Modeling for Anomaly Detection in High Performance Computing System Logs

Proceedings of the Fourth International Workshop on HPC User Support Tools - HUST'17 ◽

10.1145/3152493.3152559 ◽

2017 ◽

Cited By ~ 3

Author(s):

Abida Haque ◽

Alexandra DeLucia ◽

Elisabeth Baseman

Keyword(s):

Markov Chain ◽

Anomaly Detection ◽

High Performance Computing ◽

High Performance ◽

Computing System ◽

High Performance Computing System ◽

System Logs ◽

Markov Chain Modeling ◽

Performance Computing
