MEOD: Memory-Efficient Outlier Detection on Streaming Data

In this paper, a memory-efficient outlier detection (MEOD) approach for streaming data is proposed. The approach uses the local correlation integral (LOCI) algorithm, which identifies outliers based on the density of neighboring points within a given radius. Selecting the radius is formulated as an optimization problem and solved with a particle swarm optimization (PSO)-based approach. MEOD is compared with existing approaches, such as the memory-efficient incremental local outlier factor (MiLOF) detection technique, in terms of memory, time, and accuracy. MEOD finds outlier points similar to those found by MiLOF with nearly equal accuracy but requires less memory for processing.
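As a rough illustration of the density test that LOCI applies at one radius, the sketch below computes the multi-granularity deviation factor (MDEF) from the original LOCI paper and flags points whose MDEF exceeds k_sigma times its normalized deviation. It is not the MEOD implementation: the function name `loci_scores`, the defaults `alpha=0.5` and `k_sigma=3.0`, and the toy data are assumptions made for this example, and the radius `r` is simply passed in rather than chosen by PSO.

```python
import math
from statistics import mean, pstdev

def loci_scores(points, r, alpha=0.5, k_sigma=3.0):
    """Return (mdef, is_outlier) lists for one fixed sampling radius r."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    # n(p, alpha*r): neighbours within the counting radius, the point itself included.
    counts = [sum(d <= alpha * r for d in row) for row in dist]
    mdef, flags = [], []
    for i in range(n):
        # Counting-neighbourhood sizes of all points within distance r of point i.
        local = [counts[j] for j in range(n) if dist[i][j] <= r]
        n_hat = mean(local)                    # average local density
        sigma_mdef = pstdev(local) / n_hat     # normalized deviation of that density
        score = 1.0 - counts[i] / n_hat        # MDEF: near 0 for typical points
        mdef.append(score)
        flags.append(score > k_sigma * sigma_mdef)
    return mdef, flags

# Hypothetical toy data: a tight cluster of 20 points and one isolated point.
cluster = [(0.01 * i, 0.0) for i in range(20)]
points = cluster + [(1.5, 0.0)]
mdef, flags = loci_scores(points, r=2.0)
```

The sketch stores all pairwise distances, so its memory use grows quadratically with the number of points. That is acceptable for a demonstration but is the kind of cost a streaming method has to avoid.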

Advanced Memory Efficient Outlier Detection Approach for Streaming Data using Swarm Optimization

10.1109/tsp52935.2021.9522667 ◽

2021 ◽

Author(s):

Ankita Karale ◽

Milena Lazarova ◽

Pavlina Koleva ◽

Vladimir Poulkov

Keyword(s):

Outlier Detection ◽

Streaming Data ◽

Swarm Optimization ◽

Detection Approach ◽

Memory Efficient

Incremental outlier detection in data streams using local correlation integral

Proceedings of the 2009 ACM symposium on Applied Computing - SAC '09 ◽

10.1145/1529282.1529623 ◽

2009 ◽

Cited By ~ 4

Author(s):

Xinjie Lu ◽

Tian Yang ◽

Zaifei Liao ◽

Manzoor Elahi ◽

Wei Liu ◽

...

Keyword(s):

Outlier Detection ◽

Data Streams ◽

Local Correlation ◽

Correlation Integral

TADILOF: Time Aware Density-Based Incremental Local Outlier Detection in Data Streams

Sensors ◽

10.3390/s20205829 ◽

2020 ◽

Vol 20 (20) ◽

pp. 5829 ◽

Cited By ~ 1

Author(s):

Jen-Wei Huang ◽

Meng-Xun Zhong ◽

Bijay Prasad Jaysawal

Keyword(s):

Outlier Detection ◽

Data Streams ◽

Data Stream ◽

State Of The Art ◽

Streaming Data ◽

Current State ◽

Data Points ◽

Local Outlier ◽

Time Aware ◽

Over Time

Outlier detection in data streams is crucial to successful data mining. However, this task is made increasingly difficult by the enormous growth in the quantity of data generated by the expansion of the Internet of Things (IoT). Recent advances in outlier detection based on density-based local outlier factor (LOF) algorithms do not consider variations in data over time. For example, a new cluster of data points may appear in the data stream over time. To overcome this issue, we present a novel algorithm for streaming data, referred to as time-aware density-based incremental local outlier detection (TADILOF). In addition, we have developed a means of estimating the LOF score, termed "approximate LOF," based on historical information following the removal of outdated data. Experimental results demonstrate that TADILOF outperforms current state-of-the-art methods in terms of AUC while achieving similar execution time. Moreover, we present an application of the proposed scheme to the development of an air-quality monitoring system.

Fast Memory Efficient Local Outlier Detection in Data Streams (Extended Abstract)

2017 IEEE 33rd International Conference on Data Engineering (ICDE) ◽

10.1109/icde.2017.32 ◽

2017 ◽

Cited By ~ 2

Author(s):

Mahsa Salehi ◽

Christopher Leckie ◽

James C. Bezdek ◽

Tharshan Vaithianathan ◽

Xuyun Zhang

Keyword(s):

Outlier Detection ◽

Data Streams ◽

Fast Memory ◽

Local Outlier ◽

Memory Efficient

Genetic-based Summarization for Local Outlier Detection in Data Stream

International Journal of Intelligent Systems and Applications ◽

10.5815/ijisa.2021.01.05 ◽

2021 ◽

Vol 13 (1) ◽

pp. 58-68

Author(s):

Mohamed Sakr ◽

Walid Atwa ◽

Arabi Keshk

Keyword(s):

Outlier Detection ◽

Data Streams ◽

Approximate Solutions ◽

Streaming Data ◽

Detection Algorithms ◽

Processing Power ◽

Static Data ◽

Large Memory ◽

Two Phases ◽

Local Outlier

Outlier detection is one of the important tasks in data mining. Detecting outliers over streaming data has become important in many applications, such as network analysis, fraud detection, and environmental monitoring. One of the well-known outlier detection algorithms is the Local Outlier Factor (LOF). However, the original LOF has drawbacks that prevent its use with data streams: (1) it needs a lot of processing power (CPU) and large memory to detect outliers; (2) it deals with static data, which means that after any change in the data, LOF recalculates the outliers from the beginning on the whole dataset. These drawbacks pose big challenges for existing outlier detection algorithms in terms of their accuracy when they are deployed in a streaming environment. In this paper, we propose a new algorithm called GSILOF that focuses on detecting outliers in data streams using a genetic algorithm. GSILOF solves the problem of large memory requirements because it has a fixed memory bound. GSILOF has two phases. First, the summarization phase summarizes the past data. Second, the detection phase detects outliers in the newly arriving data. The summarization phase uses a genetic algorithm to try to find a subset of points that can represent the whole original set. Our experiments on real-world streaming datasets confirm the effectiveness of the proposed approach and the high quality of its approximate solutions.
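To make the cost of the original LOF concrete, here is a minimal batch sketch of the standard LOF definition (k-distance, reachability distance, local reachability density). It is not GSILOF, which the abstract describes only at a high level. The function name `lof_scores`, the choice `k=3`, and the toy data are assumptions for this example; the sketch also assumes distinct points and takes exactly k neighbours when distances tie.

```python
import math

def lof_scores(points, k=3):
    """Batch LOF: scores near 1 are inliers, much larger scores are outliers."""
    n = len(points)
    # Full pairwise distance table: O(n^2) memory, rebuilt whenever the data change.
    dist = [[math.dist(p, q) for q in points] for p in points]
    # Indices of the k nearest neighbours of each point (the point itself excluded).
    knn = [sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:k]
           for i in range(n)]
    # k-distance: distance from each point to its k-th nearest neighbour.
    k_dist = [dist[i][knn[i][-1]] for i in range(n)]
    lrd = []
    for i in range(n):
        # Reachability distance of i from each neighbour j.
        reach = [max(k_dist[j], dist[i][j]) for j in knn[i]]
        lrd.append(k / sum(reach))  # local reachability density
    # LOF: mean neighbour density divided by the point's own density.
    return [sum(lrd[j] for j in knn[i]) / (k * lrd[i]) for i in range(n)]

# Hypothetical toy data: a 3x3 unit grid and one distant point (index 9).
grid = [(float(x), float(y)) for x in range(3) for y in range(3)]
scores = lof_scores(grid + [(10.0, 10.0)], k=3)
```

Both drawbacks named in the abstract are visible here: the distance table covers every pair of points, and nothing is reused when a new point arrives.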

LOCI: fast outlier detection using the local correlation integral

Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405) ◽

10.1109/icde.2003.1260802 ◽

2004 ◽

Cited By ~ 251

Author(s):

S. Papadimitriou ◽

H. Kitagawa ◽

P.B. Gibbons ◽

C. Faloutsos

Keyword(s):

Outlier Detection ◽

Local Correlation ◽

Correlation Integral

Fast Memory Efficient Local Outlier Detection in Data Streams

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2016.2597833 ◽

2016 ◽

Vol 28 (12) ◽

pp. 3246-3260 ◽

Cited By ~ 33

Author(s):

Mahsa Salehi ◽

Christopher Leckie ◽

James C. Bezdek ◽

Tharshan Vaithianathan ◽

Xuyun Zhang

Keyword(s):

Outlier Detection ◽

Data Streams ◽

Fast Memory ◽

Local Outlier ◽

Memory Efficient

A Framework for Local Outlier Detection from Spatio-Temporal Trajectory Datasets

2020 25th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr48806.2021.9412274 ◽

2021 ◽

Author(s):

Xumin Cai ◽

Berkay Aydin ◽

Anli Ji ◽

Rafal Angryk

Keyword(s):

Outlier Detection ◽

Spatio Temporal ◽

Temporal Trajectory ◽

Local Outlier

SDCOR: Scalable density-based clustering for local outlier detection in massive-scale datasets

Knowledge-Based Systems ◽

10.1016/j.knosys.2021.107256 ◽

2021 ◽

pp. 107256

Author(s):

Sayyed Ahmad Naghavi Nozad ◽

Maryam Amir Haeri ◽

Gianluigi Folino

Keyword(s):

Outlier Detection ◽

Density Based Clustering ◽

Massive Scale ◽

Local Outlier

Robust CNN Compression Framework for Security-Sensitive Embedded Systems

Applied Sciences ◽

10.3390/app11031093 ◽

2021 ◽

Vol 11 (3) ◽

pp. 1093

Author(s):

Jeonghyun Lee ◽

Sangkyun Lee

Keyword(s):

Embedded Systems ◽

Optimization Problem ◽

State Of The Art ◽

Classification Problems ◽

Proximal Gradient Method ◽

Knowledge Distillation ◽

New Type ◽

Adversarial Examples ◽

Adversarial Training ◽

Memory Efficient

Convolutional neural networks (CNNs) have achieved tremendous success in solving complex classification problems. Motivated by this success, various compression methods have been proposed for downsizing CNNs so that they can be deployed on resource-constrained embedded systems. However, compressed CNNs have recently been found to be vulnerable to adversarial examples, which is critical for security-sensitive systems because adversarial examples can cause CNNs to malfunction and can be crafted easily in many cases. In this paper, we propose a compression framework that produces compressed CNNs robust against such adversarial examples. To achieve this goal, our framework uses both pruning and knowledge distillation with adversarial training. We formulate our framework as an optimization problem and provide a solution algorithm based on the proximal gradient method, which is more memory-efficient than the popular ADMM-based compression approaches. In experiments, we show that our framework improves the trade-off between adversarial robustness and compression rate compared to the existing state-of-the-art adversarial pruning approach.
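For readers unfamiliar with the proximal gradient method, the sketch below shows its generic form on a stand-in problem: minimize 0.5 * ||Ax - b||^2 + lam * ||x||_1. The abstract does not state the paper's actual objective, so the least-squares loss, the L1 penalty (chosen because it drives weights to exactly zero, loosely resembling pruning), and the names `soft_threshold` and `proximal_gradient` are all assumptions of this example.

```python
def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0

def proximal_gradient(A, b, lam, step, iters=200):
    """Minimize 0.5*||Ax - b||^2 + lam*||x||_1 by alternating two steps."""
    rows, cols = len(A), len(A[0])
    x = [0.0] * cols
    for _ in range(iters):
        # Gradient step on the smooth term: grad = A^T (A x - b).
        residual = [sum(A[i][j] * x[j] for j in range(cols)) - b[i]
                    for i in range(rows)]
        grad = [sum(A[i][j] * residual[i] for i in range(rows))
                for j in range(cols)]
        # Proximal step on the non-smooth term: elementwise soft-thresholding.
        x = [soft_threshold(x[j] - step * grad[j], step * lam)
             for j in range(cols)]
    return x

# With A as the identity, the minimizer is b soft-thresholded by lam.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
x = proximal_gradient(identity, [3.0, 0.5, -2.0], lam=1.0, step=1.0)
```

Each iteration keeps only the current iterate and its gradient, with no auxiliary or dual variables as in ADMM, which is one plausible reading of the memory advantage the abstract claims. For convergence, the step size should not exceed the reciprocal of the largest eigenvalue of A^T A.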
