An Effective Concept Drift Detection Method on Streaming Data Using Probability Estimates

AbstractIn present times, data science become popular to support and improve decision-making process. Due to the accessibility of a wide application perspective of data streaming, class imbalance and concept drifting become crucial learning problems. The advent of deep learning (DL) models finds useful for the classification of concept drift in data streaming applications. This paper presents an effective class imbalance with concept drift detection (CIDD) using Adadelta optimizer-based deep neural networks (ADODNN), named CIDD-ADODNN model for the classification of highly imbalanced streaming data. The presented model involves four processes namely preprocessing, class imbalance handling, concept drift detection, and classification. The proposed model uses adaptive synthetic (ADASYN) technique for handling class imbalance data, which utilizes a weighted distribution for diverse minority class examples based on the level of difficulty in learning. Next, a drift detection technique called adaptive sliding window (ADWIN) is employed to detect the existence of the concept drift. Besides, ADODNN model is utilized for the classification processes. For increasing the classifier performance of the DNN model, ADO-based hyperparameter tuning process takes place to determine the optimal parameters of the DNN model. The performance of the presented model is evaluated using three streaming datasets namely intrusion detection (NSL KDDCup) dataset, Spam dataset, and Chess dataset. A detailed comparative results analysis takes place and the simulation results verified the superior performance of the presented model by obtaining a maximum accuracy of 0.9592, 0.9320, and 0.7646 on the applied KDDCup, Spam, and Chess dataset, respectively.

Download Full-text

Concept Drift Detection on Streaming Data with Dynamic Outlier Aggregation

Lecture Notes in Business Information Processing - Process Mining Workshops ◽

10.1007/978-3-030-72693-5_16 ◽

2021 ◽

pp. 206-217

Author(s):

Ludwig Zellner ◽

Florian Richter ◽

Janina Sontheim ◽

Andrea Maldonado ◽

Thomas Seidl

Keyword(s):

Concept Drift ◽

Streaming Data ◽

Concept Drift Detection

Download Full-text

A Novel Concept Drift Detection Method for Incremental Learning in Nonstationary Environments

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2019.2900956 ◽

2020 ◽

Vol 31 (1) ◽

pp. 309-320 ◽

Cited By ~ 6

Author(s):

Zhe Yang ◽

Sameer Al-Dahidi ◽

Piero Baraldi ◽

Enrico Zio ◽

Lorenzo Montelatici

Keyword(s):

Incremental Learning ◽

Detection Method ◽

Concept Drift ◽

Concept Drift Detection ◽

Novel Concept

Download Full-text

A novel concept drift detection method in data streams using ensemble classifiers

Intelligent Data Analysis ◽

10.3233/ida-150207 ◽

2016 ◽

Vol 20 (6) ◽

pp. 1329-1350 ◽

Cited By ~ 8

Author(s):

Mahdie Dehghan ◽

Hamid Beigy ◽

Poorya ZareMoodi

Keyword(s):

Data Streams ◽

Detection Method ◽

Concept Drift ◽

Ensemble Classifiers ◽

Concept Drift Detection ◽

Novel Concept

Download Full-text

A Tree-based Concept Drift Detection Method by Three-way Decisions

Proceedings of the 2017 2nd International Conference on Automation, Mechanical Control and Computational Engineering (AMCCE 2017) ◽

10.2991/amcce-17.2017.28 ◽

2017 ◽

Author(s):

BaoHe Su

Keyword(s):

Detection Method ◽

Concept Drift ◽

Concept Drift Detection

Download Full-text

Data stream mining: methods and challenges for handling concept drift

SN Applied Sciences ◽

10.1007/s42452-019-1433-0 ◽

2019 ◽

Vol 1 (11) ◽

Cited By ~ 5

Author(s):

Scott Wares ◽

John Isaacs ◽

Eyad Elyan

Keyword(s):

Data Stream ◽

Concept Drift ◽

Relevant Literature ◽

Streaming Data ◽

Future Research ◽

Stream Mining ◽

Detection Algorithms ◽

The Past ◽

Concept Drift Detection ◽

The Impact

Abstract Mining and analysing streaming data is crucial for many applications, and this area of research has gained extensive attention over the past decade. However, there are several inherent problems that continue to challenge the hardware and the state-of-the art algorithmic solutions. Examples of such problems include the unbound size, varying speed and unknown data characteristics of arriving instances from a data stream. The aim of this research is to portray key challenges faced by algorithmic solutions for stream mining, particularly focusing on the prevalent issue of concept drift. A comprehensive discussion of concept drift and its inherent data challenges in the context of stream mining is presented, as is a critical, in-depth review of relevant literature. Current issues with the evaluative procedure for concept drift detectors is also explored, highlighting problems such as a lack of established base datasets and the impact of temporal dependence on concept drift detection. By exposing gaps in the current literature, this study suggests recommendations for future research which should aid in the progression of stream mining and concept drift detection algorithms.

Download Full-text