Mining data streams with concept drifts using genetic algorithm

The treatment of large data streams in the presence of concept drifts is one of the main challenges in the field of data mining, particularly when the algorithms have to deal with concepts that disappear and then reappear. This paper presents a new algorithm, called Fast Adapting Ensemble (FAE), which adapts very quickly to both abrupt and gradual concept drifts, and has been specifically designed to deal with recurring concepts. FAE processes the learning examples in blocks of the same size, but it does not have to wait for a block to be complete in order to adapt its base classification mechanism. FAE incorporates a drift detector to improve the handling of abrupt concept drifts and stores a set of inactive classifiers that represent old concepts, which are activated very quickly when these concepts reappear. We compare our new algorithm with various well-known learning algorithms on common benchmark datasets. The experiments show promising results for the proposed algorithm, in both accuracy and runtime, across different types of concept drift.
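The idea of keeping inactive classifiers for old concepts and reactivating them when a concept recurs can be illustrated with a small toy sketch. This is not the FAE algorithm from the paper: the class names (`TableLearner`, `RecurringConceptEnsemble`), the window-based error-rate drift check, and all thresholds are assumptions chosen only to make the mechanism visible.

```python
from collections import Counter, defaultdict, deque


class TableLearner:
    """Toy base classifier (hypothetical): most frequent label per input value."""

    def __init__(self):
        self.counts = defaultdict(Counter)

    def learn(self, x, y):
        self.counts[x][y] += 1

    def predict(self, x):
        seen = self.counts.get(x)
        return seen.most_common(1)[0][0] if seen else 0

    def accuracy(self, examples):
        return sum(self.predict(x) == y for x, y in examples) / len(examples)


class RecurringConceptEnsemble:
    """Illustrative only: one active classifier plus a pool of inactive ones."""

    def __init__(self, window=20, drift_error=0.5, reuse_accuracy=0.9):
        self.active = TableLearner()
        self.inactive = []                    # classifiers for old concepts
        self.recent = deque(maxlen=window)    # most recent labelled examples
        self.errors = deque(maxlen=window)    # recent mistakes of the active model
        self.drift_error = drift_error        # assumed drift threshold
        self.reuse_accuracy = reuse_accuracy  # assumed reactivation threshold
        self.reactivations = 0

    def process(self, x, y):
        """Test-then-train on one example; returns the prediction made first."""
        prediction = self.active.predict(x)
        self.errors.append(prediction != y)
        self.recent.append((x, y))
        window_full = len(self.errors) == self.errors.maxlen
        if window_full and sum(self.errors) / len(self.errors) > self.drift_error:
            self._handle_drift()
        else:
            self.active.learn(x, y)
        return prediction

    def _handle_drift(self):
        # Retire the current classifier, then look for an older one that
        # already fits the newest half of the window (the new concept).
        self.inactive.append(self.active)
        evidence = list(self.recent)[len(self.recent) // 2:]
        best = max(self.inactive[:-1],
                   key=lambda c: c.accuracy(evidence), default=None)
        if best is not None and best.accuracy(evidence) >= self.reuse_accuracy:
            self.inactive.remove(best)
            self.active = best                # recurring concept: reactivate
            self.reactivations += 1
        else:
            self.active = TableLearner()      # unseen concept: start fresh
            for x, y in evidence:
                self.active.learn(x, y)
        self.errors.clear()
```

Feeding this sketch a stream that switches from one labelling rule to another and back again should cause the classifier trained on the first rule to be pulled out of the inactive pool rather than relearned from scratch.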

Exploiting fractal dimension and a distributed evolutionary approach to classify data streams with concept drifts

Applied Soft Computing ◽ 10.1016/j.asoc.2018.11.009 ◽ 2019 ◽ Vol 75 ◽ pp. 284-297 ◽ Cited By ~ 2

Author(s): Gianluigi Folino ◽ Massimo Guarascio ◽ Giuseppe Papuzzo

Keyword(s): Fractal Dimension ◽ Data Streams ◽ Evolutionary Approach ◽ Concept Drifts

Mining Data Streams: Systems and Algorithms

Machine Learning and Knowledge Discovery for Engineering Systems Health Management ◽ 10.1201/b11580-1 ◽ 2016 ◽ pp. 3-37

Author(s): Charu C. Aggarwal ◽ Deepak S. Turaga

Keyword(s): Data Streams ◽ Mining Data Streams

Mining Data Streams

Mining of Massive Datasets ◽ 10.1017/cbo9781139924801.005 ◽ 2014 ◽ pp. 123-153

Author(s): Jure Leskovec ◽ Anand Rajaraman ◽ Jeffrey David Ullman

Keyword(s): Data Streams ◽ Mining Data Streams

Mining Data Streams

Data Mining and Knowledge Discovery Handbook ◽ 10.1007/0-387-25465-x_36 ◽ 2006 ◽ pp. 777-792 ◽ Cited By ~ 2

Author(s): Haixun Wang ◽ Philip S. Yu ◽ Jiawei Han

Keyword(s): Data Streams ◽ Mining Data Streams

Knowledge Discovery From Evolving Data Streams

Advances in Business Information Systems and Analytics - Machine Learning Techniques for Improved Business Analytics ◽ 10.4018/978-1-5225-3534-8.ch002 ◽ 2019 ◽ pp. 19-39

Author(s): Prasanna Lakshmi Kompalli

Keyword(s): Real Time ◽ Data Streams ◽ Data Stream ◽ Concept Drift ◽ Data Stream Mining ◽ Time Data ◽ Stream Mining ◽ New Challenges ◽ Mining Data Streams ◽ Different Sources

Data coming from different sources are referred to as data streams. Data stream mining is an online learning technique in which each data point must be processed as it arrives and discarded once processing is complete. Advances in technology have made it possible to monitor these data streams in real time, which has created many new challenges for researchers. The main features of this type of data are that it is fast flowing, large in volume, continuous, and growing, and that its characteristics may change over time, a phenomenon termed concept drift. This chapter addresses the problems of mining data streams with concept drift. Isolating the relevant literature on these problems can be a grueling task for researchers and practitioners. This chapter tries to provide a solution by bringing together the techniques used for data stream mining with concept drift.
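The one-pass constraint described above (process each point on arrival, then discard it) and the notion of a change in the data's characteristics can be sketched with the Page-Hinkley test, a standard change-detection method. The chapter abstract does not name this test; it is used here only as an illustration, and the names `PageHinkley` and `first_drift` and the parameter values are assumptions.

```python
class PageHinkley:
    """Page-Hinkley test for an upward shift in the mean of a stream.

    Keeps only a few running statistics, so memory use is constant no
    matter how many values have been seen.
    """

    def __init__(self, delta=0.005, threshold=5.0):
        self.delta = delta          # tolerated deviation (assumed value)
        self.threshold = threshold  # alarm level (assumed value)
        self.n = 0
        self.mean = 0.0
        self.cum = 0.0              # cumulative deviation from the running mean
        self.min_cum = 0.0          # smallest cumulative deviation seen so far

    def update(self, value):
        """Consume one value; return True if a drift is signalled."""
        self.n += 1
        self.mean += (value - self.mean) / self.n
        self.cum += value - self.mean - self.delta
        self.min_cum = min(self.min_cum, self.cum)
        return self.cum - self.min_cum > self.threshold


def first_drift(stream, detector=None):
    """Read the stream one value at a time, storing none of it.

    Returns the index of the first value at which drift is signalled,
    or None if the stream ends without an alarm.
    """
    detector = detector or PageHinkley()
    for index, value in enumerate(stream):
        if detector.update(value):
            return index
    return None
```

In practice the monitored value is often the per-example error of a classifier (0 for a correct prediction, 1 for a mistake), so that a sustained rise in the error rate is reported as a possible concept drift.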
