Mining data streams with concept drifts using genetic algorithm

2011 ◽  
Vol 36 (3) ◽  
pp. 163-178 ◽  
Author(s):  
Periasamy Vivekanandan ◽  
Raju Nedunchezhian
2015 ◽  
Vol 2015 ◽  
pp. 1-14 ◽  
Author(s):  
Agustín Ortíz Díaz ◽  
José del Campo-Ávila ◽  
Gonzalo Ramos-Jiménez ◽  
Isvani Frías Blanco ◽  
Yailé Caballero Mota ◽  
...  

The treatment of large data streams in the presence of concept drifts is one of the main challenges in the field of data mining, particularly when the algorithms have to deal with concepts that disappear and then reappear. This paper presents a new algorithm, called Fast Adapting Ensemble (FAE), which adapts very quickly to both abrupt and gradual concept drifts, and has been specifically designed to deal with recurring concepts. FAE processes the learning examples in blocks of the same size, but it does not have to wait for the batch to be complete in order to adapt its base classification mechanism. FAE incorporates a drift detector to improve the handling of abrupt concept drifts and stores a set of inactive classifiers that represent old concepts, which are activated very quickly when these concepts reappear. We compare our new algorithm with various well-known learning algorithms, taking into account, common benchmark datasets. The experiments show promising results from the proposed algorithm (regarding accuracy and runtime), handling different types of concept drifts.


2014 ◽  
pp. 123-153
Author(s):  
Jure Leskovec ◽  
Anand Rajaraman ◽  
Jeffrey David Ullman

Author(s):  
Haixun Wang ◽  
Philip S. Yu ◽  
Jiawei Han

Author(s):  
Prasanna Lakshmi Kompalli

Data coming from different sources is referred to as data streams. Data stream mining is an online learning technique where each data point must be processed as the data arrives and discarded as the processing is completed. Progress of technologies has resulted in the monitoring these data streams in real time. Data streams has created many new challenges to the researchers in real time. The main features of this type of data are they are fast flowing, large amounts of data which are continuous and growing in nature, and characteristics of data might change in course of time which is termed as concept drift. This chapter addresses the problems in mining data streams with concept drift. Due to which, isolating the correct literature would be a grueling task for researchers and practitioners. This chapter tries to provide a solution as it would be an amalgamation of all techniques used for data stream mining with concept drift.


2012 ◽  
pp. 108-138
Author(s):  
Anand Rajaraman ◽  
Jeffrey David Ullman

2008 ◽  
Vol 12 (6) ◽  
pp. 37-49 ◽  
Author(s):  
Jing Gao ◽  
Bolin Ding ◽  
Wei Fan ◽  
Jiawei Han ◽  
Philip S. Yu
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document