Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining

Combining several classifiers trained on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains by learning some material, and it has a memory retention value reflecting its historical usefulness. Classifiers with high memory retention values are kept in a “knowledge repository.” When a new data chunk arrives, the most useful classifiers are selected (recalled) from the repository to compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses the Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. In experiments against four popular data stream mining approaches on datasets with different concept drifts, MAE achieves high and stable prediction accuracy, especially for applications with recurring or complex concept drifts. The results also demonstrate the effectiveness of the MDSM model.
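The recall-and-forget loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' MAE implementation: the abstract gives no formulas, so the retention rule R = exp(-t / S) (a common textbook form of the Ebbinghaus curve), the strength update on recall, and the names `Memory`, `recall`, `forget`, and `vote` are all assumptions made for illustration. The abstract's ensemble pruning step is replaced here by a plain top-k selection on chunk accuracy.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Predictor = Callable[[Sequence[float]], int]
Chunk = List[Tuple[Sequence[float], int]]  # (features, label) pairs


@dataclass
class Memory:
    """One stored classifier plus its memory state (hypothetical structure)."""
    predict: Predictor
    strength: float = 1.0   # S in R = exp(-t / S); assumed to grow on each recall
    last_recalled: int = 0  # index of the chunk on which it was last recalled

    def retention(self, now: int) -> float:
        # Assumed Ebbinghaus-style decay since the last recall.
        return math.exp(-(now - self.last_recalled) / self.strength)


def accuracy(predict: Predictor, chunk: Chunk) -> float:
    return sum(predict(x) == y for x, y in chunk) / len(chunk)


def recall(repository: List[Memory], chunk: Chunk, now: int, k: int) -> List[Memory]:
    """Pick the k classifiers most accurate on the new chunk (stand-in for pruning)."""
    ranked = sorted(repository, key=lambda m: accuracy(m.predict, chunk), reverse=True)
    ensemble = ranked[:k]
    for m in ensemble:
        m.strength += 1.0      # assumption: each recall slows future forgetting
        m.last_recalled = now
    return ensemble


def forget(repository: List[Memory], now: int, capacity: int) -> List[Memory]:
    """Keep only the `capacity` classifiers with the highest retention."""
    return sorted(repository, key=lambda m: m.retention(now), reverse=True)[:capacity]


def vote(ensemble: List[Memory], x: Sequence[float]) -> int:
    """Unweighted majority vote; ties are broken arbitrarily."""
    votes = [m.predict(x) for m in ensemble]
    return max(set(votes), key=votes.count)
```

In a full stream loop, each arriving chunk would also train a new classifier that is added to the repository before `forget` trims it. Under this sketch, a classifier that fits a recurring concept is recalled again when the concept returns, which raises its strength and slows its later decay.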


VEDAS: A Mobile and Distributed Data Stream Mining System for Real-Time Vehicle Monitoring

Proceedings of the 2004 SIAM International Conference on Data Mining ◽ 10.1137/1.9781611972740.28 ◽ 2004 ◽ Cited By ~ 67

Author(s): Hillol Kargupta ◽ Ruchita Bhargava ◽ Kun Liu ◽ Michael Powers ◽ Patrick Blair ◽ ...

Keyword(s): Real Time ◽ Data Stream ◽ Data Stream Mining ◽ Distributed Data ◽ Stream Mining ◽ Mining System ◽ Vehicle Monitoring


MineFleet: The Vehicle Data Stream Mining System for Ubiquitous Environments

Ubiquitous Knowledge Discovery - Lecture Notes in Computer Science ◽ 10.1007/978-3-642-16392-0_14 ◽ 2010 ◽ pp. 235-254 ◽ Cited By ~ 4

Author(s): Hillol Kargupta ◽ Michael Gilligan ◽ Vasundhara Puttagunta ◽ Kakali Sarkar ◽ Martin Klein ◽ ...

Keyword(s): Data Stream ◽ Data Stream Mining ◽ Stream Mining ◽ Mining System ◽ Vehicle Data


Real-Time Clinical Decision Support System with Data Stream Mining

Journal of Biomedicine and Biotechnology ◽ 10.1155/2012/580186 ◽ 2012 ◽ Vol 2012 ◽ pp. 1-8 ◽ Cited By ~ 18

Author(s): Yang Zhang ◽ Simon Fong ◽ Jinan Fiaidhi ◽ Sabah Mohammed

Keyword(s): Real Time ◽ Support System ◽ Data Stream ◽ Early Years ◽ Data Stream Mining ◽ Diabetes Diagnosis ◽ Stream Mining ◽ Mining System ◽ Data Set ◽ Clinical Support

This research describes a new design for a data stream mining system that can analyze medical data streams and make real-time predictions. The research is motivated by growing interest in combining software technology and medical functions to develop software applications for chronic disease prognosis and diagnosis, children's healthcare, diabetes diagnosis, and so forth. Most existing software technologies are case-based data mining systems. They can analyze only finite, structured data sets, worked well only in their early years, and can hardly meet today's medical requirements. In this paper, we describe a clinical support system based on data stream mining technology; the design takes into account all the shortcomings of existing clinical support systems.
