concept drifts Latest Research Papers

When predicting data streams, changes in data distribution may decrease model accuracy over time, thereby making the model obsolete. This phenomenon is known as concept drift. Detecting concept drifts and then adapting to them are critical operations to maintain model performance. However, model adaptation can only be made if labeled data is available. Labeling data is both costly and time-consuming because it has to be done by humans. Only part of the data can be labeled in the data stream because the data size is massive and appears at high speed. To solve these problems simultaneously, we apply a technique to update the model by employing both labeled and unlabeled instances to do so. The experiment results show that our proposed method can adapt to the concept drift with pseudo-labels and maintain its accuracy even though label availability is drastically reduced from 95% to 5%. The proposed method also has the highest overall accuracy and outperforms other methods in 5 of 10 datasets.

Download Full-text

Addressing Concept Drifts Using Deep Learning for Heart Disease Prediction: A Review

10.1007/978-981-16-3346-1_13 ◽

2021 ◽

pp. 157-167

Author(s):

Ketan Sanjay Desale ◽

Swati V. Shinde

Keyword(s):

Heart Disease ◽

Deep Learning ◽

Disease Prediction ◽

Concept Drifts

Download Full-text

ADAW: Age decay accuracy weighted ensemble method for drifting data stream mining

Intelligent Data Analysis ◽

10.3233/ida-205249 ◽

2021 ◽

Vol 25 (5) ◽

pp. 1131-1152

Author(s):

Ritesh Srivastava ◽

Veena Mittal

Keyword(s):

Data Streams ◽

Data Stream ◽

Dynamic Environment ◽

Ensemble Methods ◽

Life Time ◽

Novel Approach ◽

Data Source ◽

Methods Of Learning ◽

Concept Drifts ◽

Forgetting Mechanism

Dynamic environment data generators are very often in real-world that produce data streams. A data source of a dynamic environment generates data streams in which the underlying data distribution changes very frequently with respect to time and hence results in concept drifts. As compared to the stationary environment, learning in the dynamic environment is very difficult due to the presence of concept drifts. Learning in dynamic environment requires evolutionary and adaptive approaches to be accommodated with the learning algorithms. Ensemble methods are commonly used to build classifiers for learning in a dynamic environment. The ensemble methods of learning are generally described at three very crucial aspects, namely, the learning and testing method employed, result integration method and forgetting mechanism for old concepts. In this paper, we propose a novel approach called Age Decay Accuracy Weighted (ADAW) ensemble architecture for learning in concept drifting data streams. The ADAW method assigned weights to the component classifiers based on its accuracy and its remaining life-time in the ensemble is such a way that ensures maximum accuracy. We empirically evaluated ADAW on benchmark artificial drifting data stream generators and real datasets and compared its performance with ten well-known state-of-the-art existing methods. The experimental results show that ADAW outperforms over the existing methods.

Download Full-text

A new predictive method supporting streaming data with hybrid recurring concept drifts in process industry

Computers & Industrial Engineering ◽

10.1016/j.cie.2021.107625 ◽

2021 ◽

pp. 107625

Author(s):

Linjin Sun ◽

Yangjian Ji ◽

Mingrui Zhu ◽

Fu Gu ◽

Feng Dai ◽

...

Keyword(s):

Streaming Data ◽

Process Industry ◽

Predictive Method ◽

Concept Drifts

Download Full-text

Challenges in Streaming Data Analysis for Building an Adaptive Model for Handling Concept Drifts

10.1109/icscan53069.2021.9526376 ◽

2021 ◽

Author(s):

P Shahad ◽

Ebin Deni Raj

Keyword(s):

Data Analysis ◽

Streaming Data ◽

Adaptive Model ◽

Concept Drifts

Download Full-text

Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3466616 ◽

2021 ◽

Vol 16 (2) ◽

pp. 1-30

Author(s):

Juan I. G. Hidalgo ◽

Silas G. T. C. Santos ◽

Roberto S. M. Barros

Keyword(s):

Parameter Estimation ◽

Real World ◽

Data Streams ◽

Data Stream ◽

Concept Drift ◽

Estimation Method ◽

Estimation Procedure ◽

Dynamic Parameter ◽

Real World Datasets ◽

Concept Drifts

A data stream can be defined as a system that continually generates a lot of data over time. Today, processing data streams requires new demands and challenging tasks in the data mining and machine learning areas. Concept Drift is a problem commonly characterized as changes in the distribution of the data within a data stream. The implementation of new methods for dealing with data streams where concept drifts occur requires algorithms that can adapt to several scenarios to improve its performance in the different experimental situations where they are tested. This research proposes a strategy for dynamic parameter adjustment in the presence of concept drifts. Parameter Estimation Procedure (PEP) is a general method proposed for dynamically adjusting parameters which is applied to the diversity parameter (λ) of several classification ensembles commonly used in the area. To this end, the proposed estimation method (PEP) was used to create Boosting-like Online Learning Ensemble with Parameter Estimation (BOLE-PE), Online AdaBoost-based M1 with Parameter Estimation (OABM1-PE), and Oza and Russell’s Online Bagging with Parameter Estimation (OzaBag-PE), based on the existing ensembles BOLE, OABM1, and OzaBag, respectively. To validate them, experiments were performed with artificial and real-world datasets using Hoeffding Tree (HT) as base classifier. The accuracy results were statistically evaluated using a variation of the Friedman test and the Nemenyi post-hoc test. The experimental results showed that the application of the dynamic estimation in the diversity parameter (λ) produced good results in most scenarios, i.e., the modified methods have improved accuracy in the experiments with both artificial and real-world datasets.

Download Full-text

Statistical Tests to Identify Virtual Concept Drifts

10.1109/ijcnn52387.2021.9534387 ◽

2021 ◽

Author(s):

Paulo M. Goncalves ◽

Sylvain Chartier ◽

Roberto Souto Maior de Barros

Keyword(s):

Statistical Tests ◽

Concept Drifts

Download Full-text

Predict COVID-19 Spreading With C-SMOTE

Business Information Systems ◽

10.52825/bis.v1i.45 ◽

2021 ◽

pp. 27-38

Author(s):

Alessio Bernardo ◽

Emanuele Della Valle

Keyword(s):

Machine Learning ◽

State Of The Art ◽

High Impact ◽

Statistical Evidence ◽

The Other ◽

Classification Algorithms ◽

Minority Class ◽

Art Methods ◽

Concept Drifts

Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelinesthat use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.

Download Full-text

ADES: A New Ensemble Diversity-Based Approach for Handling Concept Drift

Mobile Information Systems ◽

10.1155/2021/5549300 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Tinofirei Museba ◽

Fulufhelo Nelwamondo ◽

Khmaies Ouahada

Keyword(s):

Machine Learning ◽

Real World ◽

Data Streams ◽

Predictive Models ◽

Concept Drift ◽

Dynamic Environments ◽

Real World Data ◽

World Data ◽

Different Types ◽

Concept Drifts

Beyond applying machine learning predictive models to static tasks, a significant corpus of research exists that applies machine learning predictive models to streaming environments that incur concept drift. With the prevalence of streaming real-world applications that are associated with changes in the underlying data distribution, the need for applications that are capable of adapting to evolving and time-varying dynamic environments can be hardly overstated. Dynamic environments are nonstationary and change with time and the target variables to be predicted by the learning algorithm and often evolve with time, a phenomenon known as concept drift. Most work in handling concept drift focuses on updating the prediction model so that it can recover from concept drift while little effort has been dedicated to the formulation of a learning system that is capable of learning different types of drifting concepts at any time with minimum overheads. This work proposes a novel and evolving data stream classifier called Adaptive Diversified Ensemble Selection Classifier (ADES) that significantly optimizes adaptation to different types of concept drifts at any time and improves convergence to new concepts by exploiting different amounts of ensemble diversity. The ADES algorithm generates diverse base classifiers, thereby optimizing the margin distribution to exploit ensemble diversity to formulate an ensemble classifier that generalizes well to unseen instances and provides fast recovery from different types of concept drift. Empirical experiments conducted on both artificial and real-world data streams demonstrate that ADES can adapt to different types of drifts at any given time. The prediction performance of ADES is compared to three other ensemble classifiers designed to handle concept drift using both artificial and real-world data streams. The comparative evaluation performed demonstrated the ability of ADES to handle different types of concept drifts. The experimental results, including statistical test results, indicate comparable performances with other algorithms designed to handle concept drift and prove their significance and effectiveness.

Download Full-text

A METHOD FOR INTERPRETING CONCEPT DRIFTS IN A STREAMING ENVIRONMENT

10.17771/pucrio.acad.54157 ◽

2021 ◽

Author(s):

JOAO GUILHERME MATTOS DE O SANTOS

Keyword(s):

Concept Drifts

Download Full-text

concept drifts
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Handling concept drifts and limited label problems using semi-supervised combine-merge Gaussian mixture model

Addressing Concept Drifts Using Deep Learning for Heart Disease Prediction: A Review

ADAW: Age decay accuracy weighted ensemble method for drifting data stream mining

A new predictive method supporting streaming data with hybrid recurring concept drifts in process industry

Challenges in Streaming Data Analysis for Building an Adaptive Model for Handling Concept Drifts

Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift

Statistical Tests to Identify Virtual Concept Drifts

Predict COVID-19 Spreading With C-SMOTE

ADES: A New Ensemble Diversity-Based Approach for Handling Concept Drift

A METHOD FOR INTERPRETING CONCEPT DRIFTS IN A STREAMING ENVIRONMENT

Export Citation Format

concept driftsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Handling concept drifts and limited label problems using semi-supervised combine-merge Gaussian mixture model

Addressing Concept Drifts Using Deep Learning for Heart Disease Prediction: A Review

ADAW: Age decay accuracy weighted ensemble method for drifting data stream mining

A new predictive method supporting streaming data with hybrid recurring concept drifts in process industry

Challenges in Streaming Data Analysis for Building an Adaptive Model for Handling Concept Drifts

Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift

Statistical Tests to Identify Virtual Concept Drifts

Predict COVID-19 Spreading With C-SMOTE

ADES: A New Ensemble Diversity-Based Approach for Handling Concept Drift

A METHOD FOR INTERPRETING CONCEPT DRIFTS IN A STREAMING ENVIRONMENT

concept drifts
Recently Published Documents