Concept Drift and Covariate Shift Detection Ensemble with Lagged Labels

The Multi-Armed Bandit (MAB) problem has been extensively studied in order to address real-world challenges related to sequential decision making. In this setting, an agent selects the best action to be performed at time-step t, based on the past rewards received by the environment. This formulation implicitly assumes that the expected payoff for each action is kept stationary by the environment through time. Nevertheless, in many real-world applications this assumption does not hold and the agent has to face a non-stationary environment, that is, with a changing reward distribution. Thus, we present a new MAB algorithm, named f-Discounted-Sliding-Window Thompson Sampling (f-dsw TS), for non-stationary environments, that is, when the data streaming is affected by concept drift. The f-dsw TS algorithm is based on Thompson Sampling (TS) and exploits a discount factor on the reward history and an arm-related sliding window to contrast concept drift in non-stationary environments. We investigate how to combine these two sources of information, namely the discount factor and the sliding window, by means of an aggregation function f(.). In particular, we proposed a pessimistic (f=min), an optimistic (f=max), as well as an averaged (f=mean) version of the f-dsw TS algorithm. A rich set of numerical experiments is performed to evaluate the f-dsw TS algorithm compared to both stationary and non-stationary state-of-the-art TS baselines. We exploited synthetic environments (both randomly-generated and controlled) to test the MAB algorithms under different types of drift, that is, sudden/abrupt, incremental, gradual and increasing/decreasing drift. Furthermore, we adapt four real-world active learning tasks to our framework—a prediction task on crimes in the city of Baltimore, a classification task on insects species, a recommendation task on local web-news, and a time-series analysis on microbial organisms in the tropical air ecosystem. The f-dsw TS approach emerges as the best performing MAB algorithm. At least one of the versions of f-dsw TS performs better than the baselines in synthetic environments, proving the robustness of f-dsw TS under different concept drift types. Moreover, the pessimistic version (f=min) results as the most effective in all real-world tasks.

Download Full-text

Energy-aware very fast decision tree

International Journal of Data Science and Analytics ◽

10.1007/s41060-021-00246-4 ◽

2021 ◽

Author(s):

Eva García-Martín ◽

Niklas Lavesson ◽

Håkan Grahn ◽

Emiliano Casalicchio ◽

Veselka Boeva

Keyword(s):

Energy Consumption ◽

Decision Tree ◽

Concept Drift ◽

Algorithm Design ◽

Energy Aware ◽

Battery Capacity ◽

Very Fast Decision Tree ◽

Public Datasets ◽

Additional Constraints ◽

Fast Decision

AbstractRecently machine learning researchers are designing algorithms that can run in embedded and mobile devices, which introduces additional constraints compared to traditional algorithm design approaches. One of these constraints is energy consumption, which directly translates to battery capacity for these devices. Streaming algorithms, such as the Very Fast Decision Tree (VFDT), are designed to run in such devices due to their high velocity and low memory requirements. However, they have not been designed with an energy efficiency focus. This paper addresses this challenge by presenting the nmin adaptation method, which reduces the energy consumption of the VFDT algorithm with only minor effects on accuracy. nmin adaptation allows the algorithm to grow faster in those branches where there is more confidence to create a split, and delays the split on the less confident branches. This removes unnecessary computations related to checking for splits but maintains similar levels of accuracy. We have conducted extensive experiments on 29 public datasets, showing that the VFDT with nmin adaptation consumes up to 31% less energy than the original VFDT, and up to 96% less energy than the CVFDT (VFDT adapted for concept drift scenarios), trading off up to 1.7 percent of accuracy.

Download Full-text

Concept Drift Adaptation Techniques in Distributed Environment for Real-World Data Streams

Smart Cities ◽

10.3390/smartcities4010021 ◽

2021 ◽

Vol 4 (1) ◽

pp. 349-371

Author(s):

Hassan Mehmood ◽

Panos Kostakos ◽

Marta Cortes ◽

Theodoros Anagnostopoulos ◽

Susanna Pirttikangas ◽

...

Keyword(s):

Real World ◽

Data Streams ◽

Smart City ◽

Smart Cities ◽

Concept Drift ◽

Distributed Environment ◽

Real World Data ◽

Unique Challenge ◽

World Data ◽

Concept Drift Detection

Real-world data streams pose a unique challenge to the implementation of machine learning (ML) models and data analysis. A notable problem that has been introduced by the growth of Internet of Things (IoT) deployments across the smart city ecosystem is that the statistical properties of data streams can change over time, resulting in poor prediction performance and ineffective decisions. While concept drift detection methods aim to patch this problem, emerging communication and sensing technologies are generating a massive amount of data, requiring distributed environments to perform computation tasks across smart city administrative domains. In this article, we implement and test a number of state-of-the-art active concept drift detection algorithms for time series analysis within a distributed environment. We use real-world data streams and provide critical analysis of results retrieved. The challenges of implementing concept drift adaptation algorithms, along with their applications in smart cities, are also discussed.

Download Full-text

Addressing Event-Driven Concept Drift in Twitter Stream: a Stance Detection Application

IEEE Access ◽

10.1109/access.2021.3083578 ◽

2021 ◽

pp. 1-1

Author(s):

Alessio Bechini ◽

Alessandro Bondielli ◽

Pietro Ducange ◽

Francesco Marcelloni ◽

Alessandro Renda

Keyword(s):

Concept Drift ◽

Event Driven

Download Full-text

Towards Online Learning and Concept Drift for Offloading Complex Event Processing in the Edge

2020 IEEE/ACM Symposium on Edge Computing (SEC) ◽

10.1109/sec50012.2020.00024 ◽

2020 ◽

Author(s):

Joao Alexandre Neto ◽

Jorge C. B. Fonseca ◽

Kiev Gama

Keyword(s):

Online Learning ◽

Concept Drift ◽

Complex Event Processing ◽

Event Processing

Download Full-text

Concept Drift and Covariate Shift Detection Ensemble with Lagged Labels

Empirical Analysis on Stream Classification and Clustering with Concept Drift in MOA

Random Tree Data Stream Classifier With Sliding Window Estimator And Concept Drift

Concept drift detection and localization in process mining

Analyzing and repairing concept drift adaptation in data stream classification

Streaming Data Classification using Hybrid Classifiers to tackle Stability-Plasticity Dilemma and Concept Drift

Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm

Energy-aware very fast decision tree

Concept Drift Adaptation Techniques in Distributed Environment for Real-World Data Streams

Addressing Event-Driven Concept Drift in Twitter Stream: a Stance Detection Application

Towards Online Learning and Concept Drift for Offloading Complex Event Processing in the Edge

Export Citation Format