Whole Time Series Data Streams Clustering: Dynamic Profiling of the Electricity Consumption

Data from smart grids are challenging to analyze because of their very large size, high dimensionality, skewness, sparsity, and multiple seasonal fluctuations, including daily and weekly effects. Because the data arrive sequentially, the underlying distribution can change over time. Time series data streams also pose their own processing and analysis constraints: large volumes are generated quickly, so it is usually impossible to hold the whole data set in memory, and processing and analysis must be done incrementally using sliding windows. Although many clustering techniques have been proposed for grouping the observations of a single data stream, only a few focus on partitioning whole data streams into clusters. In this article we aim to explore individual characteristics of electricity usage and recommend the most suitable tariff to each customer so that they can benefit from lower prices. This work investigates various algorithms (and their improvements) that allow us to form the clusters, in real time, from smart meter data.
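
The incremental, sliding-window processing mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the class name `SlidingWindowMean`, the window size, and the sample readings are all hypothetical, and a running mean stands in for whatever per-stream summary a real clustering pipeline would maintain.

```python
from collections import deque


class SlidingWindowMean:
    """Fixed-size window over one meter's readings with an O(1) running mean.

    Hypothetical helper for illustration; not taken from the article.
    """

    def __init__(self, size):
        self.size = size
        self.values = deque()
        self.total = 0.0

    def update(self, reading):
        # Add the newest reading and evict the oldest once the window is full,
        # so memory use stays bounded no matter how long the stream runs.
        self.values.append(reading)
        self.total += reading
        if len(self.values) > self.size:
            self.total -= self.values.popleft()
        return self.total / len(self.values)


# Made-up readings (kWh) arriving one at a time from a single meter.
window = SlidingWindowMean(size=3)
means = [window.update(x) for x in [1.0, 2.0, 3.0, 4.0, 5.0]]
```

Each update touches only the newest and the oldest reading, which is why this style of summary suits streams that cannot be held in memory as a whole.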

Dynamic clustering of residential electricity consumption time series data based on Hausdorff distance

Electric Power Systems Research ◽

10.1016/j.epsr.2016.05.023 ◽

2016 ◽

Vol 140 ◽

pp. 517-526 ◽

Cited By ~ 12

Author(s):

Ignacio Benítez ◽

José-Luis Díez ◽

Alfredo Quijano ◽

Ignacio Delgado

Keyword(s):

Time Series ◽

Hausdorff Distance ◽

Time Series Data ◽

Electricity Consumption ◽

Series Data ◽

Dynamic Clustering ◽

Residential Electricity ◽

Residential Electricity Consumption

Electricity Consumption Prediction Based on Time Series Data Features Integrate with Long Short-Term Memory Model

Business Intelligence and Information Technology - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-3-030-92632-8_80 ◽

2021 ◽

pp. 844-853

Author(s):

Jiaqiu Wang ◽

Hao Mou ◽

Hai Lin ◽

Yining Jin ◽

Ruijie Wang

Keyword(s):

Time Series ◽

Time Series Data ◽

Short Term Memory ◽

Electricity Consumption ◽

Memory Model ◽

Series Data ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory ◽

Consumption Prediction

Multi-Dimensional Regression Analysis of Time-Series Data Streams

VLDB '02: Proceedings of the 28th International Conference on Very Large Databases ◽

10.1016/b978-155860869-6/50036-6 ◽

2002 ◽

pp. 323-334 ◽

Cited By ~ 139

Author(s):

Yixin Chen ◽

Guozhu Dong ◽

Jiawei Han ◽

Benjamin W. Wah ◽

Jianyong Wang

Keyword(s):

Time Series ◽

Regression Analysis ◽

Data Streams ◽

Time Series Data ◽

Series Data ◽

Analysis Of Time Series

Detecting Current Outliers: Continuous Outlier Detection over Time-Series Data Streams

Lecture Notes in Computer Science - Database and Expert Systems Applications ◽

10.1007/978-3-540-85654-2_26 ◽

2008 ◽

pp. 255-268 ◽

Cited By ~ 6

Author(s):

Kozue Ishida ◽

Hiroyuki Kitagawa

Keyword(s):

Time Series ◽

Outlier Detection ◽

Data Streams ◽

Time Series Data ◽

Series Data ◽

Over Time

A CDF-Based Symbolic Time-Series Data Mining Approach for Electricity Consumption Analysis

HCI International 2018 – Posters' Extended Abstracts - Communications in Computer and Information Science ◽

10.1007/978-3-319-92285-0_71 ◽

2018 ◽

pp. 515-521

Author(s):

I-Chin Wu ◽

Yi-An Chen ◽

Zan-Xian Wang

Keyword(s):

Data Mining ◽

Time Series ◽

Time Series Data ◽

Electricity Consumption ◽

Series Data ◽

Time Series Data Mining ◽

Data Mining Approach

A MPAA-Based Iterative Clustering Algorithm Augmented by Nearest Neighbors Search for Time-Series Data Streams

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/11430919_40 ◽

2005 ◽

pp. 333-342 ◽

Cited By ~ 9

Author(s):

Jessica Lin ◽

Michail Vlachos ◽

Eamonn Keogh ◽

Dimitrios Gunopulos ◽

Jianwei Liu ◽

...

Keyword(s):

Time Series ◽

Data Streams ◽

Clustering Algorithm ◽

Time Series Data ◽

Nearest Neighbors ◽

Series Data

Inferential Precision in Single-Case Time-Series Data Streams: How Well Does the EM Procedure Perform When Missing Observations Occur in Autocorrelated Data?

Behavior Therapy ◽

10.1016/j.beth.2011.10.001 ◽

2012 ◽

Vol 43 (3) ◽

pp. 679-685 ◽

Cited By ~ 10

Author(s):

Justin D. Smith ◽

Jeffrey J. Borckardt ◽

Michael R. Nash

Keyword(s):

Time Series ◽

Data Streams ◽

Time Series Data ◽

Single Case ◽

Series Data ◽

Missing Observations ◽

Autocorrelated Data

An Adaptive Forecasting Method for Time-Series Data Streams

ACTA AUTOMATICA SINICA ◽

10.1360/aas-007-0197 ◽

2007 ◽

Vol 33 (2) ◽

pp. 0197 ◽

Cited By ~ 7

Author(s):

Yong-Li WANG

Keyword(s):

Time Series ◽

Data Streams ◽

Time Series Data ◽

Series Data ◽

Forecasting Method

Hybrid Algorithm for Anomaly Removal in Time Series Data Mining

10.20944/preprints202111.0440.v1 ◽

2021 ◽

Author(s):

Abdul Razaque ◽

Marzhan Abenova ◽

Munif Alotaibi ◽

Bandar Alotaibi ◽

Hamoud Alshammari ◽

...

Keyword(s):

Data Mining ◽

Time Series ◽

Hybrid Algorithm ◽

Time Series Data ◽

State Of The Art ◽

Large Data ◽

Series Data ◽

Multidimensional Data ◽

Search Problem ◽

Short Text

Time series data are significant and are derived from temporal data, which involve real numbers representing values collected regularly over time. Time series have a great impact on many types of data. However, time series have anomalies. We introduce a hybrid algorithm, named novel matrix profile (NMP), to solve the all-pairs similarity search problem for time series data. The proposed NMP inherits features from two state-of-the-art algorithms: similarity time-series automatic multivariate prediction (STAMP) and short text online microblogging protocol (STOMP). The proposed algorithm caches the output in an easy-to-access fashion for single- and multidimensional data. The proposed NMP algorithm can be used on large data sets and generates approximate solutions of high quality in a reasonable time. The proposed NMP can also handle several data mining tasks. It is implemented on a Python platform. To determine its effectiveness, it is compared with the state-of-the-art matrix profile algorithms, i.e., STAMP and STOMP. The results confirm that the proposed NMP provides higher accuracy than the compared algorithms.
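
For readers unfamiliar with the all-pairs similarity search problem named in this abstract, a brute-force baseline can be sketched as follows. This is not NMP, STAMP, or STOMP: it is a quadratic-time illustration of what a matrix profile records, namely the z-normalized Euclidean distance from each subsequence to its nearest non-overlapping neighbor. The function names, the exclusion-zone width of `m // 2`, and the sample series are assumptions made for the example.

```python
import math


def znorm(seq):
    """Z-normalize a subsequence; a constant subsequence maps to all zeros."""
    mean = sum(seq) / len(seq)
    std = math.sqrt(sum((x - mean) ** 2 for x in seq) / len(seq))
    if std == 0:
        return [0.0] * len(seq)
    return [(x - mean) / std for x in seq]


def naive_matrix_profile(series, m):
    """Distance from every length-m subsequence to its nearest neighbor.

    Brute force, O(n^2 * m); subsequences within m // 2 positions of each
    other are skipped so a window is not matched with itself or a near copy.
    """
    subs = [znorm(series[i:i + m]) for i in range(len(series) - m + 1)]
    exclusion = m // 2  # assumed exclusion-zone width
    profile = []
    for i, a in enumerate(subs):
        best = math.inf
        for j, b in enumerate(subs):
            if abs(i - j) <= exclusion:
                continue
            best = min(best, math.dist(a, b))
        profile.append(best)
    return profile


# Made-up series: a regular 0/1 pattern with one spike at index 9.
series = [0, 1] * 4 + [0, 5] + [0, 1] * 4
profile = naive_matrix_profile(series, m=4)
anomaly_start = profile.index(max(profile))
```

Subsequences that repeat elsewhere in the series get a profile value near zero, while a subsequence with no close match gets a large value, which is how a matrix profile flags candidate anomalies.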

Session details: Data streams & time-series data

Proceedings of the 2010 international conference on Management of data - SIGMOD '10 ◽

10.1145/3254333 ◽

2010 ◽

Author(s):

Alex Labrinidis

Keyword(s):

Time Series ◽

Data Streams ◽

Time Series Data ◽

Series Data
