Efficient Subsequence Search on Streaming Data Based on Time Warping Distance

Author(s):  
Sura Rodpongpun ◽  
Vit Niennattrakul ◽  
Chotirat Ann Ratanamahatana

Many algorithms have been proposed to deal with the subsequence similarity search problem in time series data streams. Dynamic Time Warping (DTW), widely accepted as the best distance measure for time series similarity search, has been used in many research works. SPRING and its variants were proposed to solve this problem by mitigating the complexity of DTW. Unfortunately, these algorithms produce meaningless results, since no normalization is applied before the distance calculation. Recently, GPUs and FPGAs have been used for similarity search with subsequence normalization to reduce the computational complexity, but such solutions are still far from practical use. In this work, we propose a novel Meaningful Subsequence Matching (MSM) algorithm which produces meaningful results in subsequence matching by considering a global constraint, uniform scaling, and normalization. Our method significantly outperforms the existing algorithms in terms of both computational cost and accuracy.
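A minimal, un-optimized sketch of the kind of normalized, globally constrained subsequence matching the abstract argues for (an illustration only, not the MSM algorithm itself; the names znormalize, dtw_distance, best_match, and the window parameter are our own assumptions):

```python
import numpy as np

def znormalize(x):
    """Z-normalize a sequence; a constant sequence maps to zeros."""
    std = x.std()
    return (x - x.mean()) / std if std > 0 else np.zeros_like(x)

def dtw_distance(q, c, window):
    """DTW under a Sakoe-Chiba band of half-width `window` (global constraint)."""
    n, m = len(q), len(c)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        lo, hi = max(1, i - window), min(m, i + window)
        for j in range(lo, hi + 1):
            d = (q[i - 1] - c[j - 1]) ** 2
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return np.sqrt(cost[n, m])

def best_match(query, stream, window=10):
    """Slide over the stream, z-normalizing each candidate before DTW, so matches
    are invariant to offset and amplitude (unlike un-normalized SPRING-style search)."""
    q = znormalize(np.asarray(query, dtype=float))
    best = (np.inf, -1)
    for start in range(len(stream) - len(query) + 1):
        c = znormalize(np.asarray(stream[start:start + len(query)], dtype=float))
        d = dtw_distance(q, c, window)
        if d < best[0]:
            best = (d, start)
    return best  # (distance, start index of the best-matching subsequence)
```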

2016 ◽  
Vol 13 (3) ◽  
pp. 26-45 ◽  
Author(s):  
Pengcheng Zhang ◽  
Yan Xiao ◽  
Yuelong Zhu ◽  
Jun Feng ◽  
Dingsheng Wan ◽  
...  

Most time series data mining tasks attempt to discover patterns that appear frequently, and abnormal data is often ignored as noise. Some data mining techniques extract anomalies from time series, but most of them cannot cope with the large, unstable data found in many fields: their key problems are high fitting error after dimension reduction and low accuracy of the mining results. This paper studies an approach for mining abnormal patterns in hydrological time series. The authors propose a new idea to solve the problem of hydrological anomaly mining based on time series: Feature Points Symbolic Aggregate Approximation (FP_SAX) improves the selection of feature points, and Symbol Distance based Dynamic Time Warping (SD_DTW) then measures the distance between the resulting strings. Finally, the generated distances are sorted. A set of dedicated experiments validates the authors' approach; the results show lower fitting error and higher accuracy compared to other approaches.
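A rough sketch of the two building blocks named above, using plain SAX symbolization and a DTW over symbol gaps as stand-ins for FP_SAX and SD_DTW (the helper names, the 4-symbol alphabet, and the per-symbol cost are assumptions, not the authors' code):

```python
import numpy as np

# Gaussian breakpoints for a 4-symbol SAX alphabet (standard SAX values).
BREAKPOINTS = np.array([-0.674, 0.0, 0.674])

def sax_symbolize(series, segments):
    """PAA-reduce a z-normalized series into `segments` pieces, then map each
    piece to a symbol index via the Gaussian breakpoints (plain SAX, not FP_SAX)."""
    x = np.asarray(series, dtype=float)
    x = (x - x.mean()) / (x.std() or 1.0)
    paa = np.array([chunk.mean() for chunk in np.array_split(x, segments)])
    return np.searchsorted(BREAKPOINTS, paa)  # integer symbols 0..3

def symbol_distance_dtw(a, b):
    """DTW over two symbol strings, using the gap between symbol indices as the
    per-step cost (a stand-in for the paper's SD_DTW measure)."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(int(a[i - 1]) - int(b[j - 1]))
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]
```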


2021 ◽  
Vol 5 (1) ◽  
pp. 51
Author(s):  
Enriqueta Vercher ◽  
Abel Rubio ◽  
José D. Bermúdez

We present a new forecasting scheme based on the credibility distribution of fuzzy events. This approach allows us to build prediction intervals from the first differences of the time series data, while the credibility expected value provides the k-step-ahead pointwise forecasts. We analyze the coverage of the prediction intervals and the accuracy of the pointwise forecasts using different credibility approaches based on the upper differences. The comparative results were obtained on yearly time series from the M4 Competition. We also report the performance and computational cost of our proposal compared with automatic forecasting procedures.
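A naive illustration of how first differences can yield both a pointwise forecast and a prediction interval, using empirical quantiles as a stand-in for the credibility distribution described above (the function diff_based_intervals and its defaults are hypothetical, not the authors' method):

```python
import numpy as np

def diff_based_intervals(y, h=1, alpha=0.05):
    """h-step-ahead point forecast and interval from the empirical distribution
    of first differences (an illustrative stand-in for credibility-based intervals)."""
    y = np.asarray(y, dtype=float)
    d = np.diff(y)                      # first differences of the series
    point = y[-1] + h * np.median(d)    # pointwise forecast from the last value
    lo, hi = np.quantile(d, [alpha / 2, 1 - alpha / 2])
    return point, (y[-1] + h * lo, y[-1] + h * hi)

# Example: a yearly series, 3-step-ahead forecast with a nominal 95% interval.
series = [120, 125, 131, 128, 140, 143, 150]
forecast, interval = diff_based_intervals(series, h=3)
```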


2020 ◽  
Vol 10 (12) ◽  
pp. 4124
Author(s):  
Baoquan Wang ◽  
Tonghai Jiang ◽  
Xi Zhou ◽  
Bo Ma ◽  
Fan Zhao ◽  
...  

For the task of time-series data classification (TSC), some methods classify raw time-series (TS) data directly. However, certain sequence features are not evident in the time domain, and the human brain can classify data by extracting visual features from a visualization; some researchers have therefore converted TS data to images and applied image processing methods to TSC. While human perception combines senses from different aspects, existing methods use only sequence features or only visualization features. This paper therefore proposes a framework for TSC based on fusion features (TSC-FF): sequence features extracted from the raw TS and visualization features extracted from Area Graphs converted from the TS. Deep learning methods have proven to be useful tools for automatically learning features from data, so we use a long short-term memory network with an attention mechanism (LSTM-A) to learn sequence features and a convolutional neural network with an attention mechanism (CNN-A) to learn visualization features, in order to imitate the human brain. In addition, we use the simplest visualization method, the Area Graph, for visualization feature extraction, avoiding loss of information and additional computational cost. This article aims to show that learning features from different aspects with deep neural networks and fusing them can replace complex, manually constructed features and remove the bias introduced by hand-designed features, avoiding the limitations of domain knowledge. Experiments on several open data sets show that the framework achieves promising results compared with other methods.
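A compact PyTorch sketch of a two-branch fusion model in the spirit of TSC-FF, with attention pooling over LSTM outputs for sequence features and a small CNN over the rendered Area Graph image for visualization features (the class name FusionTSC, the layer sizes, and the simple additive attention are illustrative assumptions, not the authors' architecture):

```python
import torch
import torch.nn as nn

class FusionTSC(nn.Module):
    """Two-branch fusion sketch: LSTM + attention over the raw series,
    a small CNN over the Area Graph image, and a classifier over the fused features."""
    def __init__(self, hidden=64, n_classes=5):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden, batch_first=True)
        self.attn = nn.Linear(hidden, 1)               # scores each time step
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),     # -> (batch, 32)
        )
        self.classifier = nn.Linear(hidden + 32, n_classes)

    def forward(self, series, image):
        # series: (batch, length, 1); image: (batch, 1, H, W)
        out, _ = self.lstm(series)                     # (batch, length, hidden)
        weights = torch.softmax(self.attn(out), dim=1) # attention over time steps
        seq_feat = (weights * out).sum(dim=1)          # (batch, hidden)
        img_feat = self.cnn(image)                     # (batch, 32)
        return self.classifier(torch.cat([seq_feat, img_feat], dim=1))
```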


Author(s):  
Noura Alghamdi ◽  
Liang Zhang ◽  
Huayi Zhang ◽  
Elke A. Rundensteiner ◽  
Mohamed Y. Eltabakh
