SAX-Based Representation with Longest Common Subsequence Dissimilarity Measure for Time Series Data Classification

Abstract: Mining time series data is of great significance in many areas. To find representative patterns in such data efficiently, this article focuses on defining a valid dissimilarity measure and on accelerating partitioning clustering, a common group of techniques used to discover typical shapes of time series. The dissimilarity measure is a crucial component of clustering, and some applications require it to be invariant to specific transformations. The rationale for using the angle between two time series to define a dissimilarity is analyzed. Moreover, the proposed measure satisfies the triangle inequality under specific restrictions, a property that can be employed to accelerate clustering. An integrated algorithm is proposed. The experiments show that the angle-based dissimilarity captures the essence of time series patterns that are invariant to amplitude scaling. In addition, the accelerated algorithm outperforms the standard one because redundancies are pruned. The approach has been applied to discover typical patterns of information diffusion in an online social network, and the analyses revealed the formation mechanisms of different patterns.
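The amplitude-scaling invariance mentioned in this abstract can be illustrated with a minimal sketch. It assumes the dissimilarity is simply the angle between two equal-length series treated as vectors; the abstract does not give the exact definition or the restrictions under which the triangle inequality holds, so the function name `angle_dissimilarity` and its details are hypothetical.

```python
import math


def angle_dissimilarity(x, y):
    """Angle in radians between two equal-length series viewed as vectors.

    Assumption: this is a plain vector angle, not necessarily the exact
    measure defined in the paper.
    """
    if len(x) != len(y):
        raise ValueError("series must have the same length")
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0.0 or norm_y == 0.0:
        raise ValueError("angle is undefined for an all-zero series")
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    cosine = max(-1.0, min(1.0, dot / (norm_x * norm_y)))
    return math.acos(cosine)


if __name__ == "__main__":
    base = [1.0, 2.0, 3.0]
    scaled = [2.0, 4.0, 6.0]   # same shape, twice the amplitude
    other = [3.0, 2.0, 1.0]    # different shape
    print(angle_dissimilarity(base, scaled))
    print(angle_dissimilarity(base, other))
```

Multiplying a series by a positive constant leaves its angle to any other series unchanged, so the first value printed is close to zero while the second is clearly larger.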

On privacy-preserving time series data classification

International Journal of Data Mining Modelling and Management ◽

10.1504/ijdmmm.2010.032145 ◽

2010 ◽

Vol 2 (2) ◽

pp. 117

Author(s):

Ye Zhu ◽

Yongjian Fu ◽

Huirong Fu

Keyword(s):

Time Series ◽

Time Series Data ◽

Data Classification ◽

Privacy Preserving ◽

Series Data

Shape-based Representation and Abstraction of Time Series Data along with a Dynamic Time Shape Wrapping as a Dissimilarity Measure

10.23919/icac50006.2021.9594127 ◽

2021 ◽

Author(s):

Fatma Ezzahra Gmati ◽

Salem Chakhar ◽

Wided Lejouad Chaari ◽

Mark Xu

Keyword(s):

Time Series ◽

Time Series Data ◽

Dissimilarity Measure ◽

Series Data ◽

Time Shape ◽

Dynamic Time

Time-Series Data Classification and Analysis Associated With Machine Learning Algorithms for Cognitive Perception and Phenomenon

IEEE Access ◽

10.1109/access.2020.3018477 ◽

2020 ◽

Vol 8 ◽

pp. 222417-222428

Author(s):

Taikyeong Jeong

Keyword(s):

Machine Learning ◽

Time Series ◽

Time Series Data ◽

Learning Algorithms ◽

Data Classification ◽

Machine Learning Algorithms ◽

Series Data

Clustering Methodology for Time Series Mining

Scientific Journal of Riga Technical University Computer Sciences ◽

10.2478/v10143-010-0011-0 ◽

2009 ◽

Vol 40 (1) ◽

pp. 81-86

Author(s):

Pēteris Grabusts ◽

Arkady Borisov

Keyword(s):

Time Series ◽

Time Series Analysis ◽

Clustering Algorithm ◽

Time Series Data ◽

Similarity Measures ◽

Longest Common Subsequence ◽

Series Data ◽

Time Series Clustering ◽

Series Analysis ◽

Time Series Mining

A time series is a sequence of real-valued data representing measurements of a variable at successive time intervals. Time series analysis is a well-known task; in recent years, however, research has explored the use of clustering for time series analysis. The main motivation for representing a time series in the form of clusters is to better capture the main characteristics of the data. The central goal of this paper was to investigate clustering methodology for time series data mining, to explore the capabilities of time series similarity measures, and to use them in the analysis of time series clustering results. More complex similarity measures include the Longest Common Subsequence (LCSS) method. Two tasks were completed in this paper. The first was to define time series similarity measures; it was established that the LCSS method detects time series similarity better than the Euclidean distance. The second was to explore the capabilities of the classical k-means clustering algorithm in time series clustering. The experiment led to the conclusion that the results of time series clustering with the k-means algorithm correspond to the results obtained with the LCSS method, so the clustering results for the specific time series are adequate.
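The comparison between LCSS and the Euclidean distance can be made concrete with a small sketch. It assumes a common formulation of LCSS for real-valued series, in which two points match when they differ by at most a tolerance `eps`; the abstract does not state the paper's exact variant, so the function names and the normalization by the shorter length are illustrative assumptions.

```python
import math


def lcss_length(a, b, eps):
    """Length of the longest common subsequence of two real-valued series.

    Two points are treated as matching when |a[i] - b[j]| <= eps
    (an assumed matching rule, not necessarily the paper's).
    """
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if abs(a[i - 1] - b[j - 1]) <= eps:
                cur[j] = prev[j - 1] + 1
            else:
                cur[j] = max(prev[j], cur[j - 1])
        prev = cur
    return prev[len(b)]


def lcss_similarity(a, b, eps):
    """LCSS length normalized by the shorter series, giving a value in [0, 1]."""
    if not a or not b:
        return 0.0
    return lcss_length(a, b, eps) / min(len(a), len(b))


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


if __name__ == "__main__":
    clean = [1.0, 2.0, 3.0, 4.0, 5.0]
    spiked = [1.0, 2.0, 100.0, 4.0, 5.0]  # same shape with one outlier
    print(lcss_similarity(clean, spiked, eps=0.5))
    print(euclidean_distance(clean, spiked))
```

In this example a single outlier dominates the Euclidean distance, whereas LCSS simply skips the unmatched point, which is one plausible reason a study might find LCSS more robust.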

Time Series Data Classification Using Discriminative Interpolation with Sparsity

2017 International Conference on Computational Science and Computational Intelligence (CSCI) ◽

10.1109/csci.2017.79 ◽

2017 ◽

Cited By ~ 1

Author(s):

Nenad Mijatovic ◽

Rana Haber ◽

Anand Rangarajan ◽

Anthony O. Smith ◽

Adrian M. Peter

Keyword(s):

Time Series ◽

Time Series Data ◽

Data Classification ◽

Series Data

Feature Selection Method for Multivariate Time Series Data Classification

Journal of the Korean Institute of Industrial Engineers ◽

10.7232/jkiie.2017.43.6.413 ◽

2017 ◽

Vol 43 (6) ◽

pp. 413-421

Author(s):

Gilseung Ahn ◽

Hwanchul Lee ◽

Sun Hur

Keyword(s):

Time Series ◽

Feature Selection ◽

Time Series Data ◽

Multivariate Time Series ◽

Feature Selection Method ◽

Data Classification ◽

Selection Method ◽

Series Data

Data Balanced Bagging Ensemble of Convolutional-LSTM Neural Networks for Time Series Data Classification with an Imbalanced Dataset

2021 IEEE International Symposium on Circuits and Systems (ISCAS) ◽

10.1109/iscas51556.2021.9401389 ◽

2021 ◽

Author(s):

Matthew Ward ◽

Kevin Malmsten ◽

Hassan Salamy ◽

Cheol-Hong Min

Keyword(s):

Neural Networks ◽

Time Series ◽

Time Series Data ◽

Data Classification ◽

Series Data ◽

Imbalanced Dataset ◽

Convolutional Lstm ◽

Bagging Ensemble

An IoT Time Series Data Security Model for Adversarial Attack Based on Thermometer Encoding

Security and Communication Networks ◽

10.1155/2021/5537041 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Zhongguo Yang ◽

Irshad Ahmed Abbasi ◽

Fahad Algarni ◽

Sikandar Ali ◽

Mingzhu Zhang

Keyword(s):

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Data Classification ◽

Classification Model ◽

Series Data ◽

Security Model ◽

Learning Methods ◽

Model Based ◽

Adversarial Attack

Nowadays, an Internet of Things (IoT) device consists of algorithms, datasets, and models. Owing to the good performance of deep learning methods, many devices integrate well-trained models. IoT empowers users to communicate with and control physical devices to obtain vital information. However, these models are vulnerable to adversarial attacks, which pose considerable risks to the normal application of deep learning methods. For instance, very small changes, even to a single point in the IoT time-series data, could lead to unreliable or wrong decisions. Moreover, these changes could be deliberately generated by following an adversarial attack strategy. We propose a robust IoT data classification model based on an encode-decode joint training model. Furthermore, thermometer encoding is applied as a nonlinear transformation to the original training examples, which are then used to reconstruct the original time series examples through the encode-decode model. The ResNet model trained on the reconstructed examples is more robust to adversarial attack. Experiments show that the trained model can resist the fast gradient sign method (FGSM) attack to some extent and improves the security of the time series data classification model.
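Thermometer encoding, used in this abstract as a nonlinear transformation, can be sketched as follows. The sketch assumes one common convention: a value is clipped to a range `[lo, hi]`, the range is split into equal-width levels, and every level threshold the value reaches is set to 1. The function name, the parameters, and the threshold convention are assumptions for illustration; the abstract does not specify the paper's exact scheme.

```python
def thermometer_encode(value, levels, lo=0.0, hi=1.0):
    """Encode a scalar as a list of 0/1 flags, one per quantization level.

    Flag k is 1 when the clipped value reaches the (k + 1)-th threshold,
    so the ones always form a contiguous run from the left.
    """
    if levels < 1:
        raise ValueError("levels must be at least 1")
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    clipped = min(max(value, lo), hi)
    step = (hi - lo) / levels
    return [1 if clipped >= lo + (k + 1) * step else 0 for k in range(levels)]


def encode_series(series, levels, lo=0.0, hi=1.0):
    """Apply thermometer encoding to every point of a time series."""
    return [thermometer_encode(v, levels, lo, hi) for v in series]


if __name__ == "__main__":
    original = [0.10, 0.30, 0.80]
    perturbed = [0.11, 0.31, 0.79]  # small perturbation of each point
    print(encode_series(original, levels=4))
    print(encode_series(perturbed, levels=4))
```

Because the encoding is a step function, perturbations that do not cross a threshold leave the code unchanged, which gives an intuition for why such a transformation might blunt small adversarial changes.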

A New Similarity Measure for Time Series Data Mining Based on Longest Common Subsequence

Efficient Time Series Clustering and Its Application to Social Network Mining
