Edge4TSC: Binary Distribution Tree-Enabled Time Series Classification in Edge Environment

Chao Ma; Xiaochuan Shi; Wei Li; Weiping Zhu

doi:10.3390/s20071908

Edge4TSC: Binary Distribution Tree-Enabled Time Series Classification in Edge Environment

Sensors ◽

10.3390/s20071908 ◽

2020 ◽

Vol 20 (7) ◽

pp. 1908

Author(s):

Chao Ma ◽

Xiaochuan Shi ◽

Wei Li ◽

Weiping Zhu

Keyword(s):

Time Series ◽

Deep Learning ◽

Classification Accuracy ◽

Time Series Data ◽

Series Representation ◽

Series Data ◽

Feature Engineering ◽

Time Series Classification ◽

Binary Distribution ◽

New Time

In the past decade, time series data have been generated from various fields at a rapid speed, which offers a huge opportunity for mining valuable knowledge. As a typical task of time series mining, Time Series Classification (TSC) has attracted lots of attention from both researchers and domain experts due to its broad applications ranging from human activity recognition to smart city governance. Specifically, there is an increasing requirement for performing classification tasks on diverse types of time series data in a timely manner without costly hand-crafting feature engineering. Therefore, in this paper, we propose a framework named Edge4TSC that allows time series to be processed in the edge environment, so that the classification results can be instantly returned to the end-users. Meanwhile, to get rid of the costly hand-crafting feature engineering process, deep learning techniques are applied for automatic feature extraction, which shows competitive or even superior performance compared to state-of-the-art TSC solutions. However, because time series presents complex patterns, even deep learning models are not capable of achieving satisfactory classification accuracy, which motivated us to explore new time series representation methods to help classifiers further improve the classification accuracy. In the proposed framework Edge4TSC, by building the binary distribution tree, a new time series representation method was designed for addressing the classification accuracy concern in TSC tasks. By conducting comprehensive experiments on six challenging time series datasets in the edge environment, the potential of the proposed framework for its generalization ability and classification accuracy improvement is firmly validated with a number of helpful insights.

Time Series Classification with Discrete Wavelet Transformed Data

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194016400088 ◽

2016 ◽

Vol 26 (09n10) ◽

pp. 1361-1377 ◽

Cited By ~ 4

Author(s):

Daoyuan Li ◽

Tegawende F. Bissyande ◽

Jacques Klein ◽

Yves Le Traon

Keyword(s):

Time Series ◽

Classification Accuracy ◽

Wavelet Transforms ◽

Time Series Data ◽

Knowledge Engineering ◽

Series Data ◽

Discrete Wavelet ◽

Time Series Classification ◽

Time Series Mining ◽

Compressed Data

Time series mining has become essential for extracting knowledge from the abundant data that flows out from many application domains. To overcome storage and processing challenges in time series mining, compression techniques are being used. In this paper, we investigate the loss/gain of performance of time series classification approaches when fed with lossy-compressed data. This extended empirical study is essential for reassuring practitioners, but also for providing more insights on how compression techniques can even be effective in smoothing and reducing noise in time series data. From a knowledge engineering perspective, we show that time series may be compressed by 90% using discrete wavelet transforms and still achieve remarkable classification accuracy, and that residual details left by popular wavelet compression techniques can sometimes even help to achieve higher classification accuracy than the raw time series data, as they better capture essential local features.

Hexadecimal Aggregate Approximation Representation and Classification of Time Series Data

Algorithms ◽

10.3390/a14120353 ◽

2021 ◽

Vol 14 (12) ◽

pp. 353

Author(s):

Zhenwen He ◽

Chunfeng Zhang ◽

Xiaogang Ma ◽

Gang Liu

Keyword(s):

Time Series ◽

Classification Accuracy ◽

Euclidean Distance ◽

Time Series Data ◽

Series Representation ◽

Series Data ◽

General Representation ◽

Symbolic Aggregate Approximation ◽

Space Cost

Time series data are widely found in finance, health, environmental, social, mobile and other fields. A large amount of time series data has been produced due to the general use of smartphones, various sensors, RFID and other internet devices. How a time series is represented is key to the efficient and effective storage and management of time series data, as well as being very important to time series classification. Two new time series representation methods, Hexadecimal Aggregate approXimation (HAX) and Point Aggregate approXimation (PAX), are proposed in this paper. The two methods represent each segment of a time series as a transformable interval object (TIO). Then, each TIO is mapped to a spatial point located on a two-dimensional plane. Finally, the HAX maps each point to a hexadecimal digit so that a time series is converted into a hex string. The experimental results show that HAX has higher classification accuracy than Symbolic Aggregate approXimation (SAX) but a lower one than some SAX variants (SAX-TD, SAX-BD). The HAX has the same space cost as SAX but is lower than these variants. The PAX has higher classification accuracy than HAX and is extremely close to the Euclidean distance (ED) measurement; however, the space cost of PAX is generally much lower than the space cost of ED. HAX and PAX are general representation methods that can also support geoscience time series clustering, indexing and query except for classification.

TapNet: Multivariate Time Series Classification with Attentional Prototypical Network

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6165 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6845-6852 ◽

Cited By ~ 3

Author(s):

Xuchao Zhang ◽

Yifeng Gao ◽

Jessica Lin ◽

Chang-Tien Lu

Keyword(s):

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Multivariate Time Series ◽

Feature Representation ◽

Series Data ◽

Time Series Classification ◽

Random Group ◽

Low Dimensional ◽

Low Dimensional Features

With the advance of sensor technologies, the Multivariate Time Series classification (MTSC) problem, perhaps one of the most essential problems in the time series data mining domain, has continuously received a significant amount of attention in recent decades. Traditional time series classification approaches based on Bag-of-Patterns or Time Series Shapelet have difficulty dealing with the huge amounts of feature candidates generated in high-dimensional multivariate data but have promising performance even when the training set is small. In contrast, deep learning based methods can learn low-dimensional features efficiently but suffer from a shortage of labelled data. In this paper, we propose a novel MTSC model with an attentional prototype network to take the strengths of both traditional and deep learning based approaches. Specifically, we design a random group permutation method combined with multi-layer convolutional networks to learn the low-dimensional features from multivariate time series data. To handle the issue of limited training labels, we propose a novel attentional prototype network to train the feature representation based on their distance to class prototypes with inadequate data labels. In addition, we extend our model into its semi-supervised setting by utilizing the unlabeled data. Extensive experiments on 18 datasets in a public UEA Multivariate time series archive with eight state-of-the-art baseline methods exhibit the effectiveness of the proposed model.

Load forecasting of refrigerated display cabinet based on CEEMD–IPSO–LSTM combined model

Open Physics ◽

10.1515/phys-2021-0043 ◽

2021 ◽

Vol 19 (1) ◽

pp. 360-374

Author(s):

Yuan Pei ◽

Lei Zhenglin ◽

Zeng Qinghui ◽

Wu Yixiao ◽

Lu Yanli ◽

...

Keyword(s):

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Load Forecasting ◽

Series Data ◽

Forecasting Model ◽

Combined Model ◽

Forecasting Accuracy ◽

Forecasting Method ◽

Consumption Reduction

Abstract The load of the showcase is a nonlinear and unstable time series data, and the traditional forecasting method is not applicable. Deep learning algorithms are introduced to predict the load of the showcase. Based on the CEEMD–IPSO–LSTM combination algorithm, this paper builds a refrigerated display cabinet load forecasting model. Compared with the forecast results of other models, it finally proves that the CEEMD–IPSO–LSTM model has the highest load forecasting accuracy, and the model’s determination coefficient is 0.9105, which is obviously excellent. Compared with other models, the model constructed in this paper can predict the load of showcases, which can provide a reference for energy saving and consumption reduction of display cabinet.

Deep Learning for Anomaly Detection in Time-Series Data: Review, Analysis, and Guidelines

IEEE Access ◽

10.1109/access.2021.3107975 ◽

2021 ◽

Vol 9 ◽

pp. 120043-120065

Author(s):

Kukjin Choi ◽

Jihun Yi ◽

Changhwa Park ◽

Sungroh Yoon

Keyword(s):

Time Series ◽

Deep Learning ◽

Anomaly Detection ◽

Time Series Data ◽

Series Data ◽

Review Analysis

Implementation of IoT Framework with Data Analysis Using Deep Learning Methods for Occupancy Prediction in a Building

Future Internet ◽

10.3390/fi13030067 ◽

2021 ◽

Vol 13 (3) ◽

pp. 67

Author(s):

Eric Hitimana ◽

Gaurav Bajpai ◽

Richard Musabe ◽

Louis Sibomana ◽

Jayavel Kayalvizhi

Keyword(s):

Machine Learning ◽

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Multivariate Time Series ◽

Machine Learning Algorithms ◽

Series Data ◽

Support Vector ◽

Human Beings ◽

Feed Forward Network

Many countries worldwide face challenges in controlling building incidence prevention measures for fire disasters. The most critical issues are the localization, identification, detection of the room occupant. Internet of Things (IoT) along with machine learning proved the increase of the smartness of the building by providing real-time data acquisition using sensors and actuators for prediction mechanisms. This paper proposes the implementation of an IoT framework to capture indoor environmental parameters for occupancy multivariate time-series data. The application of the Long Short Term Memory (LSTM) Deep Learning algorithm is used to infer the knowledge of the presence of human beings. An experiment is conducted in an office room using multivariate time-series as predictors in the regression forecasting problem. The results obtained demonstrate that with the developed system it is possible to obtain, process, and store environmental information. The information collected was applied to the LSTM algorithm and compared with other machine learning algorithms. The compared algorithms are Support Vector Machine, Naïve Bayes Network, and Multilayer Perceptron Feed-Forward Network. The outcomes based on the parametric calibrations demonstrate that LSTM performs better in the context of the proposed application.

Applications of Anomaly Detection Using Deep Learning on Time Series Data

2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech) ◽

10.1109/dasc/picom/datacom/cyberscitec.2018.00078 ◽

2018 ◽

Cited By ~ 2

Author(s):

Van Quan Nguyen ◽

Linh Van Ma ◽

Jin-young Kim ◽

Kwangki Kim ◽

Jinsul Kim

Keyword(s):

Time Series ◽

Deep Learning ◽

Anomaly Detection ◽

Time Series Data ◽

Series Data

Outlier Detection Using Convolutional Neural Network for Wireless Sensor Network

International Journal of Business Data Communications and Networking ◽

10.4018/ijbdcn.286705 ◽

2021 ◽

Vol 17 (2) ◽

pp. 0-0

Keyword(s):

Neural Network ◽

Time Series ◽

Deep Learning ◽

Wireless Sensor Network ◽

Convolutional Neural Network ◽

Sensor Network ◽

Time Series Data ◽

Wireless Sensor ◽

Series Data ◽

Detection Accuracy

Over the recent years, the term deep learning has been considered as one of the primary choice for handling huge amount of data. Having deeper hidden layers, it surpasses classical methods for detection of outlier in wireless sensor network. The Convolutional Neural Network (CNN) is a biologically inspired computational model which is one of the most popular deep learning approaches. It comprises neurons that self-optimize through learning. EEG generally known as Electroencephalography is a tool used for investigation of brain function and EEG signal gives time-series data as output. In this paper, we propose a state-of-the-art technique designed by processing the time-series data generated by the sensor nodes stored in a large dataset into discrete one-second frames and these frames are projected onto a 2D map images. A convolutional neural network (CNN) is then trained to classify these frames. The result improves detection accuracy and encouraging.

Discriminate Supervised Weighted Scheme for the Classification of Time Series Signals

International Journal of Sociotechnology and Knowledge Development ◽

10.4018/ijskd.2021070101 ◽

2021 ◽

Vol 13 (3) ◽

pp. 1-16

Author(s):

Elangovan Ramanujam ◽

S. Padmavathi

Keyword(s):

Time Series ◽

Time Series Data ◽

State Of The Art ◽

Statistical Significance ◽

Series Data ◽

Bag Of Words ◽

Time Series Classification ◽

Problem Of Time ◽

Weighted Matrix

Innovations and applicability of time series data mining techniques have significantly increased the researchers' interest in the problem of time series classification. Several algorithms have been proposed for this purpose categorized under shapelet, interval, motif, and whole series-based techniques. Among this, the bag-of-words technique, an extensive application of the text mining approach, performs well due to its simplicity and effectiveness. To extend the efficiency of the bag-of-words technique, this paper proposes a discriminate supervised weighted scheme to identify the characteristic and representative pattern of a class for efficient classification. This paper uses a modified weighted matrix that discriminates the representative and non-representative pattern which enables the interpretability in classification. Experimentation has been carried out to compare the performance of the proposed technique with state-of-the-art techniques in terms of accuracy and statistical significance.

Corporation financial distress prediction with deep learning: analysis of public listed companies in Malaysia

Business Process Management Journal ◽

10.1108/bpmj-06-2020-0273 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Zulkifli Halim ◽

Shuhaida Mohamed Shuhidan ◽

Zuraidah Mohd Sanusi

Keyword(s):

Time Series ◽

Deep Learning ◽

Financial Distress ◽

Time Series Data ◽

Series Data ◽

Learning Models ◽

Content Type ◽

Financial Distress Prediction ◽

Distress Prediction ◽

Gated Recurrent Unit

PurposeIn the previous study of financial distress prediction, deep learning techniques performed better than traditional techniques over time-series data. This study investigates the performance of deep learning models: recurrent neural network, long short-term memory and gated recurrent unit for the financial distress prediction among the Malaysian public listed corporation over the time-series data. This study also compares the performance of logistic regression, support vector machine, neural network, decision tree and the deep learning models on single-year data.Design/methodology/approachThe data used are the financial data of public listed companies that been classified as PN17 status (distress) and non-PN17 (not distress) in Malaysia. This study was conducted using machine learning library of Python programming language.FindingsThe findings indicate that all deep learning models used for this study achieved 90% accuracy and above with long short-term memory (LSTM) and gated recurrent unit (GRU) getting 93% accuracy. In addition, deep learning models consistently have good performance compared to the other models over single-year data. The results show LSTM and GRU getting 90% and recurrent neural network (RNN) 88% accuracy. The results also show that LSTM and GRU get better precision and recall compared to RNN. The findings of this study show that the deep learning approach will lead to better performance in financial distress prediction studies. To be added, time-series data should be highlighted in any financial distress prediction studies since it has a big impact on credit risk assessment.Research limitations/implicationsThe first limitation of this study is the hyperparameter tuning only applied for deep learning models. Secondly, the time-series data are only used for deep learning models since the other models optimally fit on single-year data.Practical implicationsThis study proposes recommendations that deep learning is a new approach that will lead to better performance in financial distress prediction studies. Besides that, time-series data should be highlighted in any financial distress prediction studies since the data have a big impact on the assessment of credit risk.Originality/valueTo the best of authors' knowledge, this article is the first study that uses the gated recurrent unit in financial distress prediction studies based on time-series data for Malaysian public listed companies. The findings of this study can help financial institutions/investors to find a better and accurate approach for credit risk assessment.