scholarly journals A Visual Analytics Framework for Reviewing Multivariate Time-Series Data with Dimensionality Reduction

Author(s):  
Takanori Fujiwara ◽  
Shilpika ◽  
Naohisa Sakamoto ◽  
Jorji Nonaka ◽  
Keiji Yamamoto ◽  
...  
2021 ◽  
Author(s):  
◽  
Mohammed Ali

In this thesis, we focus on time-series data, which is commonly used by domain experts in different domains to explore and understand phenomena or behaviors under consideration, as-sisting them in making decisions, predicting the future or solving problems. Utilizing sensor devices is one of the common ways of collecting time-series data. These devices collect large volumes of raw data, including multi-dimensional time-series data, and each value is associated with the time-stamp corresponding to when it was recorded. However, finding interesting pat-terns or behaviors in a large amount of data is not simple due to the nature of the data and other challenges related to its size and scalability, high dimensionality, complexity, representation, and unique structure.Researchers tend to use time-series chart visualization, which is usually unsuitable because of the small screen resolution which cannot accommodate the large size of the data. Hence, occlusion and overplotting issues occur, limiting or complicating the exploration and analysis tasks. Another challenge concerns the labeling of patterns in large time-series data, which is time-consuming and requires a great deal of expert knowledge.These issues are addressed in this thesis to improve the exploration, analysis and presen-tation of time-series data and enable users to gain insights into large and multi-dimensional time-series datasets using a combination of dimensionality reduction techniques and interac-tive visual methods. The provided solutions will help researchers from various domains who deal with large and multi-dimensional time-series data to efficiently explore and analyze such data with little effort and in record time.Initially, we explore the area of integration between machine learning algorithms and inter-active visualization techniques for exploring and understanding time-series data, specifically looking at clustering and classification for time-series data in visual analytics. The survey is considered to be a valuable guide for both new researchers and experts in the emerging field of integrating machine learning algorithms into visual analytics.Next, we present a novel approach that aims to explore, analyze, and present large temporal datasets through one image. The proposed approach uses a sliding window and dimensionality reduction techniques to depict a large time-series data as points into a 2D scatter plot. The approach provides novel solutions to many pattern discovery issues and can deal with both univariate and multivariate time-series data.Following this, our proposed approach is combined with both visualization and interaction techniques into one system called TimeCluster, which is a visual analytics tool allowing users to visualize, explore and interact with large time-series data. The system addresses different issues such as anomaly detection, the discovery of frequent patterns, and the labeling of in-teresting patterns in large time-series data all in a single system. We deploy our system with different time-series datasets and report real-world case studies of its utility.Later, the linkage between the 1D view (time-series chart) to the 2D view of the 2D embed-ding of time-series data, and parallel interactions such as selection and labeling, are employed to explore and examine the effectiveness of recent developments in machine learning and di-mension reduction in the context of time-series data exploration. We design a user study to evaluate and validate the effectiveness of the linkage between both a 1D and 2D visualization, and how their fitness in the context of projecting time-series data is, where different dimen-sionality reduction techniques are examined, evaluated and compared within our experimental setting.Lastly, we conclude our findings and outline possible areas for future work.


Water ◽  
2021 ◽  
Vol 13 (12) ◽  
pp. 1633
Author(s):  
Elena-Simona Apostol ◽  
Ciprian-Octavian Truică ◽  
Florin Pop ◽  
Christian Esposito

Due to the exponential growth of the Internet of Things networks and the massive amount of time series data collected from these networks, it is essential to apply efficient methods for Big Data analysis in order to extract meaningful information and statistics. Anomaly detection is an important part of time series analysis, improving the quality of further analysis, such as prediction and forecasting. Thus, detecting sudden change points with normal behavior and using them to discriminate between abnormal behavior, i.e., outliers, is a crucial step used to minimize the false positive rate and to build accurate machine learning models for prediction and forecasting. In this paper, we propose a rule-based decision system that enhances anomaly detection in multivariate time series using change point detection. Our architecture uses a pipeline that automatically manages to detect real anomalies and remove the false positives introduced by change points. We employ both traditional and deep learning unsupervised algorithms, in total, five anomaly detection and five change point detection algorithms. Additionally, we propose a new confidence metric based on the support for a time series point to be an anomaly and the support for the same point to be a change point. In our experiments, we use a large real-world dataset containing multivariate time series about water consumption collected from smart meters. As an evaluation metric, we use Mean Absolute Error (MAE). The low MAE values show that the algorithms accurately determine anomalies and change points. The experimental results strengthen our assumption that anomaly detection can be improved by determining and removing change points as well as validates the correctness of our proposed rules in real-world scenarios. Furthermore, the proposed rule-based decision support systems enable users to make informed decisions regarding the status of the water distribution network and perform effectively predictive and proactive maintenance.


2021 ◽  
Vol 13 (3) ◽  
pp. 67
Author(s):  
Eric Hitimana ◽  
Gaurav Bajpai ◽  
Richard Musabe ◽  
Louis Sibomana ◽  
Jayavel Kayalvizhi

Many countries worldwide face challenges in controlling building incidence prevention measures for fire disasters. The most critical issues are the localization, identification, detection of the room occupant. Internet of Things (IoT) along with machine learning proved the increase of the smartness of the building by providing real-time data acquisition using sensors and actuators for prediction mechanisms. This paper proposes the implementation of an IoT framework to capture indoor environmental parameters for occupancy multivariate time-series data. The application of the Long Short Term Memory (LSTM) Deep Learning algorithm is used to infer the knowledge of the presence of human beings. An experiment is conducted in an office room using multivariate time-series as predictors in the regression forecasting problem. The results obtained demonstrate that with the developed system it is possible to obtain, process, and store environmental information. The information collected was applied to the LSTM algorithm and compared with other machine learning algorithms. The compared algorithms are Support Vector Machine, Naïve Bayes Network, and Multilayer Perceptron Feed-Forward Network. The outcomes based on the parametric calibrations demonstrate that LSTM performs better in the context of the proposed application.


2018 ◽  
Vol 15 (147) ◽  
pp. 20180695 ◽  
Author(s):  
Simone Cenci ◽  
Serguei Saavedra

Biotic interactions are expected to play a major role in shaping the dynamics of ecological systems. Yet, quantifying the effects of biotic interactions has been challenging due to a lack of appropriate methods to extract accurate measurements of interaction parameters from experimental data. One of the main limitations of existing methods is that the parameters inferred from noisy, sparsely sampled, nonlinear data are seldom uniquely identifiable. That is, many different parameters can be compatible with the same dataset and can generalize to independent data equally well. Hence, it is difficult to justify conclusive assertions about the effect of biotic interactions without information about their associated uncertainty. Here, we develop an ensemble method based on model averaging to quantify the uncertainty associated with the effect of biotic interactions on community dynamics from non-equilibrium ecological time-series data. Our method is able to detect the most informative time intervals for each biotic interaction within a multivariate time series and can be easily adapted to different regression schemes. Overall, this novel approach can be used to associate a time-dependent uncertainty with the effect of biotic interactions. Moreover, because we quantify uncertainty with minimal assumptions about the data-generating process, our approach can be applied to any data for which interactions among variables strongly affect the overall dynamics of the system.


Author(s):  
Jochen Garcke ◽  
Rodrigo Iza-Teran ◽  
Marvin Marks ◽  
Mandar Pathare ◽  
Dirk Schollbach ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document