Deep learning for clustering of multivariate clinical patient trajectories with missing values

Abstract Background: Precision medicine requires a stratification of patients by disease presentation that is sufficiently informative to allow for selecting treatments on a per-patient basis. For many diseases, such as neurological disorders, this stratification problem translates into a complex problem of clustering multivariate and relatively short time series because (i) these diseases are multifactorial and not well described by single clinical outcome variables and (ii) disease progression needs to be monitored over time. Additionally, clinical data are often hindered by the presence of many missing values, further complicating any clustering attempts. Findings: The problem of clustering multivariate short time series with many missing values is generally not well addressed in the literature. In this work, we propose a deep learning–based method to address this issue, variational deep embedding with recurrence (VaDER). VaDER relies on a Gaussian mixture variational autoencoder framework, which is further extended to (i) model multivariate time series and (ii) directly deal with missing values. We validated VaDER by accurately recovering clusters from simulated and benchmark data with known ground truth clustering, while varying the degree of missingness. We then used VaDER to successfully stratify patients with Alzheimer disease and patients with Parkinson disease into subgroups characterized by clinically divergent disease progression profiles. Additional analyses demonstrated that these clinical differences reflected known underlying aspects of Alzheimer disease and Parkinson disease. Conclusions: We believe our results show that VaDER can be of great value for future efforts in patient stratification, and multivariate time-series clustering in general.
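One common way for a reconstruction-based model to cope with missing values is to score the reconstruction only on the entries that were actually observed. The sketch below illustrates that idea in plain Python. It is not the VaDER implementation: the function name `masked_mse`, the boolean-mask layout, and the toy numbers are all assumptions made for illustration.

```python
from typing import Sequence


def masked_mse(
    values: Sequence[Sequence[float]],
    reconstruction: Sequence[Sequence[float]],
    is_observed: Sequence[Sequence[bool]],
) -> float:
    """Mean squared error over observed entries only (hypothetical helper)."""
    squared_error = 0.0
    n_observed = 0
    for value_row, recon_row, mask_row in zip(values, reconstruction, is_observed):
        for value, recon, observed in zip(value_row, recon_row, mask_row):
            if observed:  # missing entries contribute nothing to the loss
                squared_error += (value - recon) ** 2
                n_observed += 1
    if n_observed == 0:
        raise ValueError("trajectory has no observed entries")
    return squared_error / n_observed


# One patient, three visits, two clinical variables (toy numbers).
# 0.0 is a placeholder for a missing measurement; it is ignored because
# the matching mask entry is False.
trajectory = [[1.0, 2.0], [0.0, 3.0], [4.0, 0.0]]
decoded = [[1.0, 4.0], [9.0, 3.0], [4.0, 7.0]]
mask = [[True, True], [False, True], [True, False]]
loss = masked_mse(trajectory, decoded, mask)
```

Because the placeholder values never enter the sum, the choice of placeholder does not affect the loss, which is the property that makes this kind of masking attractive when missingness is high.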

Estimation and classification of temporal trends to support integrated ecosystem assessment

ICES Journal of Marine Science ◽

10.1093/icesjms/fsaa111 ◽

2020 ◽

Author(s):

Hiroko Kato Solvang ◽

Benjamin Planque

Keyword(s):

Time Series ◽

Barents Sea ◽

Multivariate Time Series ◽

Marine Ecosystem ◽

Temporal Trends ◽

Dynamic Factor ◽

Short Time Series ◽

Common Trends ◽

Common Trend ◽

Short Time

Abstract We propose a trend estimation and classification (TREC) approach to estimating dominant common trends among multivariate time series observations. Our methods are based on two statistical procedures that include trend modelling and discriminant analysis for classifying similar trend (common trend) classes. We use simulations to evaluate the proposed approach and compare it with a relevant dynamic factor analysis in the time domain, which was recently proposed to estimate common trends in fisheries time series. We apply the TREC approach to the multivariate short time series datasets investigated by the ICES integrated assessment working groups for the Norwegian Sea and the Barents Sea. The proposed approach is robust for application to short time series, and it directly identifies and classifies the dominant trends underlying observations. Based on the classified trend classes, we suggest that communication among stakeholders such as marine managers, industry representatives, non-governmental organizations, and governmental agencies can be enhanced by finding the common tendency between a biological community in a marine ecosystem and the environmental factors, as well as by the icons produced by generalizing common trend patterns.

An Auto Regressive Deep Learning Model for Sales Tax Forecasting from Multiple Short Time Series

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA) ◽

10.1109/icmla.2019.00221 ◽

2019 ◽

Author(s):

Elham Buxton ◽

Kenneth Kriz ◽

Matthew Cremeens ◽

Kim Jay

Keyword(s):

Time Series ◽

Deep Learning ◽

Learning Model ◽

Sales Tax ◽

Short Time Series ◽

Auto Regressive ◽

Short Time ◽

Deep Learning Model

Finding the direction of lowest resilience in multivariate complex systems

Journal of The Royal Society Interface ◽

10.1098/rsif.2019.0629 ◽

2019 ◽

Vol 16 (159) ◽

pp. 20190629 ◽

Cited By ~ 1

Author(s):

Els Weinans ◽

J. Jelle Lever ◽

Sebastian Bathiany ◽

Rick Quax ◽

Jordi Bascompte ◽

...

Keyword(s):

Time Series ◽

Complex Systems ◽

Financial Markets ◽

Multivariate Time Series ◽

Network Systems ◽

Short Time Series ◽

Novel Approach ◽

Data Resolution ◽

Multivariate Systems ◽

Short Time

The dynamics of complex systems, such as ecosystems, financial markets and the human brain, emerge from the interactions of numerous components. We often lack the knowledge to build reliable models for the behaviour of such network systems. This makes it difficult to predict potential instabilities. We show that one could use the natural fluctuations in multivariate time series to reveal network regions with particularly slow dynamics. The multidimensional slowness points to the direction of minimal resilience, in the sense that simultaneous perturbations on this set of nodes will take longest to recover. We compare an autocorrelation-based method with a variance-based method for different time-series lengths, data resolution and different noise regimes. We show that the autocorrelation-based method is less robust for short time series or time series with a low resolution but more robust for varying noise levels. This novel approach may help to identify unstable regions of multivariate systems or to distinguish safe from unsafe perturbations.
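The two indicators compared in this abstract build on familiar single-series statistics: lag-1 autocorrelation and variance. The sketch below computes both for one series in plain Python. It covers only these univariate ingredients and does not reproduce the multivariate direction-finding described in the abstract; the function names and toy series are assumptions made for illustration.

```python
from typing import Sequence


def sample_variance(series: Sequence[float]) -> float:
    """Unbiased sample variance (divides by n - 1)."""
    n = len(series)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = sum(series) / n
    return sum((x - mean) ** 2 for x in series) / (n - 1)


def lag1_autocorrelation(series: Sequence[float]) -> float:
    """Lag-1 autocorrelation using the overall mean of the series."""
    n = len(series)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = sum(series) / n
    denominator = sum((x - mean) ** 2 for x in series)
    if denominator == 0.0:
        raise ValueError("series is constant")
    numerator = sum(
        (series[t] - mean) * (series[t + 1] - mean) for t in range(n - 1)
    )
    return numerator / denominator


# Toy series: one drifts slowly, the other flips sign at every step.
slow = [1.0, 2.0, 3.0, 4.0, 5.0]
fast = [1.0, -1.0, 1.0, -1.0]
slow_ac = lag1_autocorrelation(slow)
fast_ac = lag1_autocorrelation(fast)
slow_var = sample_variance(slow)
```

With only a handful of points, both statistics are noisy estimates, which is consistent with the abstract's concern about robustness for short or low-resolution series.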

Classification of short time series in early Parkinson's disease with deep learning of fuzzy recurrence plots

IEEE/CAA Journal of Automatica Sinica ◽

10.1109/jas.2019.1911774 ◽

2019 ◽

Vol 6 (6) ◽

pp. 1306-1317 ◽

Cited By ~ 4

Author(s):

Tuan D. Pham ◽

Karin Wardell ◽

Anders Eklund ◽

Goran Salerud

Keyword(s):

Time Series ◽

Deep Learning ◽

Recurrence Plots ◽

Short Time Series ◽

Parkinson's Disease ◽

Short Time

A strategy for meta-analysis of short time series microarray datasets

Frontiers in Bioscience ◽

10.2741/3512 ◽

2009 ◽

Vol 14 ◽

pp. 4058 ◽

Cited By ~ 2

Author(s):

Ruping Sun

Keyword(s):

Time Series ◽

Meta Analysis ◽

Short Time Series ◽

Microarray Datasets ◽

Short Time ◽

Time Series Microarray

Identifying bidirectional total and non-linear information flow in functional corticomuscular coupling during a dorsiflexion task: a pilot study

Journal of NeuroEngineering and Rehabilitation ◽

10.1186/s12984-021-00872-w ◽

2021 ◽

Vol 18 (1) ◽

Author(s):

Tie Liang ◽

Qingyu Zhang ◽

Xiaoguang Liu ◽

Bin Dong ◽

Xiuling Liu ◽

...

Keyword(s):

Time Series ◽

Information Flow ◽

Gamma Band ◽

Motor Dysfunction ◽

Beta Band ◽

Short Time Series ◽

Information Interaction ◽

Stroke Group ◽

Short Time ◽

Maximal Information Coefficient

Abstract Background: The key challenge to constructing functional corticomuscular coupling (FCMC) is to accurately identify the direction and strength of the information flow between scalp electroencephalography (EEG) and surface electromyography (SEMG). Traditional transfer entropy (TE) and time-delayed mutual information (TDMI) methods have difficulty in identifying the information interaction for short time series as they tend to rely on long and stable data, so we propose a time-delayed maximal information coefficient (TDMIC) method. With this method, we aim to investigate the directional specificity of bidirectional total and nonlinear information flow on FCMC, and to explore the neural mechanisms underlying motor dysfunction in stroke patients. Methods: We introduced a time-delayed parameter in the maximal information coefficient to capture the direction of information interaction between two time series. We employed the linear and non-linear system model based on short data to verify the validity of our algorithm. We then used the TDMIC method to study the characteristics of total and nonlinear information flow in FCMC during a dorsiflexion task for healthy controls and stroke patients. Results: The simulation results showed that the TDMIC method can better detect the direction of information interaction compared with TE and TDMI methods. For healthy controls, the beta band (14–30 Hz) had higher information flow in FCMC than the gamma band (31–45 Hz). Furthermore, the beta-band total and nonlinear information flow in the descending direction (EEG to EMG) was significantly higher than that in the ascending direction (EMG to EEG), whereas in the gamma band the ascending direction had significantly higher information flow than the descending direction. Additionally, we found that the strong bidirectional information flow mainly acted on Cz, C3, CP3, P3 and CPz. Compared to controls, both the beta- and gamma-band bidirectional total and nonlinear information flows of the stroke group were significantly weaker. There was no significant difference in the direction of beta- and gamma-band information flow in the stroke group. Conclusions: The proposed method could effectively identify the information interaction between short time series. According to our experiment, the beta band mainly passes downward motor control information while the gamma band features upward sensory feedback information delivery. Our observations demonstrate that the center and contralateral sensorimotor cortex play a major role in lower limb motor control. The study further demonstrates that brain damage caused by stroke disrupts the bidirectional information interaction between cortex and effector muscles in the sensorimotor system, leading to motor dysfunction.
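The role of the time delay can be illustrated with a simpler dependence measure than the maximal information coefficient. The sketch below uses a plug-in mutual information estimate on discrete symbols and shifts the target series by a delay, then compares the two directions. It is a stand-in for the general idea only, not the authors' TDMIC method; the function names and the toy binary sequences are assumptions made for illustration.

```python
import math
from collections import Counter
from typing import Hashable, Sequence


def mutual_information(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Plug-in mutual information (in bits) of two equal-length symbol series."""
    if len(a) != len(b) or not a:
        raise ValueError("series must be non-empty and of equal length")
    n = len(a)
    count_a = Counter(a)
    count_b = Counter(b)
    count_ab = Counter(zip(a, b))
    return sum(
        (c / n) * math.log2(c * n / (count_a[u] * count_b[v]))
        for (u, v), c in count_ab.items()
    )


def delayed_mutual_information(
    source: Sequence[Hashable], target: Sequence[Hashable], delay: int
) -> float:
    """Dependence between source[t] and target[t + delay]."""
    if len(source) != len(target):
        raise ValueError("series must have equal length")
    if delay < 0 or delay >= len(source):
        raise ValueError("delay must satisfy 0 <= delay < len(source)")
    return mutual_information(source[: len(source) - delay], target[delay:])


# Toy data: y repeats x two steps later, so information runs from x to y.
x = [0, 1, 1, 0, 1, 0, 0, 1, 1, 0, 1, 0]
y = [0, 0] + x[:-2]
x_to_y = delayed_mutual_information(x, y, delay=2)
y_to_x = delayed_mutual_information(y, x, delay=2)
```

In this toy case the delayed dependence from `x` to `y` exceeds the dependence in the reverse direction, which is the asymmetry a delay parameter is meant to expose.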

Forecasting Video QoE with Deep Learning from Multivariate Time-series

IEEE Open Journal of Signal Processing ◽

10.1109/ojsp.2021.3099065 ◽

2021 ◽

pp. 1-1

Author(s):

Hossein Ebrahimidinaki ◽

Shervin Shirmohammadi ◽

Emil Janulewicz ◽

David Cote

Keyword(s):

Time Series ◽

Deep Learning ◽

Multivariate Time Series

Multivariate Time Series Forecasting Based Cloud Computing For Consumer Price Index Using Deep Learning Algorithms

2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI) ◽

10.1109/isriti51436.2020.9315465 ◽

2020 ◽

Author(s):

Soffa Zahara ◽

Sugianto

Keyword(s):

Time Series ◽

Cloud Computing ◽

Deep Learning ◽

Price Index ◽

Multivariate Time Series ◽

Learning Algorithms ◽

Consumer Price Index ◽

Time Series Forecasting

Improved Lempel-Ziv Algorithm Based on Complexity Measurement of Short Time Series

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) ◽

10.1109/fskd.2007.357 ◽

2007 ◽

Cited By ~ 4

Author(s):

Feng-tao Liu ◽

Yong Tang

Keyword(s):

Time Series ◽

Short Time Series ◽

Complexity Measurement ◽

Short Time

Implementation of IoT Framework with Data Analysis Using Deep Learning Methods for Occupancy Prediction in a Building

Future Internet ◽

10.3390/fi13030067 ◽

2021 ◽

Vol 13 (3) ◽

pp. 67

Author(s):

Eric Hitimana ◽

Gaurav Bajpai ◽

Richard Musabe ◽

Louis Sibomana ◽

Jayavel Kayalvizhi

Keyword(s):

Machine Learning ◽

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Multivariate Time Series ◽

Machine Learning Algorithms ◽

Series Data ◽

Support Vector ◽

Human Beings ◽

Feed Forward Network

Many countries worldwide face challenges in implementing incident-prevention measures for building fires. The most critical issues are the localization, identification, and detection of room occupants. The Internet of Things (IoT), together with machine learning, has been shown to make buildings smarter by providing real-time data acquisition using sensors and actuators for prediction mechanisms. This paper proposes the implementation of an IoT framework to capture indoor environmental parameters as multivariate time-series data for occupancy prediction. The Long Short-Term Memory (LSTM) deep learning algorithm is used to infer the presence of human beings. An experiment is conducted in an office room using multivariate time series as predictors in the regression forecasting problem. The results demonstrate that the developed system can obtain, process, and store environmental information. The collected information was fed to the LSTM algorithm, which was compared with other machine learning algorithms: Support Vector Machine, Naïve Bayes Network, and Multilayer Perceptron Feed-Forward Network. The outcomes based on the parametric calibrations demonstrate that LSTM performs better in the context of the proposed application.
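Before a recurrent model such as an LSTM can be trained on sensor readings, the multivariate series is usually cut into fixed-length windows paired with the value to predict. The sketch below shows that preprocessing step only, in plain Python. The function name `make_windows`, the choice of sensors, and the numbers are assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def make_windows(
    features: Sequence[Sequence[float]],
    targets: Sequence[float],
    window: int,
) -> Tuple[List[List[List[float]]], List[float]]:
    """Pair each run of `window` consecutive rows with the next target value."""
    if len(features) != len(targets):
        raise ValueError("features and targets must have equal length")
    if window < 1 or window >= len(features):
        raise ValueError("window must satisfy 1 <= window < len(features)")
    inputs: List[List[List[float]]] = []
    labels: List[float] = []
    for start in range(len(features) - window):
        # Rows start .. start + window - 1 are the predictors.
        inputs.append([list(row) for row in features[start : start + window]])
        # The target one step after the window is the value to forecast.
        labels.append(targets[start + window])
    return inputs, labels


# Hypothetical readings: [temperature in degrees C, CO2 in ppm] per time step.
readings = [[21.0, 400.0], [21.5, 420.0], [22.0, 480.0], [22.5, 510.0]]
occupancy = [0.0, 1.0, 2.0, 1.0]  # hypothetical number of people present
window_inputs, window_labels = make_windows(readings, occupancy, window=2)
```

Each element of `window_inputs` has shape (window, number of sensors), which is the per-sample layout that sequence models commonly expect.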
