Statistical Analysis of Discrete-valued Time Series by Parsimonious High-order Markov Chains

Problems of statistical analysis of discrete-valued time series are considered. Two approaches for construction of parsimonious (small-parametric) models for observed discrete data are proposed based on high-order Markov chains.Consistent statistical estimators for parameters of the developed models and some known models, and also statistical tests on the values of parameters are constructed. Probabilistic properties of the constructed statistical inferences are given. The developed theory is also applied for statistical analysis of spatio-temporal data. Theoretical results are illustrated by computer experiments on real statistical data.

Download Full-text

Markov Chain of Conditional Order: Properties and Statistical Analysis

Austrian Journal of Statistics ◽

10.17713/ajs.v43i3.32 ◽

2014 ◽

Vol 43 (3) ◽

pp. 205-216 ◽

Cited By ~ 5

Author(s):

Yuriy Kharin ◽

Mikhail Maltsau

Keyword(s):

Markov Chain ◽

Statistical Analysis ◽

Statistical Tests ◽

Computer Experiments ◽

Real Data ◽

High Order ◽

Finite Markov Chain ◽

Order Markov Chain ◽

Special Case

The paper deals with finite Markov chain of conditional order, that is a special case of high-order Markov chain with a small number of parameters. Statistical estimators for parameters and statistical tests for parametric hypotheses are constructed and their properties are analyzed. Results of computer experiments on simulated and real data are presented.

Download Full-text

Statistical Analysis of Spatio-Temporal Data Based on Poisson Conditional Autoregressive Model

Informatica ◽

10.15388/informatica.2015.39 ◽

2015 ◽

Vol 26 (1) ◽

pp. 67-87

Author(s):

Yuriy Kharin ◽

Maryna Zhurak

Keyword(s):

Statistical Analysis ◽

Autoregressive Model ◽

Temporal Data ◽

Conditional Autoregressive Model ◽

Conditional Autoregressive ◽

Spatio Temporal

Download Full-text

Coastal Change Patterns from Time Series Clustering of Permanent Laser Scan Data

10.5194/esurf-2020-34 ◽

2020 ◽

Author(s):

Mieke Kuschnerus ◽

Roderik Lindenbergh ◽

Sander Vos

Keyword(s):

Time Series ◽

Laser Scanning ◽

Clustering Algorithm ◽

Coastal Areas ◽

Small Scale ◽

Temporal Data ◽

Agglomerative Clustering ◽

Data Set ◽

Deformation Processes ◽

Spatio Temporal

Abstract. Sandy coasts are constantly changing environments governed by complex interacting processes. Permanent laser scanning is a promising technique to monitor such coastal areas and support analysis of geomorphological deformation processes. This novel technique delivers 3D representations of a part of the coast at hourly temporal and centimetre spatial resolution and allows to observe small scale changes in elevation over extended periods of time. These observations have the potential to improve understanding and modelling of coastal deformation processes. However, to be of use to coastal researchers and coastal management, an efficient way to find and extract deformation processes from the large spatio-temporal data set is needed. In order to allow data mining in an automated way, we extract time series in elevation or range and use unsupervised learning algorithms to derive a partitioning of the observed area according to change patterns. We compare three well known clustering algorithms, k-means, agglomerative clustering and DBSCAN, and identify areas that undergo similar evolution during one month. We test if they fulfil our criteria for a suitable clustering algorithm on our exemplary data set. The three clustering methods are applied to time series of 30 epochs (during one month) extracted from a data set of daily scans covering a part of the coast at Kijkduin, the Netherlands. A small section of the beach, where a pile of sand was accumulated by a bulldozer is used to evaluate the performance of the algorithms against a ground truth. The k-means algorithm and agglomerative clustering deliver similar clusters, and both allow to identify a fixed number of dominant deformation processes in sandy coastal areas, such as sand accumulation by a bulldozer or erosion in the intertidal area. The DBSCAN algorithm finds clusters for only about 44 % of the area and turns out to be more suitable for the detection of outliers, caused for example by temporary objects on the beach. Our study provides a methodology to efficiently mine a spatio-temporal data set for predominant deformation patterns with the associated regions, where they occur.

Download Full-text

Parameter Estimators for Gaussian Models with Censored Time Series and Spatio-temporal Data

COMPSTAT ◽

10.1007/978-3-662-01131-7_42 ◽

1998 ◽

pp. 323-328 ◽

Cited By ~ 1

Author(s):

C. A. Glasbey ◽

I. M. Nevison ◽

A. G. M. Hunter

Keyword(s):

Time Series ◽

Temporal Data ◽

Gaussian Models ◽

Spatio Temporal ◽

Parameter Estimators

Download Full-text

A Spatio-Temporal Data Fusion Model for Generating NDVI Time Series in Heterogeneous Regions

Remote Sensing ◽

10.3390/rs9111125 ◽

2017 ◽

Vol 9 (11) ◽

pp. 1125 ◽

Cited By ~ 13

Author(s):

Chunhua Liao ◽

Jinfei Wang ◽

Ian Pritchard ◽

Jiangui Liu ◽

Jiali Shang

Keyword(s):

Time Series ◽

Data Fusion ◽

Temporal Data ◽

Fusion Model ◽

Ndvi Time Series ◽

Spatio Temporal

Download Full-text

Image Time Series Classification based on a Planar Spatio-temporal Data Representation

Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications ◽

10.5220/0008949202760283 ◽

2020 ◽

Author(s):

Mohamed Chelali ◽

Camille Kurtz ◽

Anne Puissant ◽

Nicole Vincent

Keyword(s):

Time Series ◽

Data Representation ◽

Temporal Data ◽

Time Series Classification ◽

Spatio Temporal

Download Full-text

Characterizing popularity dynamics of hot topics using micro-blogs spatio-temporal data

Journal Of Big Data ◽

10.1186/s40537-019-0266-4 ◽

2019 ◽

Vol 6 (1) ◽

Cited By ~ 1

Author(s):

Lianren Wu ◽

Jinjie Li ◽

Jiayin Qi

Keyword(s):

Time Series ◽

Spatial Analysis ◽

Power Law ◽

Clustering Algorithm ◽

Temporal Data ◽

Literature Reference ◽

Spectral Centroid ◽

Popularity Dynamics ◽

Temporal And Spatial ◽

Spatio Temporal

AbstractIn this paper, a quantitative temporal and spatial analysis of the dynamics of hot topics popularity in Micro-blogging system was provided. Firstly, the popularity time series of 1167 hot topics were counted and calculated by Excel. Secondly, based on MATLAB software,the popularity time series were clustered into six clusters by K-spectral centroid (K-SC) clustering algorithm. Thirdly, we analyzed temporal patterns and spatial patterns of popularity dynamics of topics by statistical methods. The results show that temporal popularity of micro-blogging topics is rapidly dying, and the distribution of popularity is subject to the power law form. In addition, most of the Micro-blogging topics are global topic. Our results can provide a literature reference for studying the influence of online hot topics and the evolution of public opinion.

Download Full-text

Analysis of the high order ADL(p, q) models used to describe connections between time series

Российский технологический журнал ◽

10.32362/2500-316x-2020-8-2-7-22 ◽

2020 ◽

Vol 8 (2) ◽

pp. 7-22 ◽

Cited By ~ 1

Author(s):

T. R. Kalugin ◽

A. K. Kim ◽

D. A. Petrusevich

Keyword(s):

Time Series ◽

Statistical Tests ◽

High Order ◽

Index Pair ◽

Time Period ◽

Production And Consumption ◽

Consumption Index ◽

Dynamic Series ◽

The Mathematical Model ◽

Index Time Series

In the paper the mathematical models describing connection between two time series are researched. At first each of them is investigated separately, and the ARIMA(p, d, q) model is constructed. These models are based on the time series characteristics obtained during the analysis stage. The connection between two time series is confirmed with the aid of cointegration statistical tests. Then the mathematical model of the connection between series is constructed. The ADL(p, q) model describes this dependence. It’s shown that for the time series under investigation the orders p, q of the ADL(p, q) model are connected with the ARIMA(p, d, q) orders of the describing each series separately. This step makes the set of the investigated ADL(p, q) models much smaller. In the previous papers it was also shown that the ARIMA(p, d, q) automatical fitting functions in popular packages use limitations on the p, q orders of the time series process: q ≤ 5, p ≤ 5. The wish to use the simplest models is also built in the structure of the Akaike (AIC) and Bayes (BIC) informational criteria. In the paper the maximal values of the ADL(p, q) model orders are supposed to be the orders of the appropriate ARIMA(p, d, q) series. In the previous work it was shown that using high order ARIMA(p, d, q) it is possible to fit the models better. In this paper the experiments on the ADL(p, q) models construction are presented. The wage index and money income index time series pair is researched, and also the gas, water and energy production and consumption index/real agricultural production index pair is investigated. The data in the 2000–2018 time period is taken from the dynamic series of macroeconomic statistics of the Russian Federation.

Download Full-text

Tri-Clustering Based Exploration of Temporal Resolution Impacts on Spatio-Temporal Clusters in Geo-Referenced Time Series

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9040210 ◽

2020 ◽

Vol 9 (4) ◽

pp. 210

Author(s):

Xiaojing Wu ◽

Donghai Zheng

Keyword(s):

Time Series ◽

Temporal Resolution ◽

Input Data ◽

Clustering Algorithm ◽

Temperature Data ◽

Similar Data ◽

Temporal Data ◽

Clustering Methods ◽

Spatio Temporal ◽

Data Elements

Unprecedented amounts of spatio-temporal data instigates an urgent need for patterns exploration in it. Clustering analysis is useful in extracting patterns from big data by grouping similar data elements into clusters. Compared with one-way clustering and co-clustering methods, tri-clustering methods are more capable of exploring complex patterns. However, the explored patterns or clusters could be different due to varying temporal resolutions of input data. This study presents a tri-clustering based method to explore the impacts of different temporal resolutions on spatio-temporal clusters identified in geo-referenced time series (GTS), one type of spatio-temporal data. Dutch daily temperature data at 28 stations over 20 years was used to illustrate this study. The temperature data at daily, monthly, and yearly resolutions were subjected to the Bregman cube average tri-clustering algorithm with I-divergence (BCAT_I) to detect spatio-temporal clusters, which were then compared in terms of patterns exhibited, compositions, and changed elements. Results confirm the temporal resolution impacts on the spatio-temporal clusters identified in the Dutch temperature data: most compositions of clusters are varying when changing the temporal resolutions of input data in the GTS. Nevertheless, there is almost no change of elements in certain clusters (12 stations in the northeast of the country; years 1996, 2010) at all temporal resolutions, suggesting them as the “true” clusters in the case study dataset.

Download Full-text

Evaluation of spatio-temporal data fusion methods for generating NDVI time series in cropland areas

2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) ◽

10.1109/igarss.2016.7729664 ◽

2016 ◽

Author(s):

Chunhua Liao ◽

Jinfei Wang

Keyword(s):

Time Series ◽

Data Fusion ◽

Temporal Data ◽

Fusion Methods ◽

Ndvi Time Series ◽

Spatio Temporal

Download Full-text