scholarly journals Сomparative analysis of algorithms for change points detection in regression models of time series

2021 ◽  
Vol 9 (2) ◽  
pp. 137-150
Author(s):  
Viacheslav Riabtsev ◽  
Dmytro Sharadkin ◽  
Yurii Kliat

2015 ◽  
Vol 7 (2) ◽  
pp. 262-279 ◽  
Author(s):  
Zhichao Guo ◽  
Yuanhua Feng ◽  
Thomas Gries

Purpose – The purpose of this paper is to investigate changes of China’s agri-food exports to Germany caused by China’s accession to WTO and the global financial crisis in a quantitative way. The paper aims to detect structural breaks and compare differences before and after the change points. Design/methodology/approach – The structural breaks detection procedures in this paper can be applied to find out two different types of change points, i.e. in the middle and at the end of one time series. Then time series and regression models are used to compare differences of trade relationship before and after the detected change points. The methods can be employed in any economic series and work well in practice. Findings – The results indicate that structural breaks in 2002 and 2009 are caused by China’s accession to WTO and the financial crisis. Time series and regression models show that the development of China’s exports to Germany in agri-food products has different features in different sub-periods. Before 1999, there is no significant relationship between China’s exports to Germany and Germany’s imports from the world. Between 2002 and 2008 the former depends on the latter very strongly, and China’s exports to Germany developed quickly and stably. It decreased, however suddenly in 2009, caused by the great reduction of Germany’s imports from the world in that year. But China’s market share in Germany still had a small gain. Analysis of two categories in agri-food trade also leads to similar conclusions. Comparing the two events we see rather different patterns even if they both indicate structural breaks in the development of China’s agri-food exports to Germany. Originality/value – This paper partly originally proposes two statistical algorithms for detecting different kinds of structural breaks in the middle part and at the end of a short-time series, respectively.


2021 ◽  
Vol 149 ◽  
Author(s):  
Helmut Küchenhoff ◽  
Felix Günther ◽  
Michael Höhle ◽  
Andreas Bender

Abstract We analysed the coronavirus disease 2019 epidemic curve from March to the end of April 2020 in Germany. We use statistical models to estimate the number of cases with disease onset on a given day and use back-projection techniques to obtain the number of new infections per day. The respective time series are analysed by a trend regression model with change points. The change points are estimated directly from the data. We carry out the analysis for the whole of Germany and the federal state of Bavaria, where we have more detailed data. Both analyses show a major change between 9 and 13 March for the time series of infections: from a strong increase to a decrease. Another change was found between 25 March and 29 March, where the decline intensified. Furthermore, we perform an analysis stratified by age. A main result is a delayed course of the pandemic for the age group 80 + resulting in a turning point at the end of March. Our results differ from those by other authors as we take into account the reporting delay, which turned out to be time dependent and therefore changes the structure of the epidemic curve compared to the curve of newly reported cases.


Pathogens ◽  
2021 ◽  
Vol 10 (4) ◽  
pp. 480
Author(s):  
Rania Kousovista ◽  
Christos Athanasiou ◽  
Konstantinos Liaskonis ◽  
Olga Ivopoulou ◽  
George Ismailos ◽  
...  

Acinetobacter baumannii is one of the most difficult-to-treat pathogens worldwide, due to developed resistance. The aim of this study was to evaluate the use of widely prescribed antimicrobials and the respective resistance rates of A. baumannii, and to explore the relationship between antimicrobial use and the emergence of A. baumannii resistance in a tertiary care hospital. Monthly data on A. baumannii susceptibility rates and antimicrobial use, between January 2014 and December 2017, were analyzed using time series analysis (Autoregressive Integrated Moving Average (ARIMA) models) and dynamic regression models. Temporal correlations between meropenem, cefepime, and ciprofloxacin use and the corresponding rates of A. baumannii resistance were documented. The results of ARIMA models showed statistically significant correlation between meropenem use and the detection rate of meropenem-resistant A. baumannii with a lag of two months (p = 0.024). A positive association, with one month lag, was identified between cefepime use and cefepime-resistant A. baumannii (p = 0.028), as well as between ciprofloxacin use and its resistance (p < 0.001). The dynamic regression models offered explanation of variance for the resistance rates (R2 > 0.60). The magnitude of the effect on resistance for each antimicrobial agent differed significantly.


Water ◽  
2021 ◽  
Vol 13 (12) ◽  
pp. 1633
Author(s):  
Elena-Simona Apostol ◽  
Ciprian-Octavian Truică ◽  
Florin Pop ◽  
Christian Esposito

Due to the exponential growth of the Internet of Things networks and the massive amount of time series data collected from these networks, it is essential to apply efficient methods for Big Data analysis in order to extract meaningful information and statistics. Anomaly detection is an important part of time series analysis, improving the quality of further analysis, such as prediction and forecasting. Thus, detecting sudden change points with normal behavior and using them to discriminate between abnormal behavior, i.e., outliers, is a crucial step used to minimize the false positive rate and to build accurate machine learning models for prediction and forecasting. In this paper, we propose a rule-based decision system that enhances anomaly detection in multivariate time series using change point detection. Our architecture uses a pipeline that automatically manages to detect real anomalies and remove the false positives introduced by change points. We employ both traditional and deep learning unsupervised algorithms, in total, five anomaly detection and five change point detection algorithms. Additionally, we propose a new confidence metric based on the support for a time series point to be an anomaly and the support for the same point to be a change point. In our experiments, we use a large real-world dataset containing multivariate time series about water consumption collected from smart meters. As an evaluation metric, we use Mean Absolute Error (MAE). The low MAE values show that the algorithms accurately determine anomalies and change points. The experimental results strengthen our assumption that anomaly detection can be improved by determining and removing change points as well as validates the correctness of our proposed rules in real-world scenarios. Furthermore, the proposed rule-based decision support systems enable users to make informed decisions regarding the status of the water distribution network and perform effectively predictive and proactive maintenance.


Author(s):  
Rati WONGSATHAN

The novel coronavirus 2019 (COVID-19) pandemic was declared a global health crisis. The real-time accurate and predictive model of the number of infected cases could help inform the government of providing medical assistance and public health decision-making. This work is to model the ongoing COVID-19 spread in Thailand during the 1st and 2nd phases of the pandemic using the simple but powerful method based on the model-free and time series regression models. By employing the curve fitting, the model-free method using the logistic function, hyperbolic tangent function, and Gaussian function was applied to predict the number of newly infected patients and accumulate the total number of cases, including peak and viral cessation (ending) date. Alternatively, with a significant time-lag of historical data input, the regression model predicts those parameters from 1-day-ahead to 1-month-ahead. To obtain optimal prediction models, the parameters of the model-free method are fine-tuned through the genetic algorithm, whereas the generalized least squares update the parameters of the regression model. Assuming the future trend continues to follow the past pattern, the expected total number of patients is approximately 2,689 - 3,000 cases. The estimated viral cessation dates are May 2, 2020 (using Gaussian function), May 4, 2020 (using a hyperbolic function), and June 5, 2020 (using a logistic function), whereas the peak time occurred on April 5, 2020. Moreover, the model-free method performs well for long-term prediction, whereas the regression model is suitable for short-term prediction. Furthermore, the performances of the regression models yield a highly accurate forecast with lower RMSE and higher R2 up to 1-week-ahead. HIGHLIGHTS COVID-19 model for Thailand during the first and second phases of the epidemic The model-free method using the logistic function, hyperbolic tangent function, and Gaussian function  applied to predict the basic measures of the outbreak Regression model predicts those measures from one-day-ahead to one-month-ahead The parameters of the model-free method are fine-tuned through the genetic algorithm  GRAPHICAL ABSTRACT


Sign in / Sign up

Export Citation Format

Share Document