scholarly journals Evaluation of Test Statistics for Detection of Outliers and Shifts

2020 ◽  
Vol 4 (2) ◽  
pp. 54-75
Author(s):  
Amena Urooj ◽  
Zahid Asghar

Existence of outliers and structural breaks having mutually unknown nature, in time series data, offer challenges to data analysts in model identification, estimation and validation. Detection of these outliers has been an important area of research in time series since long. To analyze the impact of these structural breaks and outliers on model identification, estimation and their inferential analysis, we use two data generating processes: MA(1) and ARMA(1,1). The performance of the test statistics for detecting additive outlier(AO), innovative outlier(IO), level shift(LS) and transient change(TC) is investigated using simulation strategy through power of a test, empirical level of significance, empirical critical values, misspecification frequencies and sampling distribution of estimators for the two models. The empirical critical values are found higher than the theoretical cut-off points, empirical power of the test statistics is not satisfactory for small sample size, large cut-off points and large model coefficient. We have explored confusion between LS, AO, TC and IO at different critical values(c) by varying sample size. We have also collected empirical evidence from time series data for Pakistan using 3-stage iterative procedure to detect multiple outliers and structural breaks. We find that neglecting shocks lead to wrong identification, biased estimation and excess kurtosis. JEL Classification Codes: C15, C18, C63, C32, C87, C51, C52, C82 AMS Classification Codes: 62, 65, 91, DI, 62-08, 62J20, 00A72, 91-08, 91-10, 91-11 62P20, 91B82, 91B84, 62M07, 62M09, 62M10, 62M15, 62M20

2008 ◽  
Vol 18 (12) ◽  
pp. 3679-3687 ◽  
Author(s):  
AYDIN A. CECEN ◽  
CAHIT ERKAL

We present a critical remark on the pitfalls of calculating the correlation dimension and the largest Lyapunov exponent from time series data when trend and periodicity exist. We consider a special case where a time series Zi can be expressed as the sum of two subsystems so that Zi = Xi + Yi and at least one of the subsystems is deterministic. We show that if the trend and periodicity are not properly removed, correlation dimension and Lyapunov exponent estimations yield misleading results, which can severely compromise the results of diagnostic tests and model identification. We also establish an analytic relationship between the largest Lyapunov exponents of the subsystems and that of the whole system. In addition, the impact of a periodic parameter perturbation on the Lyapunov exponent for the logistic map and the Lorenz system is discussed.


2021 ◽  
Vol 11 (8) ◽  
pp. 3561
Author(s):  
Diego Duarte ◽  
Chris Walshaw ◽  
Nadarajah Ramesh

Across the world, healthcare systems are under stress and this has been hugely exacerbated by the COVID pandemic. Key Performance Indicators (KPIs), usually in the form of time-series data, are used to help manage that stress. Making reliable predictions of these indicators, particularly for emergency departments (ED), can facilitate acute unit planning, enhance quality of care and optimise resources. This motivates models that can forecast relevant KPIs and this paper addresses that need by comparing the Autoregressive Integrated Moving Average (ARIMA) method, a purely statistical model, to Prophet, a decomposable forecasting model based on trend, seasonality and holidays variables, and to the General Regression Neural Network (GRNN), a machine learning model. The dataset analysed is formed of four hourly valued indicators from a UK hospital: Patients in Department; Number of Attendances; Unallocated Patients with a DTA (Decision to Admit); Medically Fit for Discharge. Typically, the data exhibit regular patterns and seasonal trends and can be impacted by external factors such as the weather or major incidents. The COVID pandemic is an extreme instance of the latter and the behaviour of sample data changed dramatically. The capacity to quickly adapt to these changes is crucial and is a factor that shows better results for GRNN in both accuracy and reliability.


Water ◽  
2021 ◽  
Vol 13 (4) ◽  
pp. 416
Author(s):  
Bwalya Malama ◽  
Devin Pritchard-Peterson ◽  
John J. Jasbinsek ◽  
Christopher Surfleet

We report the results of field and laboratory investigations of stream-aquifer interactions in a watershed along the California coast to assess the impact of groundwater pumping for irrigation on stream flows. The methods used include subsurface sediment sampling using direct-push drilling, laboratory permeability and particle size analyses of sediment, piezometer installation and instrumentation, stream discharge and stage monitoring, pumping tests for aquifer characterization, resistivity surveys, and long-term passive monitoring of stream stage and groundwater levels. Spectral analysis of long-term water level data was used to assess correlation between stream and groundwater level time series data. The investigations revealed the presence of a thin low permeability silt-clay aquitard unit between the main aquifer and the stream. This suggested a three layer conceptual model of the subsurface comprising unconfined and confined aquifers separated by an aquitard layer. This was broadly confirmed by resistivity surveys and pumping tests, the latter of which indicated the occurrence of leakage across the aquitard. The aquitard was determined to be 2–3 orders of magnitude less permeable than the aquifer, which is indicative of weak stream-aquifer connectivity and was confirmed by spectral analysis of stream-aquifer water level time series. The results illustrate the importance of site-specific investigations and suggest that even in systems where the stream is not in direct hydraulic contact with the producing aquifer, long-term stream depletion can occur due to leakage across low permeability units. This has implications for management of stream flows, groundwater abstraction, and water resources management during prolonged periods of drought.


2007 ◽  
pp. 88
Author(s):  
Wataru Suzuki ◽  
Yanfei Zhou

This article represents the first step in filling a large gap in knowledge concerning why Public Assistance (PA) use recently rose so fast in Japan. Specifically, we try to address this problem not only by performing a Blanchard and Quah decomposition on long-term monthly time series data (1960:04-2006:10), but also by estimating prefecturelevel longitudinal data. Two interesting findings emerge from the time series analysis. The first is that permanent shock imposes a continuously positive impact on the PA rate and is the main driving factor behind the recent increase in welfare use. The second finding is that the impact of temporary shock will last for a long time. The rate of the use of welfare is quite rigid because even if the PA rate rises due to temporary shocks, it takes about 8 or 9 years for it to regain its normal level. On the other hand, estimations of prefecture-level longitudinal data indicate that the Financial Capability Index (FCI) of the local government2 and minimum wage both impose negative effects on the PA rate. We also find that the rapid aging of Japan's population presents a permanent shock in practice, which makes it the most prominent contribution to surging welfare use.


2020 ◽  
Vol 6 (1) ◽  
pp. 273-282
Author(s):  
Majid Hussain Phul ◽  
Muhammad Saleem Rahpoto ◽  
Ghulam Muhammad Mangnejo

This research paper empirically investigates the outcome of Political stability on economic growth (EG) of Pakistan for the period of 1988 to 2018. Political stability (PS), gross fixed capital formation (GFCF), total labor force (TLF) and Inflation (INF) are important explanatory variables. Whereas for model selection GDPr is used as the dependent variable. To check the stationary of time series data Augmented Dickey Fuller (ADF) unit root (UR) test has been used,  and whereas to find out the long run relationship among variables, OLS method has been used. The analysis the impact of PS on EG (EG) in the short run, VAR model has been used. The outcomes show that all the variables (PS, GFCF, TLF and INF) have a significantly positive effect on the EG of Pakistan in the long run period. But the effect of PS on GDP is smaller. Further, in this research we are trying to see the short run relationship between GDP and other explanatory variables. The outcomes show that PS does not have such effect on GDP in the short run analysis. While GFCF, TLF and INF have significantly positive effect on GDP of Pakistan in the short run period.


2020 ◽  
Vol 2 (1) ◽  
pp. 128-145
Author(s):  
Yuafanda Kholfi Hartono ◽  
Sumarto Eka Putra

Indonesia Japan Economic Partnership Agreement (IJ-EPA) is a bilateral free-trade agreement between Indonesia and Japan that has been started from July 1st, 2008. After more than a decade of its implementation, there is a question that we need to be addressed: Does liberalization of IJ-EPA make Indonesia’s export to Japan increase? This question is important since the government gives a trade-off by giving lower tariff for certain commodities agreed in agreement to increase export. Using Interrupted time series (ITS) analysis based on time-series data from Statistics Indonesia (BPS), this article found that the impact of IJ-EPA decreased for Indonesia export to Japan. Furthermore, this paper proposed some potential commodities that can increase the effectiveness of this FTA. The importance of this topic is that Indonesia will maximize the benefit in implementing of agreement that they made from the third biggest destination export of their total export value, so it will be in line with the government's goal to expand export market to solve current account deficit. In addition, the method that used in this paper can be implemented to other countries so that they can maximize the effect of Free Trade Agreement, especially for their export.


2000 ◽  
Vol 16 (6) ◽  
pp. 927-997 ◽  
Author(s):  
Hyungsik R. Moon ◽  
Peter C.B. Phillips

Time series data are often well modeled by using the device of an autoregressive root that is local to unity. Unfortunately, the localizing parameter (c) is not consistently estimable using existing time series econometric techniques and the lack of a consistent estimator complicates inference. This paper develops procedures for the estimation of a common localizing parameter using panel data. Pooling information across individuals in a panel aids the identification and estimation of the localizing parameter and leads to consistent estimation in simple panel models. However, in the important case of models with concomitant deterministic trends, it is shown that pooled panel estimators of the localizing parameter are asymptotically biased. Some techniques are developed to overcome this difficulty, and consistent estimators of c in the region c < 0 are developed for panel models with deterministic and stochastic trends. A limit distribution theory is also established, and test statistics are constructed for exploring interesting hypotheses, such as the equivalence of local to unity parameters across subgroups of the population. The methods are applied to the empirically important problem of the efficient extraction of deterministic trends. They are also shown to deliver consistent estimates of distancing parameters in nonstationary panel models where the initial conditions are in the distant past. In the development of the asymptotic theory this paper makes use of both sequential and joint limit approaches. An important limitation in the operation of the joint asymptotics that is sometimes needed in our development is the rate condition n/T → 0. So the results in the paper are likely to be most relevant in panels where T is large and n is moderately large.


2010 ◽  
Vol 4 ◽  
pp. BBI.S5983 ◽  
Author(s):  
Daisuke Tominaga

Time series of gene expression often exhibit periodic behavior under the influence of multiple signal pathways, and are represented by a model that incorporates multiple harmonics and noise. Most of these data, which are observed using DNA microarrays, consist of few sampling points in time, but most periodicity detection methods require a relatively large number of sampling points. We have previously developed a detection algorithm based on the discrete Fourier transform and Akaike's information criterion. Here we demonstrate the performance of the algorithm for small-sample time series data through a comparison with conventional and newly proposed periodicity detection methods based on a statistical analysis of the power of harmonics. We show that this method has higher sensitivity for data consisting of multiple harmonics, and is more robust against noise than other methods. Although “combinatorial explosion” occurs for large datasets, the computational time is not a problem for small-sample datasets. The MATLAB/GNU Octave script of the algorithm is available on the author's web site: http://www.cbrc.jp/%7Etominaga/piccolo/ .


Vibration ◽  
2019 ◽  
Vol 2 (1) ◽  
pp. 25-46 ◽  
Author(s):  
Merten Stender ◽  
Sebastian Oberst ◽  
Norbert Hoffmann

Time recordings of impulse-type oscillation responses are short and highly transient. These characteristics may complicate the usage of classical spectral signal processing techniques for (a) describing the dynamics and (b) deriving discriminative features from the data. However, common model identification and validation techniques mostly rely on steady-state recordings, characteristic spectral properties and non-transient behavior. In this work, a recent method, which allows reconstructing differential equations from time series data, is extended for higher degrees of automation. With special focus on short and strongly damped oscillations, an optimization procedure is proposed that fine-tunes the reconstructed dynamical models with respect to model simplicity and error reduction. This framework is analyzed with particular focus on the amount of information available to the reconstruction, noise contamination and nonlinearities contained in the time series input. Using the example of a mechanical oscillator, we illustrate how the optimized reconstruction method can be used to identify a suitable model and how to extract features from uni-variate and multivariate time series recordings in an engineering-compliant environment. Moreover, the determined minimal models allow for identifying the qualitative nature of the underlying dynamical systems as well as testing for the degree and strength of nonlinearity. The reconstructed differential equations would then be potentially available for classical numerical studies, such as bifurcation analysis. These results represent a physically interpretable enhancement of data-driven modeling approaches in structural dynamics.


Sign in / Sign up

Export Citation Format

Share Document