scholarly journals Hypothesis Testing for Autocorrelated Short Climate Time Series

2014 ◽  
Vol 53 (3) ◽  
pp. 637-651 ◽  
Author(s):  
Virginie Guemas ◽  
Ludovic Auger ◽  
Francisco J. Doblas-Reyes

AbstractCommonly used statistical tests of hypothesis, also termed inferential tests, that are available to meteorologists and climatologists all require independent data in the time series to which they are applied. However, most of the time series that are usually handled are actually serially dependent. A common approach to handle such a serial dependence is to replace in those statistical tests the actual number of data by an estimated effective number of independent data that is computed from a classical and widely used formula that relies on the autocorrelation function. Despite being perfectly demonstrable under some hypotheses, this formula provides unreliable results on practical cases, for two different reasons. First, the formula has to be applied using the estimated autocorrelation function, which bears a large uncertainty because of the usual shortness of the available time series. After the impact of this uncertainty is illustrated, some recommendations of preliminary treatment of the time series prior to any application of this formula are made. Second, the derivation of this formula is done under the hypothesis of identically distributed data, which is often not valid in real climate or meteorological problems. It is shown how this issue is due to real physical processes that induce temporal coherence, and an illustration is given of how not respecting the hypotheses affects the results provided by the formula.

Author(s):  
Richard McCleary ◽  
David McDowall ◽  
Bradley J. Bartos

The general AutoRegressive Integrated Moving Average (ARIMA) model can be written as the sum of noise and exogenous components. If an exogenous impact is trivially small, the noise component can be identified with the conventional modeling strategy. If the impact is nontrivial or unknown, the sample AutoCorrelation Function (ACF) will be distorted in unknown ways. Although this problem can be solved most simply when the outcome of interest time series is long and well-behaved, these time series are unfortunately uncommon. The preferred alternative requires that the structure of the intervention is known, allowing the noise function to be identified from the residualized time series. Although few substantive theories specify the “true” structure of the intervention, most specify the dichotomous onset and duration of an impact. Chapter 5 describes this strategy for building an ARIMA intervention model and demonstrates its application to example interventions with abrupt and permanent, gradually accruing, gradually decaying, and complex impacts.


2014 ◽  
Vol 21 (3) ◽  
pp. 651-657 ◽  
Author(s):  
N. Molkenthin ◽  
K. Rehfeld ◽  
V. Stolbova ◽  
L. Tupikina ◽  
J. Kurths

Abstract. Climate networks are constructed from climate time series data using correlation measures. It is widely accepted that the geographical proximity, as well as other geographical features such as ocean and atmospheric currents, have a large impact on the observable time-series similarity. Therefore it is to be expected that the spatial sampling will influence the reconstructed network. Here we investigate this by comparing analytical flow networks, networks generated with the START model and networks from temperature data from the Asian monsoon domain. We evaluate them on a regular grid, a grid with added random jittering and two variations of clustered sampling. We find that the impact of the spatial sampling on most network measures only distorts the plots if the node distribution is significantly inhomogeneous. As a simple diagnostic measure for the detection of inhomogeneous sampling we suggest the Voronoi cell size distribution.


2014 ◽  
Vol 11 (91) ◽  
pp. 20130585 ◽  
Author(s):  
Bernard Cazelles ◽  
Kévin Cazelles ◽  
Mario Chavez

Wavelet analysis is now frequently used to extract information from ecological and epidemiological time series. Statistical hypothesis tests are conducted on associated wavelet quantities to assess the likelihood that they are due to a random process. Such random processes represent null models and are generally based on synthetic data that share some statistical characteristics with the original time series. This allows the comparison of null statistics with those obtained from original time series. When creating synthetic datasets, different techniques of resampling result in different characteristics shared by the synthetic time series. Therefore, it becomes crucial to consider the impact of the resampling method on the results. We have addressed this point by comparing seven different statistical testing methods applied with different real and simulated data. Our results show that statistical assessment of periodic patterns is strongly affected by the choice of the resampling method, so two different resampling techniques could lead to two different conclusions about the same time series. Moreover, our results clearly show the inadequacy of resampling series generated by white noise and red noise that are nevertheless the methods currently used in the wide majority of wavelets applications. Our results highlight that the characteristics of a time series, namely its Fourier spectrum and autocorrelation, are important to consider when choosing the resampling technique. Results suggest that data-driven resampling methods should be used such as the hidden Markov model algorithm and the ‘beta-surrogate’ method.


2020 ◽  
Author(s):  
Debjyoti Talukdar ◽  
Dr. Vrijesh Tripathi

BACKGROUND Rapid spread of SARS nCoV-2 virus in Caribbean region has prompted heightened surveillance with more than 350,000 COVID-19 confirmed cases in 13 Caribbean countries namely Antigua and Barbados, Bahamas, Barbados, Cuba, Dominica, Dominican Republic, Grenada, Haiti, Jamaica, Saint Kitts and Nevis, Saint Lucia, Saint Vincent and the Grenadines, Trinidad and Tobago. OBJECTIVE The aim of our study is to analyze the impact of coronavirus (SARS nCoV-2) in 13 Caribbean countries in terms of confirmed cases, number of deaths and recovered cases. Current and projected forecasts using advanced autoregressive integrated moving average (ARIMA) models will enable local health organisations to plan future courses of action in terms of lockdown and managing essential public services. METHODS The study uses the auto regressive integrated moving average (ARIMA) model based upon time series pattern as per data retrieved from John Hopkins University, freely accessible on public domain and used for research and academic purposes. The data was analyzed using STATA 14 SE software between the time period - Jan 22, 2020 till May 27, 2020 using ARIMA time series analysis. It involves generalizing an autoregressive moving average model to better understand the data and predict future points in the time series until June 15, 2020. RESULTS The results show the predicted trend in terms of COVID-19 confirmed, mortality and recovered cases for 13 Caribbean countries. The projected ARIMA model forecast for the time period - May 25, 2020 to May 31, 2020 show 20278 (95% CI 19433.21 - 21123.08) confirmed cases, 631 (95% CI 615.90 - 646.51) deaths and 11501 (95% CI 10912.45 - 12089) recovered cases related to SARS nCoV-2 virus. The final ARIMA model chosen for confirmed COVID-19 cases, number of deaths and recovered cases are ARIMA (4,2,2), ARIMA (2,1,2) and ARIMA (4,1,2) respectively. All chosen models were compared with other models in terms of various factors like AIC/BIC (Akaike Information Criterion/Bayesian Information Criterion), log likelihood, p-value significance, coefficient < 1 and 5% significance. The autocorrelation function (ACF) and partial autocorrelation function (PACF) graphs were plotted to reduce bias and select the best fitting model. CONCLUSIONS As per the results of the forecasted COVID-19 models, there is a steady rise in terms of confirmed, recovered and mortality cases during the time period March 1, 2020 until May 27, 2020. It shows an increasing trend for confirmed and recovered COVID-19 cases and slowing of the number of mortality cases over a period of time. The predicted model will help the local health administration to devise public policies in terms of awareness measures, lockdown and essential health services accordingly.


Author(s):  
Lekë Pula ◽  
Alban Elshani

Abstract The aim of the study is to examine the impact of public expenditure on economic growth of Kosovo. Time series data span for the period of time 2002-2015. The structure of the econometric model is built on Keynesian theories and endogenous growth model. The model estimation is performed only after implementing the Augmented Dickey-Fuller (ADF) Unit Root test to estimate if time series are stationary. Several tests have been implemented to determine model validity. The model has met all the assumptions of statistical tests: error term residuals have a normal distribution (Jarque-Bera test), there is no auto-correlation between variables (Breusch-Godfrey Serial test), and error variances are constant, known as the principle of homoscedasticity (Breusch-Pagan-Godfrey test). Gross domestic product is used as a dependent variable in the model, while public expenditure (G), foreign direct investment (FDI), export (EXP) and total budget revenue (TrTax) are used as the endogenous variables. The study results have revealed that there is a positive and statistically significant effect of public expenditures and exports on economic growth. Total budget revenue has a positive impact on economic growth but this has not been proved to be statistically significant. The authors of the research have also found out that FDI is negative and statistically insignificant.


2019 ◽  
Vol 129 (4) ◽  
pp. 127-131
Author(s):  
Agnieszka Parfin ◽  
Krystian Wdowiak ◽  
Marzena Furtak-Niczyporuk ◽  
Jolanta Herda

AbstractIntroduction. The COVID-19 is the name of an infectious disease caused by a new strain of coronavirus SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2). It was first diagnosed in December 2019 in patients in Wuhan City, Hubei Province, China. The symptoms are dominated by features of respiratory tract infections, in some patients with a very severe course leading to respiratory failure and, in extreme cases to death. Due to the spread of the infection worldwide, the WHO declared a pandemic in March 2020.Aim. An investigation of the impact of social isolation introduced due to the coronavirus pandemic on selected aspects of life. The researchers focused on observing changes in habits related to physical activity and their connections with people’s subjective well-being and emotional state.Material and methods. The study was carried out within the international project of the group „IRG on COVID and exercise”. The research tool was a standardized questionnaire.Results. Based on the data collected and the analysis of the percentage results, it can be observed that the overwhelming majority of people taking up physical activity reported a better mood during the pandemic. However, statistical tests do not confirm these relationships due to the small sample size.Conclusions. Isolation favours physical activity. Future, in-depth studies, by enlarging the population group, are necessary to confirm the above observations.


1982 ◽  
Vol 14 (4-5) ◽  
pp. 245-252 ◽  
Author(s):  
C S Sinnott ◽  
D G Jamieson

The combination of increasing nitrate concentrations in the River Thames and the recent EEC Directive on the acceptable level in potable water is posing a potential problem. In assessing the impact of nitrates on water-resource systems, extensive use has been made of time-series analysis and simulation. These techniques are being used to define the optimal mix of alternatives for overcoming the problem on a regional basis.


GEOgraphia ◽  
2018 ◽  
Vol 20 (43) ◽  
pp. 124
Author(s):  
Amaury De Souza ◽  
Priscilla V Ikefuti ◽  
Ana Paula Garcia ◽  
Debora A.S Santos ◽  
Soetania Oliveira

Análise e previsão de parâmetros de qualidade do ar são tópicos importantes da pesquisa atmosférica e ambiental atual, devido ao impacto causado pela poluição do ar na saúde humana. Este estudo examina a transformação do dióxido de nitrogênio (NO2) em ozônio (O3) no ambiente urbano, usando o diagrama de séries temporais. Foram utilizados dados de concentração de poluentes ambientais e variáveis meteorológicas para prever a concentração de O3 na atmosfera. Foi testado o emprego de modelos de regressão linear múltipla como ferramenta para a predição da concentração de O3. Os resultados indicam que o valor da temperatura e a presença de NO2 influenciam na concentração de O3 em Campo Grande, capital do Estado do Mato Grosso do Sul. Palavras-chave: Ozônio. Dióxido de nitrogênio. Séries cronológicas. Regressões. ANALYSIS OF THE RELATIONSHIP BETWEEN O3, NO AND NO2 USING MULTIPLE LINEAR REGRESSION TECHNIQUES.Abstract: Analysis and prediction of air quality parameters are important topics of current atmospheric and environmental research due to the impact caused by air pollution on human health. This study examines the transformation of nitrogen dioxide (NO2) into ozone (O3) in the urban environment, using the time series diagram. Environmental pollutant concentration and meteorological variables were used to predict the O3 concentration in the atmosphere. The use of multiple linear regression models was tested as a tool to predict O3 concentration. The results indicate that the temperature value and the presence of NO2 influence the O3 concentration in Campo Grande, capital of the State of Mato Grosso do Sul.Keywords: Ozone. Nitrogen dioxide. Time series. Regressions. ANÁLISIS DE LA RELACIÓN ENTRE O3, NO Y NO2 UTILIZANDO MÚLTIPLES TÉCNICAS DE REGRESIÓN LINEAL.Resumen: Análisis y previsión de los parámetros de calidad del aire son temas importantes de la actual investigación de la atmósfera y el medio ambiente, debido al impacto de la contaminación atmosférica sobre la salud humana. Este estudio examina la transformación del dióxido de nitrógeno (NO2) en ozono (O3) en el entorno urbano, utilizando el diagrama de series de tiempo. Las concentraciones de los contaminantes ambientales de datos y variables climáticas fueron utilizadas para predecir la concentración de O3 en la atmósfera. El uso de múltiples modelos de regresión lineal como herramienta para predecir la concentración de O3 se puso a prueba. Los resultados indican que el valor de la temperatura y la presencia de NO2 influyen en la concentración de O3 en Campo Grande, capital del Estado de Mato Grosso do Sul.Palabras clave: Ozono. Dióxido de nitrógeno. Series de tiempo. Regresiones.


2008 ◽  
Vol 18 (12) ◽  
pp. 3679-3687 ◽  
Author(s):  
AYDIN A. CECEN ◽  
CAHIT ERKAL

We present a critical remark on the pitfalls of calculating the correlation dimension and the largest Lyapunov exponent from time series data when trend and periodicity exist. We consider a special case where a time series Zi can be expressed as the sum of two subsystems so that Zi = Xi + Yi and at least one of the subsystems is deterministic. We show that if the trend and periodicity are not properly removed, correlation dimension and Lyapunov exponent estimations yield misleading results, which can severely compromise the results of diagnostic tests and model identification. We also establish an analytic relationship between the largest Lyapunov exponents of the subsystems and that of the whole system. In addition, the impact of a periodic parameter perturbation on the Lyapunov exponent for the logistic map and the Lorenz system is discussed.


Sign in / Sign up

Export Citation Format

Share Document