scholarly journals On using predictive-ability tests in the selection of time-series prediction models: A Monte Carlo evaluation

Author(s):  
Mauro Costantini ◽  
Robert M. Kunst
Author(s):  
Stanisław Jankowski ◽  
Zbigniew Szymański ◽  
Zbigniew Wawrzyniak ◽  
Paweł Cichosz ◽  
Eliza Szczechla ◽  
...  

1995 ◽  
Vol 06 (02) ◽  
pp. 145-170 ◽  
Author(s):  
ALEX AUSSEM ◽  
FIONN MURTAGH ◽  
MARC SARAZIN

Dynamical Recurrent Neural Networks (DRNN) (Aussem 1995a) are a class of fully recurrent networks obtained by modeling synapses as autoregressive filters. By virtue of their internal dynamic, these networks approximate the underlying law governing the time series by a system of nonlinear difference equations of internal variables. They therefore provide history-sensitive forecasts without having to be explicitly fed with external memory. The model is trained by a local and recursive error propagation algorithm called temporal-recurrent-backpropagation. The efficiency of the procedure benefits from the exponential decay of the gradient terms backpropagated through the adjoint network. We assess the predictive ability of the DRNN model with meteorological and astronomical time series recorded around the candidate observation sites for the future VLT telescope. The hope is that reliable environmental forecasts provided with the model will allow the modern telescopes to be preset, a few hours in advance, in the most suited instrumental mode. In this perspective, the model is first appraised on precipitation measurements with traditional nonlinear AR and ARMA techniques using feedforward networks. Then we tackle a complex problem, namely the prediction of astronomical seeing, known to be a very erratic time series. A fuzzy coding approach is used to reduce the complexity of the underlying laws governing the seeing. Then, a fuzzy correspondence analysis is carried out to explore the internal relationships in the data. Based on a carefully selected set of meteorological variables at the same time-point, a nonlinear multiple regression, termed nowcasting (Murtagh et al. 1993, 1995), is carried out on the fuzzily coded seeing records. The DRNN is shown to outperform the fuzzy k-nearest neighbors method.


2020 ◽  
Vol 12 (11) ◽  
pp. 4730 ◽  
Author(s):  
Ping Wang ◽  
Hongyinping Feng ◽  
Guisheng Zhang ◽  
Daizong Yu

An accurate, reliable and stable air quality prediction system is conducive to the public health and management of atmospheric ecological environment; therefore, many models, individual or hybrid, have been implemented widely to deal with the prediction problem. However, many of these models do not take into consideration or extract improperly the period information in air quality index (AQI) time series, which impacts the models’ learning efficiency greatly. In this paper, a period extraction algorithm is proposed by using a Luenberger observer, and then a novel period-aware hybrid model combined the period extraction algorithm and tradition time series models is build to exploit the comprehensive forecasting capacity to the AQI time series with nonlinear and non-stationary noise. The hybrid model requires a multi-phase implementation. In the first step, the Luenberger observer is used to estimate the implied period function in the one-dimensional AQI series, and then the analyzed time series is mapped to the period space through the function to obtain the period information sub-series of the original series. In the second step, the period sub-series is combined with the original input vector as input vector components according to the time points to establish a new data set. Finally, the new data set containing period information is applied to train the traditional time series prediction models. Both theoretical proof and experimental results obtained on the AQI hour values of Beijing, Tianjin, Taiyuan and Shijiazhuang in North China prove that the hybrid model with period information presents stronger robustness and better forecasting accuracy than the traditional benchmark models.


2019 ◽  
Vol 17 (4) ◽  
pp. 22
Author(s):  
Omar Abbara ◽  
Mauricio Zevallos

<p>The paper assesses the method proposed by Shumway and Stoffer (2006, Chapter 6, Section 10) to estimate the parameters and volatility of stochastic volatility models. First, the paper presents a Monte Carlo evaluation of the parameter estimates considering several distributions for the perturbations in the observation equation. Second, the method is assessed empirically, through backtesting evaluation of VaR forecasts of the S&amp;P 500 time series returns. In both analyses, the paper also evaluates the convenience of using the Fuller transformation.</p>


Agriculture ◽  
2020 ◽  
Vol 10 (8) ◽  
pp. 308
Author(s):  
Reyna Persa ◽  
Arthur Bernardeli ◽  
Diego Jarquin

The availability of molecular markers has revolutionized conventional ways to improve genotypes in plant and animal breeding through genome-based predictions. Several models and methods have been developed to leverage the genomic information in the prediction context to allow more efficient ways to screen and select superior genotypes. In plant breeding, usually, grain yield (yield) is the main trait to drive the selection of superior genotypes; however, in many cases, the information of associated traits is also routinely collected and it can potentially be used to enhance the selection. In this research, we considered different prediction strategies to leverage the information of the associated traits ([AT]; full: all traits observed for the same genotype; and partial: some traits observed for the same genotype) under an alternative single-trait model and the multi-trait approach. The alternative single-trait model included the information of the AT for yield prediction via the phenotypic covariances while the multi-trait model jointly analyzed all the traits. The performance of these strategies was assessed using the marker and phenotypic information from the Soybean Nested Association Mapping (SoyNAM) project observed in Nebraska in 2012. The results showed that the alternative single-trait strategy, which combines the marker and the information of the AT, outperforms the multi-trait model by around 12% and the conventional single-trait strategy (baseline) by 25%. When no information on the AT was available for those genotypes in the testing sets, the multi-trait model reduced the baseline results by around 6%. For the cases where genotypes were partially observed (i.e., some traits observed but not others for the same genotype), the multi-trait strategy showed improvements of around 6% for yield and between 2% to 9% for the other traits. Hence, when yield drives the selection of superior genotypes, the single-trait and multi-trait genomic prediction will achieve significant improvements when some genotypes have been fully or partially tested, with the alternative single-trait model delivering the best results. These results provide empirical evidence of the usefulness of the AT for improving the predictive ability of prediction models for breeding applications.


Author(s):  
Ronald Wesonga ◽  
Fabian Nabugoomu ◽  
Brian Masimbi

Flight delays affect passenger travel satisfaction and increase airline costs. The authors explore airline differences with a focus on their delays based on autoregressive integrated moving averages. Aviation daily data were used in the analysis and model development. Time series modelling for six airlines was done to predict delays as a function of airport's timeliness performance. Findings show differences in the time series prediction models by airline. Differential analysis in the time series prediction models for airline delay suggests variations in airline efficiencies though at the same airport. The differences could be attributed to different management styles in the countries where the airlines originate. Thus, to improve airport timeliness performance, the study recommends airline disaggregated studies to explore the dynamics attributable to determinants of airline unique characteristics.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Irfan Haider Shakri

Purpose The purpose of this study is to compare five data-driven-based ML techniques to predict the time series data of Bitcoin returns, namely, alternating model tree, random forest (RF), multiple linear regression, multi-layer perceptron regression and M5 Tree algorithms. Design/methodology/approach The data used to forecast time series data of Bitcoin returns ranges from 8 July 2010 to 30 Aug 2020. This study used several predictors to predict bitcoin returns including economic policy uncertainty, equity market volatility index, S&P returns, USD/EURO exchange rates, oil and gold prices, volatilities and returns. Five statistical indexes, namely, correlation coefficient, mean absolute error, root mean square error, relative absolute error and root relative squared error are determined. The results of these metrices are used to develop colour intensity ranking. Findings Among the machine learning (ML) techniques used in this study, RF models has shown superior predictive ability for estimating the Bitcoin returns. Originality/value This study is first of its kind to use and compare ML models in the prediction of Bitcoins. More studies can be carried out by using further cryptocurrencies and other ML data-driven models in future.


monitoring the behavior of computer networks is essential for problem identification and optimal management. Part of this behavior to be monitored is the utilization of the network bandwidth. Several techniques are used to model and forecast network traffic such as time series models, modern data mining techniques, soft computing approaches, and neural networks are used for network traffic analysis and prediction. Efficient bandwidth utilization and optimization are very interesting research issues in effective networks because bandwidth is one of the most required and expensive Internet components needed today. It is generally known that the higher the bandwidth available, the better the network performance, thus an essential aid for network design and bandwidth wastage control and a need for traffic models which can capture the characteristics is necessary. In this paper, a time series prediction models were proposed for LAN office network bandwidth utilization. The proposed prediction models are tested by using evaluation metrics used in time series such as MSE and performance evaluation plot. Testing results show that the proposed models can enhance the detection of bandwidth traffic and provide an efficient tool for bandwidth utilization.


Sign in / Sign up

Export Citation Format

Share Document