scholarly journals A New Strategy of Hybrid Models using ARIMA, ANN, and DWT in Time Series Modelling

2021 ◽  
Vol 25 (1) ◽  
pp. 27-50
Author(s):  
Tsung-Lin Li ◽  
◽  
Chen-An Tsai ◽  

Time series forecasting is a challenging task of interest in many disciplines. A variety of techniques have been developed to deal with the problem through a combination of different disciplines. Although various researches have proved successful for hybrid models, none of them carried out the comparisons with solid statistical test. This paper proposes a new stepwise model determination method for artificial neural network (ANN) and a novel hybrid model combining autoregressive integrated moving average (ARIMA) model, ANN and discrete wavelet transformation (DWT). Simulation studies are conducted to compare the performance of different models, including ARIMA, ANN, ARIMA-ANN, DWT-ARIMA-ANN and the proposed method, ARIMA-DWT-ANN. Also, two real data sets, Lynx data and cabbage data, are used to demonstrate the applications. Our proposed method, ARIMA-DWT-ANN, outperforms other methods in both simulated datasets and Lynx data, while ANN shows a better performance in the cabbage data. We conducted a two-way ANOVA test to compare the performances of methods. The results showed a significant difference between methods. As a brief conclusion, it is suggested to try on ANN and ARIMA-DWT-ANN due to their robustness and high accuracy. Since the performance of hybrid models may vary across data sets based on their ARIMA alike or ANN alike natures, they should all be considered when encountering a new data to reach an optimal performance.

2012 ◽  
Author(s):  
Ruhaidah Samsudin ◽  
Puteh Saad ◽  
Ani Shabri

In this paper, time series prediction is considered as a problem of missing value. A model for the determination of the missing time series value is presented. The hybrid model integrating autoregressive intergrated moving average (ARIMA) and artificial neural network (ANN) model is developed to solve this problem. The developed models attempts to incorporate the linear characteristics of an ARIMA model and nonlinear patterns of ANN to create a hybrid model. In this study, time series modeling of rice yield data in Muda Irrigation area. Malaysia from 1995 to 2003 are considered. Experimental results with rice yields data sets indicate that the hybrid model improve the forecasting performance by either of the models used separately. Key words: ARIMA; Box and Jenkins; neural networks; rice yields; hybrid ANN model


2019 ◽  
Vol 10 (1) ◽  
pp. 17
Author(s):  
Isnaini Nuzula Agustin

AbstractEfficient Market is the market where all traded securities prices reflects all available information. Market Efficient Hypotesis in the Weak Form stated that past stock price movement incorporated with current securities’s prices, thus it can be used to predicting the current price or return. The objective of this research is to examine the weak form of Efficient Market Hypothesis (EMH) in Indonesia Sharia Stock Index (ISSI) over the period of January 3rd2017 -February 8th 2019. To Examine the EMH, some appropriate tests are developed, these are: Run Test, Autocorrelation Test, Autoregressive Integrated Moving Average (ARIMA), and Paired Sample t-test. The result findings showing that ISSI is not efficient in the weak form during the period of the study. Moreover, in accordance with time series modelling result, the fitted model is ARIMA (1,1,1) with accuracy level of 78%. This result proved that ARIMA model successfully and accurately in forecasting ISSI indices. It can be implied that the historical stock index data in the past still described the stock index information in the future. Thus, technical analysis is still feasible to do as the guide for investors in conducting transactions in the capital market.AbstrakPasar yang efisien adalah pasar dimana semua harga sekuritas yang diperdagangkan telah mencerminkan semua informasi yang tersedia. Teori pasar efisien bentuk lemah menyatakan bahwa perubahan harga masa lalu tidak berhubungan dengan harga sekuritas sekarang, sehingga tidak dapat digunakan untuk memprediksi harga atau return dari sekuritas. Penelitian ini bertujuan untuk melakukan pengujian hipotesis pasar efisien bentuk lemah pada Indeks Saham Syariah Indonesia (ISSI). Data diambil pada periode 3 Januari 2017 – 8 Februari 2019. Pada tahap awal penelitian, Run test dan Autocorrelation test dilakukan untuk melihat apakah pasar efisien bentuk lemah berlaku pada ISSI. Selanjutnya dilakukan pembentukan pemodelan time series ARIMA untuk melihat teknik prediksi yang sesuai untuk memprediksi Indeks Saham ISSI. Hasil Run test dan Autocorrelation test menunjukkan bahwa hipotesis pasar efisien bentuk lemah tidak terbukti. Pada pembentukan model ARIMA, terlihat bahwa model yang sesuai adalah ARIMA (1,1,1) menghasilkan tingkat akurasi sebesar 78%. Hal ini membuktikan bahwa model ARIMA berhasil dan akurat digunakan untuk memprediksi Indeks Harga Saham ISSI. Oleh karena itu, analisis teknikal masih dapat digunakan oleh investor untuk menjadi pedoman dalam melakukan transaksi perdagangan di pasar modal.


Author(s):  
Henry M. Kpamma ◽  
Silverius K. Bruku ◽  
John A. Awaab

Aims/ Objectives: This research was carried out with the intention of using time series to model the volume of overland timber exported within Bolgatanga municipalityPlace and Duration of Study: Study of the time series was based on a historical data of the volume of timber exported for twenty consecutive years, from 1999 to 2019 within Bolgatanga municipality.Methodology: The three-stage iterative modeling approach for Box Jenkins was used to match an ARIMA model and to forecast both the amount of timber export and the confiscated lumber. ARIMA method incorporates a cycle of autoregressive and a moving average. The three-stage iterative modeling technique of Box Jenkins which were used are model recognition, parameter estimation and/or diagnostic checks were also made. Results: From the preliminary investigation, the study showed that the amount of timber exported in municipality is skewed to the right, suggesting that much of the amount of timber exported is below the average. This, together with the high volatility in the volume of timber exported, indicates that the amount of timber exported within the municipalities during the twenty-year period was low. The plots from the trends also showed robust variations in the volume of timber exported indicating that timber exporters do not have better grips with the concepts and applications of export technology, hence the erratic nature of the volume of timber exported over the period. The quadratic pattern and the ARIMA (1,1,1) model best represented the amount of timber exported.The analysis further indicated that there will be a further decrease in the amount of timber export from the five years projection into the future. Over the last two decades the Bayesian approach to VAR has gained ground. For a future report, this estimation method will be followed to examine the ”long-run equilibrium relationships” between timber export volumes and climate change.Conclusion: The quadratic pattern and the ARIMA (1,1,1) model best represented the amount of timber exported. There will be a further decrease in the amount of timber export from the five years projection into the future.


Author(s):  
Gaetano Perone

AbstractThe coronavirus disease (COVID-19) is a severe, ongoing, novel pandemic that emerged in Wuhan, China, in December 2019. As of January 21, 2021, the virus had infected approximately 100 million people, causing over 2 million deaths. This article analyzed several time series forecasting methods to predict the spread of COVID-19 during the pandemic’s second wave in Italy (the period after October 13, 2020). The autoregressive moving average (ARIMA) model, innovations state space models for exponential smoothing (ETS), the neural network autoregression (NNAR) model, the trigonometric exponential smoothing state space model with Box–Cox transformation, ARMA errors, and trend and seasonal components (TBATS), and all of their feasible hybrid combinations were employed to forecast the number of patients hospitalized with mild symptoms and the number of patients hospitalized in the intensive care units (ICU). The data for the period February 21, 2020–October 13, 2020 were extracted from the website of the Italian Ministry of Health (www.salute.gov.it). The results showed that (i) hybrid models were better at capturing the linear, nonlinear, and seasonal pandemic patterns, significantly outperforming the respective single models for both time series, and (ii) the numbers of COVID-19-related hospitalizations of patients with mild symptoms and in the ICU were projected to increase rapidly from October 2020 to mid-November 2020. According to the estimations, the necessary ordinary and intensive care beds were expected to double in 10 days and to triple in approximately 20 days. These predictions were consistent with the observed trend, demonstrating that hybrid models may facilitate public health authorities’ decision-making, especially in the short-term.


2017 ◽  
Vol 9 (2) ◽  
pp. 653-657 ◽  
Author(s):  
A. Anuja ◽  
V. K. Yadav ◽  
V. S. Bharti ◽  
N. R. Kumar

Tamil Nadu is situated in the south eastern coast of the Indian peninsula with a coastal line of 1076 km (13% of the country’s coast line), 0.19 million sq.km of EEZ (9.4 % of total national EEZ) and a continental shelf of about 41,412 sq. km. This is one of the country’s leading state in marine fish production and ranks third in marine fish production. In Tamil Nadu, Ramanathapuram district is a leading maritime district followed by Nagapattinam and Thoothukudi. The objective of this study was to investigate the trends in marine fish production in Tamil Nadu. Yearly fish production data for the period of 1988-1989 to 2012-2013 were analyzed using time-series method called Autoregressive Integrated Moving Average (ARIMA) model and Regression analysis (curve estimation). In our study, the developed best ARIMA model for Tamil Nadu marine fish production was found to be ARIMA (1, 1, 1) which have the minimum BIC (Bayesian Information Criterion). ARIMA model had got a slightly higher forecasting accuracy rate for forecasting marine fish production of Tamil Nadu than Regression trend analysis. The independent sample test showed there was no significant difference between the two models. The limitations of ARIMA model include its requirement of a long time series data for better forecast. It is basically linear model assuming that data are stationary and have a limited ability to capture non-stationarities and nonlinearities in series data. Both the models indicated that Tamil Nadu marine fish production has plateaued and fishermen should be encouraged to adopt sustainable fishing practices.


Energies ◽  
2020 ◽  
Vol 13 (5) ◽  
pp. 1085
Author(s):  
Syed Naeem Haider ◽  
Qianchuan Zhao ◽  
Xueliang Li

Prediction of a battery’s health in data centers plays a significant role in Battery Management Systems (BMS). Data centers use thousands of batteries, and their lifespan ultimately decreases over time. Predicting battery’s degradation status is very critical, even before the first failure is encountered during its discharge cycle, which also turns out to be a very difficult task in real life. Therefore, a framework to improve Auto-Regressive Integrated Moving Average (ARIMA) accuracy for forecasting battery’s health with clustered predictors is proposed. Clustering approaches, such as Dynamic Time Warping (DTW) or k-shape-based, are beneficial to find patterns in data sets with multiple time series. The aspect of large number of batteries in a data center is used to cluster the voltage patterns, which are further utilized to improve the accuracy of the ARIMA model. Our proposed work shows that the forecasting accuracy of the ARIMA model is significantly improved by applying the results of the clustered predictor for batteries in a real data center. This paper presents the actual historical data of 40 batteries of the large-scale data center for one whole year to validate the effectiveness of the proposed methodology.


Genetics ◽  
2000 ◽  
Vol 154 (1) ◽  
pp. 381-395
Author(s):  
Pavel Morozov ◽  
Tatyana Sitnikova ◽  
Gary Churchill ◽  
Francisco José Ayala ◽  
Andrey Rzhetsky

Abstract We propose models for describing replacement rate variation in genes and proteins, in which the profile of relative replacement rates along the length of a given sequence is defined as a function of the site number. We consider here two types of functions, one derived from the cosine Fourier series, and the other from discrete wavelet transforms. The number of parameters used for characterizing the substitution rates along the sequences can be flexibly changed and in their most parameter-rich versions, both Fourier and wavelet models become equivalent to the unrestricted-rates model, in which each site of a sequence alignment evolves at a unique rate. When applied to a few real data sets, the new models appeared to fit data better than the discrete gamma model when compared with the Akaike information criterion and the likelihood-ratio test, although the parametric bootstrap version of the Cox test performed for one of the data sets indicated that the difference in likelihoods between the two models is not significant. The new models are applicable to testing biological hypotheses such as the statistical identity of rate variation profiles among homologous protein families. These models are also useful for determining regions in genes and proteins that evolve significantly faster or slower than the sequence average. We illustrate the application of the new method by analyzing human immunoglobulin and Drosophilid alcohol dehydrogenase sequences.


2019 ◽  
Vol 147 ◽  
Author(s):  
C. W. Tian ◽  
H. Wang ◽  
X. M. Luo

AbstractSeasonal autoregressive-integrated moving average (SARIMA) has been widely used to model and forecast incidence of infectious diseases in time-series analysis. This study aimed to model and forecast monthly cases of hand, foot and mouth disease (HFMD) in China. Monthly incidence HFMD cases in China from May 2008 to August 2018 were analysed with the SARIMA model. A seasonal variation of HFMD incidence was found from May 2008 to August 2018 in China, with a predominant peak from April to July and a trough from January to March. In addition, the annual peak occurred periodically with a large annual peak followed by a relatively small annual peak. A SARIMA model of SARIMA (1, 1, 2) (0, 1, 1)12 was identified, and the mean error rate and determination coefficient were 16.86% and 94.27%, respectively. There was an annual periodicity and seasonal variation of HFMD incidence in China, which could be predicted well by a SARIMA (1, 1, 2) (0, 1, 1)12 model.


2017 ◽  
Vol 19 (2) ◽  
pp. 261-281 ◽  
Author(s):  
Sahbi Boubaker

In this paper, a modeling-identification approach for the monthly municipal water demand system in Hail region, Saudi Arabia, is developed. This approach is based on an auto-regressive integrated moving average (ARIMA) model tuned by the particle swarm optimization (PSO). The ARIMA (p, d, q) modeling requires estimation of the integer orders p and q of the AR and MA parts; and the real coefficients of the model. More than being simple, easy to implement and effective, the PSO-ARIMA model does not require data pre-processing (original time-series normalization for artificial neural network (ANN) or data stationarization for traditional stochastic time-series (STS)). Moreover, its performance indicators such as the mean absolute percentage error (MAPE), coefficient of determination (R2), root mean squared error (RMSE) and average absolute relative error (AARE) are compared with those of ANN and STS. The obtained results show that the PSO-ARIMA outperforms the ANN and STS approaches since it can optimize simultaneously integer and real parameters and provides better accuracy in terms of MAPE (5.2832%), R2 (0.9375), RMSE (2.2111 × 105m3) and AARE (5.2911%). The PSO-ARIMA model has been implemented using 69 records (for both training and testing). The results can help local water decision makers to better manage the current water resources and to plan extensions in response to the increasing need.


2021 ◽  
pp. 1-13
Author(s):  
Muhammad Rafi ◽  
Mohammad Taha Wahab ◽  
Muhammad Bilal Khan ◽  
Hani Raza

Automatic Teller Machine (ATM) are still largely used to dispense cash to the customers. ATM cash replenishment is a process of refilling ATM machine with a specific amount of cash. Due to vacillating users demands and seasonal patterns, it is a very challenging problem for the financial institutions to keep the optimal amount of cash for each ATM. In this paper, we present a time series model based on Auto Regressive Integrated Moving Average (ARIMA) technique called Time Series ARIMA Model for ATM (TASM4ATM). This study used ATM back-end refilling historical data from 6 different financial organizations in Pakistan. There are 2040 distinct ATMs and 18 month of replenishment data from these ATMs are used to train the proposed model. The model is compared with the state-of- the-art models like Recurrent Neural Network (RNN) and Amazon’s DeepAR model. Two approaches are used for forecasting (i) Single ATM and (ii) clusters of ATMs (In which ATMs are clustered with similar cash-demands). The Mean Absolute Percentage Error (MAPE) and Symmetric Mean Absolute Percentage Error (SMAPE) are used to evaluate the models. The suggested model produces far better forecasting as compared to the models in comparison and produced an average of 7.86/7.99 values for MAPE/SMAPE errors on individual ATMs and average of 6.57/6.64 values for MAPE/SMAPE errors on clusters of ATMs.


Sign in / Sign up

Export Citation Format

Share Document