Prediction of Children Diabetes by Autoregressive Integrated Moving Averages Model Using Big Data and Not Only SQL

2019 ◽  
Vol 16 (8) ◽  
pp. 3510-3513
Author(s):  
A. Bazila Banu ◽  
R. K. Priyadarshini ◽  
Ponniah Thirumalaikolundusubramanian

Enormous efforts have been made by the health care organizations to assess the frequency and occurrence of diabetes among children. The epidemiology of diabetes is estimated with different methods. However, to effectively manage and estimate the diabetes, monitoring systems like glucose meters and Continuous Glucose Monitoring Systems (CGM) can be used. CGM is a way to determine glucose levels right through the day and night. The data obtained from such systems can be utilized effectively to manage as well to predict the diabetes. As the glucose level of the patient is monitored throughout the day, it results in an enormous amount of data. It is difficult to analyze large datasets using SQL, therefore NoSQL is used for handling big data based prediction. One such NoSQL tool known as ArangoDB is used to process the dataset with Arango Query Language (AQL). Investigations relevant to selection of attributes required for the model are discussed. In this paper, ARIMA model has been implemented to predict the diabetes among children. The model is evaluated in terms of moving average of glucose value of a particular person on a specific day. The results show that ARIMA model is appropriate for predicting Time-Series data especially like data obtained by CGM systems.

2020 ◽  
Author(s):  
Sanyaolu Ameye ◽  
Michael Awoleye ◽  
Emmanuel Agogo ◽  
Ette Etuk

BACKGROUND The Coronavirus disease 2019 (COVID-2019) is a global pandemic and Nigeria is not left out in being affected. Though, the disease is just over three months since first case was identified in the country, we present a predictive model to forecast the number of cases expected to be seen in the country in the next 100 days. OBJECTIVE To implement a predictive model in forecasting the near future number of positive cases expected in the country following the present trend METHODS We performed an Auto Regressive Integrated Moving Average (ARIMA) model prediction on the epidemiological data obtained from Nigerian Centre for Disease Control to predict the epidemiological trend of the prevalence and incidence of COVID-2019. RESULTS There were 93 time series data points which lacked stationarity. From our ARIMA model, it is expected that the number of new cases declared per day will keep rising and towards the early September, 2020, Nigeria is expected to have well above sixty thousand confirmed cases. CONCLUSIONS We however believe that as we have more data points our model will be better fine-tuned.


MAUSAM ◽  
2021 ◽  
Vol 68 (2) ◽  
pp. 349-356
Author(s):  
J. HAZARIKA ◽  
B. PATHAK ◽  
A. N. PATOWARY

Perceptive the rainfall pattern is tough for the solution of several regional environmental issues of water resources management, with implications for agriculture, climate change, and natural calamity such as floods and droughts. Statistical computing, modeling and forecasting data are key instruments for studying these patterns. The study of time series analysis and forecasting has become a major tool in different applications in hydrology and environmental fields. Among the most effective approaches for analyzing time series data is the ARIMA (Autoregressive Integrated Moving Average) model introduced by Box and Jenkins. In this study, an attempt has been made to use Box-Jenkins methodology to build ARIMA model for monthly rainfall data taken from Dibrugarh for the period of 1980- 2014 with a total of 420 points.  We investigated and found that ARIMA (0, 0, 0) (0, 1, 1)12 model is suitable for the given data set. As such this model can be used to forecast the pattern of monthly rainfall for the upcoming years, which can help the decision makers to establish priorities in terms of agricultural, flood, water demand management etc.  


Author(s):  
Manikandan M. ◽  
Vishnu Prasad R. ◽  
Amit Kumar Mishra ◽  
Rajesh Kumar Konduru ◽  
Newtonraj A.

Background: As per World Health Organization (WHO) report 1.24 million people die each year as a result of road traffic accidents (RTA) globally. A vast majority of 20-50 million people suffer from non-fatal injuries, many of them ultimately end in disability. Forecasting RTA deaths could help in planning the intervention at the right time in an effective way.Methods: An attempt was made to forecast the RTA deaths in India with seasonal auto regressive integrated moving average (SARIMA) model. ARIMA model is one of the common methods which are used for forecasting variables as the method is very easy and requires only long time series data. The method of selection of appropriate ARIMA model has been explained in detail. Month wise RTA deaths for previous years data was collected from Govt. of India website. Data for 12 years (2001 to 2012) was extracted and appropriate ARIMA model was selected. Using the validated ARIMA model the RTA deaths are forecasted for 8 years (2013-2020).Results: The appropriate SARIMA (1,0,0) (2,1,0) 12 model was selected based on minimal AIC and BIC values. The forecasted RTA deaths show increasing trend overtime.Conclusions: There is an increasing trend in the forecasted numbers of road traffic accidental deaths and it also shows seasonality of RTA deaths with more number of accidents during the month of April and May in every years. It is recommended that the policy makers and transport authority should pay more attention to road traffic accidents and plan some effective intervention to reduce the burden of RTA deaths.


2019 ◽  
Vol 13 (3) ◽  
pp. 135-144
Author(s):  
Sasmita Hayoto ◽  
Yopi Andry Lesnussa ◽  
Henry W. M. Patty ◽  
Ronald John Djami

The Autoregressive Integrated Moving Average (ARIMA) model is often used to forecast time series data. In the era of globalization, rapidly progressing times, one of them in the field of transportation. The aircraft is one of the transportation that the residents can use to support their activities, both in business and tourism. The objective of the research is to know the forecasting of the number of passengers of airplanes at the arrival gate of Pattimura Ambon International Airport using ARIMA Box-Jenkins method. The best model selection is ARIMA (0, 1, 3) because it has significant parameter value and MSE value is smaller.


The challenging endeavor of a time series forecast model is to predict the future time series data accurately. Traditionally, the fundamental forecasting model in time series analysis is the autoregressive integrated moving average model or the ARIMA model requiring a model identification of a three-component vector which are the autoregressive order, the differencing order, and the moving average order before fitting coefficients of the model via the Box-Jenkins method. A model identification is analyzed via the sample autocorrelation function and the sample partial autocorrelation function which are effective tools for identifying the ARMA order but it is quite difficult for analysts. Even though a likelihood based-method is presented to automate this process by varying the ARIMA order and choosing the best one with the smallest criteria, such as Akaike information criterion. Nevertheless the obtained ARIMA model may not pass the residual diagnostic test. This paper presents the residual neural network model, called the self-identification ResNet-ARIMA order model to automatically learn the ARIMA order from known ARIMA time series data via sample autocorrelation function, the sample partial autocorrelation function and differencing time series images. In this work, the training time series data are randomly simulated and checked for stationary and invertibility properties before they are used. The result order from the model is used to generate and fit the ARIMA model by the Box-Jenkins method for predicting future values. The whole process of the forecasting time series algorithm is called the self-identification ResNet-ARIMA algorithm. The performance of the residual neural network model is evaluated by Precision, Recall and F1-score and is compared with the likelihood basedmethod and ResNET50. In addition, the performance of the forecasting time series algorithm is applied to the real world datasets to ensure the reliability by mean absolute percentage error, symmetric mean absolute percentage error, mean absolute error and root mean square error and this algorithm is confirmed with the residual diagnostic checks by the Ljung-Box test. From the experimental results, the new methodologies of this research outperforms other models in terms of identifying the order and predicting the future values.


2021 ◽  
Author(s):  
Almamunur Rashid ◽  
Mahiuddin Alamgir ◽  
Mohamad Tofayal Ahmed ◽  
Roquia Salam ◽  
Abu Reza Md. Towfiqul I ◽  
...  

Abstract Groundwater resource plays a crucial role for agricultural crop production and socio-economic development in some parts of the world including Bangladesh. Joypurhat district, the northwest part of Bangladesh, a crop production hub, is entirely dependent on groundwater irrigation. A precise assessment and prediction of groundwater level (GWL) can assist long-term GWR management, especially in drought-prone agricultural regions. Therefore, this study was carried out to identify trends and magnitude of GWL fluctuation (1980-2019) using the Modified Mann- Kendall test, Pettitt’s Test, and Sen Slope estimators in the drought-prone Joypurhat district, northwest Bangladesh. Time-series data analysis was performed to forecast GWL from 2020 to 2050 using the Auto-Regressive Integrated Moving Average (ARIMA) model. The findings of the MMK test revealed a significant declining trend of GWL, and the trend turning points were identified in the years 1991, 1993, 1997, and 2004, respectively. Results also indicate that the declining rate of GWL varied from 0.104 m/yr to 0.159 m/yr and the average rate of GWL declination was 0.136 m/yr during 1980-2019. The outcomes of wavelet spectrum analysis depicted two significant periods of the declining trend in Khetlal and Akkelpur Upazilas. The results obtained from the optimal identified model ARIMA (2,1,0), indicating that GWL will decline at a depth of 13.76 m in 2050, and the average declination rate of GWL will be 0.143 m/yr in the study area. The predicted results showed a similar declining tendency of GWL from 2020 to 2050, suggesting a disquieting condition, particularly for Khetlal Upazila. This research would provide a practical approach for GWL assessment and prediction that could help decision-makers implement long-term GWR management in the study area.


2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Xinli Zhang ◽  
Yu Yu ◽  
Fei Xiong ◽  
Le Luo

This paper is aimed at establishing a combined prediction model to predict the demand for medical care in terms of daily visits in an outpatient blood sampling room, which provides a basis for rational arrangement of human resources and planning. On the basis of analyzing the comprehensive characteristics of the randomness, periodicity, trend, and day-of-the-week effects of the daily number of blood collections in the hospital, we firstly established an autoregressive integrated moving average model (ARIMA) model to capture the periodicity, volatility, and trend, and secondly, we constructed a simple exponential smoothing (SES) model considering the day-of-the-week effect. Finally, a combined prediction model of the residual correction is established based on the prediction results of the two models. The models are applied to data from 60 weeks of daily visits in the outpatient blood sampling room of a large hospital in Chengdu, for forecasting the daily number of blood collections about 1 week ahead. The result shows that the MAPE of the combined model is the smallest overall, of which the improvement during the weekend is obvious, indicating that the prediction error of extreme value is significantly reduced. The ARIMA model can extract the seasonal and nonseasonal components of the time series, and the SES model can capture the overall trend and the influence of regular changes in the time series, while the combined prediction model, taking into account the comprehensive characteristics of the time series data, has better fitting prediction accuracy than a single model. The new model can well realize the short-to-medium-term prediction of the daily number of blood collections one week in advance.


2017 ◽  
Vol 12 (1) ◽  
pp. 43-50
Author(s):  
Umi Mahmudah

AbstractNowadays it is getting harder for higher education graduates in finding a decent job. This study aims to predict the graduate unemployment in Indonesia by using autoregressive integrated moving average (ARIMA) model. A time series data of the graduate unemployment from 2005 to 2016 is analyzed. The results suggest that ARIMA (1,2,0) is the best model for forecasting analysis, where there is a tendency of increasing number for the next ten periods. Furthermore, the average of point forecast for the next 10 periods is about 1,266,179 while its minimum value is 1,012,861 the maximum values is 1,523,156. Overall, ARIMA (1,2,0) provides an adequate forecasting model so that there is no potential for improvement.


2016 ◽  
Vol 23 (3) ◽  
pp. 302-322 ◽  
Author(s):  
Ka Chi Lam ◽  
Olalekan Shamsideen Oshodi

Purpose – Fluctuations in construction output has an adverse effect on the construction industry and the economy due to its strong linkage. Developing reliable and accurate predictive models is vital to implementing effective response strategies to mitigate the impact of such fluctuations. The purpose of this paper is to compare the accuracy of two univariate forecast models, i.e. Box-Jenkins (autoregressive integrated moving average (ARIMA)) and Neural Network Autoregressive (NNAR). Design/methodology/approach – Four quarterly time-series data on the construction output of Hong Kong were collected (1983Q1-2014Q4). The collected data were divided into two parts. The first part was fitted to the model, while the other was used to evaluate the predictive accuracy of the developed models. Findings – The NNAR model can provide reliable and accurate forecast of total, private and “others” construction output for the medium term. In addition, the NNAR model outperforms the ARIMA model, in terms of accuracy. Research limitations/implications – The applicability of the NNAR model to the construction industry of other countries could be further explored. The main limitation of artificial intelligence models is the lack of explanatory capability. Practical implications – The NNAR model could be used as a tool for accurately predicting future patterns in construction output. This is vital for the sustained growth of the construction industry and the economy. Originality/value – This is the first study to apply the NNAR model to construction output forecasting research.


Geofluids ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-15
Author(s):  
Yi-Hui Pang ◽  
Hong-Bo Wang ◽  
Jian-Jian Zhao ◽  
De-Yong Shang

Hydraulic support plays a key role in ground control of longwall mining. The smart prediction methods of support load are important for achieving intelligent mining. In this paper, the hydraulic support load data is decomposed into trend term, cycle term, and residual term, and it is found that the data has clear trend and period features, which can be called time series data. Based on the autoregression theory and weighted moving average method, the time series model is built to analyze the load data and predict its evolution trend, and the prediction accuracy of the sliding window model, ARIMA (Autoregressive Integrated Moving Average) model, and SARIMA (Seasonal Autoregressive Integrated Moving Average) model to the hydraulic support load under different parameters are evaluated, respectively. The results of single-point and multipoint prediction test with various sliding window values indicate that the sliding window method has no advantage in predicting the trend of the support load. The ARIMA model shows a better short-term trend prediction than the sliding window model. To some extent, increasing the length of the autoregressive term can improve the long-term prediction accuracy of the model, but it also increases the sensitivity of the model to support load fluctuation, and it is still difficult to predict the load trend in one support cycle. The SARIMA model has better prediction results than the sliding window model and the ARIMA model, which reveals the load evolution trend accurately during the whole support cycle. However, there are many external factors affecting the support load, such as overburden properties, hydraulic support moving speed, and worker’s operation. The smarter model of SARIMA considering these factors should be developed to be more suitable in predicting the hydraulic support load.


Sign in / Sign up

Export Citation Format

Share Document