scholarly journals Prediksi Barang Keluar TB. Wijaya Bangunan Menggunakan Algoritma KNN Regression dengan RStudio

2021 ◽  
Vol 6 (2) ◽  
pp. 90-97
Author(s):  
Natcha Kwintarini Suparman ◽  
Budi Arif Dermawan ◽  
Tesa Nur Padilah

TB. Wijaya Bangunan is a business entity that has weaknesses in managing inventories. This study aims to help TB. Wijaya Bangunan in managing inventory based on existing data reduce the difference between the number of incoming goods and the number of outgoing goods. The methods used are data collection, data preparation, data selection, preprocessing, data transformation, distance calculation, calculation of predictions, evaluation, and display of prediction results using a Shiny framework. This study uses the Time Series KNN Regression algorithm to predict the number of outgoing goods based on time series data with existing data. The most predicted results came out in the 9th week period as much as 22.40%. Based on the process that has been done, it can be concluded that the evaluation value of Root Mean Square Error (RMSE) is at least 3.55, which means it has the best predictive accuracy results.

2020 ◽  
Author(s):  
Hsiao-Ko Chang ◽  
Hui-Chih Wang ◽  
Chih-Fen Huang ◽  
Feipei Lai

BACKGROUND In most of Taiwan’s medical institutions, congestion is a serious problem for emergency departments. Due to a lack of beds, patients spend more time in emergency retention zones, which make it difficult to detect cardiac arrest (CA). OBJECTIVE We seek to develop a pharmaceutical early warning model to predict cardiac arrest in emergency departments via drug classification and medical expert suggestion. METHODS We propose a new early warning score model for detecting cardiac arrest via pharmaceutical classification and by using a sliding window; we apply learning-based algorithms to time-series data for a Pharmaceutical Early Warning Scoring Model (PEWSM). By treating pharmaceutical features as a dynamic time-series factor for cardiopulmonary resuscitation (CPR) patients, we increase sensitivity, reduce false alarm rates and mortality, and increase the model’s accuracy. To evaluate the proposed model we use the area under the receiver operating characteristic curve (AUROC). RESULTS Four important findings are as follows: (1) We identify the most important drug predictors: bits, and replenishers and regulators of water and electrolytes. The best AUROC of bits is 85%; that of replenishers and regulators of water and electrolytes is 86%. These two features are the most influential of the drug features in the task. (2) We verify feature selection, in which accounting for drugs improve the accuracy: In Task 1, the best AUROC of vital signs is 77%, and that of all features is 86%. In Task 2, the best AUROC of all features is 85%, which demonstrates that thus accounting for the drugs significantly affects prediction. (3) We use a better model: For traditional machine learning, this study adds a new AI technology: the long short-term memory (LSTM) model with the best time-series accuracy, comparable to the traditional random forest (RF) model; the two AUROC measures are 85%. (4) We determine whether the event can be predicted beforehand: The best classifier is still an RF model, in which the observational starting time is 4 hours before the CPR event. Although the accuracy is impaired, the predictive accuracy still reaches 70%. Therefore, we believe that CPR events can be predicted four hours before the event. CONCLUSIONS This paper uses a sliding window to account for dynamic time-series data consisting of the patient’s vital signs and drug injections. In a comparison with NEWS, we improve predictive accuracy via feature selection, which includes drugs as features. In addition, LSTM yields better performance with time-series data. The proposed PEWSM, which offers 4-hour predictions, is better than the National Early Warning Score (NEWS) in the literature. This also confirms that the doctor’s heuristic rules are consistent with the results found by machine learning algorithms.


2020 ◽  
Author(s):  
Hsiao-Ko Chang ◽  
Hui-Chih Wang ◽  
Chih-Fen Huang ◽  
Feipei Lai

BACKGROUND In most of Taiwan’s medical institutions, congestion is a serious problem for emergency departments. Due to a lack of beds, patients spend more time in emergency retention zones, which make it difficult to detect cardiac arrest (CA). OBJECTIVE We seek to develop a Drug Early Warning System Model (DEWSM), it included drug injections and vital signs as this research important features. We use it to predict cardiac arrest in emergency departments via drug classification and medical expert suggestion. METHODS We propose this new model for detecting cardiac arrest via drug classification and by using a sliding window; we apply learning-based algorithms to time-series data for a DEWSM. By treating drug features as a dynamic time-series factor for cardiopulmonary resuscitation (CPR) patients, we increase sensitivity, reduce false alarm rates and mortality, and increase the model’s accuracy. To evaluate the proposed model, we use the area under the receiver operating characteristic curve (AUROC). RESULTS Four important findings are as follows: (1) We identify the most important drug predictors: bits (intravenous therapy), and replenishers and regulators of water and electrolytes (fluid and electrolyte supplement). The best AUROC of bits is 85%, it means the medical expert suggest the drug features: bits, it will affect the vital signs, and then the evaluate this model correctly classified patients with CPR reach 85%; that of replenishers and regulators of water and electrolytes is 86%. These two features are the most influential of the drug features in the task. (2) We verify feature selection, in which accounting for drugs improve the accuracy: In Task 1, the best AUROC of vital signs is 77%, and that of all features is 86%. In Task 2, the best AUROC of all features is 85%, which demonstrates that thus accounting for the drugs significantly affects prediction. (3) We use a better model: For traditional machine learning, this study adds a new AI technology: the long short-term memory (LSTM) model with the best time-series accuracy, comparable to the traditional random forest (RF) model; the two AUROC measures are 85%. It can be seen that the use of new AI technology will achieve better results, currently comparable to the accuracy of traditional common RF, and the LSTM model can be adjusted in the future to obtain better results. (4) We determine whether the event can be predicted beforehand: The best classifier is still an RF model, in which the observational starting time is 4 hours before the CPR event. Although the accuracy is impaired, the predictive accuracy still reaches 70%. Therefore, we believe that CPR events can be predicted four hours before the event. CONCLUSIONS This paper uses a sliding window to account for dynamic time-series data consisting of the patient’s vital signs and drug injections. The National Early Warning Score (NEWS) only focuses on the score of vital signs, and does not include factors related to drug injections. In this study, the experimental results of adding the drug injections are better than only vital signs. In a comparison with NEWS, we improve predictive accuracy via feature selection, which includes drugs as features. In addition, we use traditional machine learning methods and deep learning (using LSTM method as the main processing time series data) as the basis for comparison of this research. The proposed DEWSM, which offers 4-hour predictions, is better than the NEWS in the literature. This also confirms that the doctor’s heuristic rules are consistent with the results found by machine learning algorithms.


2018 ◽  
Vol 8 (1) ◽  
pp. 16
Author(s):  
Ilaria Lucrezia Amerise ◽  
Agostino Tarsitano

The objective of this research is to develop a fast, simple method for detecting and replacing extreme spikes in high-frequency time series data. The method primarily consists  of a nonparametric procedure that pursues a balance between fidelity to observed data and smoothness. Furthermore, through examination of the absolute difference between original and smoothed values, the technique is also able to detect and, where necessary, replace outliers with less extreme data. Unlike other filtering procedures found in the literature, our method does not require a model to be specified for the data. Additionally, the filter makes only a single pass through the time series. Experiments  show that the new method can be validly used as a data preparation tool to ensure that time series modeling is supported by clean data, particularly in a complex context such as one with high-frequency data.


Author(s):  
Subhashis Datta ◽  
Achintya Mukhopadhyay ◽  
Dipankar Sanyal

A nonlinear fourth-order dynamic model of a thermal pulse combustor has been developed. In this work, the time series data generated by solution of the fourth order system is converted into a set of symbols based on the values of pressure variables. The key step to symbolization involves transformation of the original values to a stream of discretised symbols by partitioning the range of observed values into a finite number of regions and then assigning a symbol to each measurement based on the region in which it falls. Once all the measured values are symbolized, a symbol sequence vector consisting of L successive temporal observations is defined and its relative frequency is determined. In this work, the relative frequencies of different symbol sequences are computed by scanning the time series data in forward and reverse directions. The difference between the relative frequencies obtained in forward and reverse scanning is termed as "irreversibility" of the process. It is observed that for given alphabet and word sizes, the "irreversibility" increases as the system approaches extinction. The effects of different choices of alphabet and word sizes are also considered.


2013 ◽  
Vol 63 (2) ◽  
Author(s):  
M. H. Osman ◽  
Z. M. Nopiah ◽  
S. Abdullah ◽  
A. Lennie

An overlapping segmentation method on time series data is often used for preparing training dataset i.e. the population of instance, for classification data mining. Having large number of redundant instances would burden the training process with heavy computational operation. This would happen if practitioners fail to acknowledge an appropriate amount of overlap when performing the time series segmentation. Fortunately, the risk could be decreased if knowledge preferences can be determined to guide on overlapping criteria in the segmentation algorithm. Thus, this study aims to investigate how the Varri method is able to contribute for better understanding in preparing training dataset consists of irredundant fatigue segment from the loading history (fatigue signal). Generally, the method locates segment boundaries based on local maxima in the difference function which are above the assigned threshold. In the present study, the mean and standard deviation have been used to define the function due to the fact that predicting attributes are the key components in defining instance redundancy. The resulting dataset from the proposed method is trained by three classification algorithms under the supervision of the Genetic algorithms-based feature selection wrapper approach. The average performance index shows an additional advantage of the proposed method as compared to the conventional procedure in preparing training dataset.


2017 ◽  
Vol 04 (04) ◽  
pp. 1750045 ◽  
Author(s):  
Dilip B. Madan ◽  
King Wang

Market clichés assert that markets take escalators up and elevators down. The observation suggests differentiating models for up and down moves. Non-diffusive models allow for this and we model the move as the difference of two independent mean reverting increasing processes driven by gamma process shocks. The model is estimated on time series data as well as option data. Broadly speaking, the rise occurs with more frequent and smaller jumps with a faster rate of convergence to equilibrium. The down tick process has larger, less frequent moves with longer memories. Applications to delta hedging and the setting of profit targets and stop losses are also presented.


2016 ◽  
Author(s):  
Leonid Tiokhin ◽  
Daniel Hruschka

In a recent paper, Beall, Hofer, and Schaller (2016) use observational time series data to test the hypothesis that the 2014 Ebola outbreak influenced the 2014 U.S. Federal Elections. They find substantial associations between online search volume for Ebola and people’s tendency to vote Republican, an effect observed primarily in states with norms favoring Republican candidates. However, the analyses do not deal with the well-known problem of temporal autocorrelation in time series. We show that all variables analyzed exhibit extremely high levels of temporal autocorrelation (i.e. similarity in data-point values across time). After appropriately removing first-order autocorrelation, the observed relationships are attenuated and non-significant. This suggests that either no real associations exist, or that existing data are insufficiently powered to test the proposed hypotheses. We conclude by highlighting other pitfalls of observational data analysis, and draw attention to analytical strategies developed in related disciplines for avoiding these errors.


2016 ◽  
Author(s):  
Leonid Tiokhin ◽  
Daniel Hruschka

In a recent paper, Beall, Hofer, and Schaller (2016) use observational time series data to test the hypothesis that the 2014 Ebola outbreak influenced the 2014 U.S. Federal Elections. They find substantial associations between online search volume for Ebola and people’s tendency to vote Republican, an effect observed primarily in states with norms favoring Republican candidates. However, the analyses do not deal with the well-known problem of temporal autocorrelation in time series. We show that all variables analyzed exhibit extremely high levels of temporal autocorrelation (i.e. similarity in data-point values across time). After appropriately removing first-order autocorrelation, the observed relationships are attenuated and non-significant. This suggests that either no real associations exist, or that existing data are insufficiently powered to test the proposed hypotheses. We conclude by highlighting other pitfalls of observational data analysis, and draw attention to analytical strategies developed in related disciplines for avoiding these errors.


2020 ◽  
Vol 16 (4) ◽  
Author(s):  
Satish Chander ◽  
Vijaya Padmanabha ◽  
Joseph Mani

AbstractCOVID’19 is an emerging disease and the precise epidemiological profile does not exist in the world. Hence, the COVID’19 outbreak is treated as a Public Health Emergency of the International Concern by the World Health Organization (WHO). Hence, an effective and optimal prediction of COVID’19 mechanism, named Jaya Spider Monkey Optimization-based Deep Convolutional long short-term classifier (JayaSMO-based Deep ConvLSTM) is proposed in this research to predict the rate of confirmed, death, and recovered cases from the time series data. The proposed COVID’19 prediction method uses the COVID’19 data, which is the trending domain of research at the current era of fighting the COVID’19 attacks thereby, to reduce the death toll. However, the proposed JayaSMO algorithm is designed by integrating the Spider Monkey Optimization (SMO) with the Jaya algorithm, respectively. The Deep ConvLSTM classifier facilitates to predict the COVID’19 from the time series data based on the fitness function. Besides, the technical indicators, such as Relative Strength Index (RSI), Rate of Change (ROCR), Exponential Moving Average (EMA), Williams %R, Double Exponential Moving Average (DEMA), and Stochastic %K, are extracted effectively for further processing. Thus, the resulted output of the proposed JayaSMO-based Deep ConvLSTM is employed for COVID’19 prediction. Moreover, the developed model obtained the better performance using the metrics, like Mean Square Error (MSE), and Root Mean Square Error (RMSE) by considering confirmed, death, and the recovered cases of COVID’19 for China and Oman. Thus, the proposed JayaSMO-based Deep ConvLSTM showed improved results with a minimal MSE of 1.791, and the minimal RMSE of 1.338 based on confirmed cases in Oman. In addition, the developed model achieved the death cases with the values of 1.609, and 1.268 for MSE and RMSE, whereas the MSE and the RMSE value of 1.945, and 1.394 is achieved by the developed model using recovered cases in China.


2018 ◽  
Vol 12 (2) ◽  
pp. 115
Author(s):  
Ida Wahyuni ◽  
Fadhli Almu'iini Ahda

The uncertain pattern of rainfall causes apple farmers to be difficult in determining the flowering time that resulted in the apple harvest becomes not maximal. Many methods are used to predict rainfall, one of which is Fuzzy Inference System (FIS) Tsukamoto. Earlier research using this method managed to get a fairly small Root Mean Square Error (RMSE) value. In this research, FIS Tsukamoto method is used to create rainfall prediction modeling in four locations in Batu, East Java with the aim to get a small RMSE also. Tsukamoto's FIS method is used to predict rainfall with time series data from 2005 to 2014. The result of this research is the prototype of Tsukamoto FIS method that can be used to predict rainfall with RMSE error value in Junggo area of 9,196, in Pujon area of 9,407 , in Tinjomulyo area of 8,798, in Ngujung area of 8,825.


Sign in / Sign up

Export Citation Format

Share Document