scholarly journals Development of missing data prediction model for carbon monoxide

2019 ◽  
Vol 15 (1) ◽  
pp. 13-17
Author(s):  
Nurul Latiffah Abd Rani ◽  
Azman Azid ◽  
Muhamad Shirwan Abdullah Sani ◽  
Mohd Saiful Samsudin ◽  
Ku Mohd Kalkausar Ku Yusof ◽  
...  

Carbon monoxide (CO) is one of the most important pollutants since it is selected for API calculation. Therefore, it is paramount to ensure that there is no missing data of CO during the analysis. There are numbers of occurrences that may contribute to the missing data problems such as inability of the instrument to record certain parameters. In view of this fact, a CO prediction model needs to be developed to address this problem. A dataset of meteorological and air pollutants value was obtained from the Air Quality Division, Department of Environment Malaysia (DOE). A total of 113112 datasets were used to develop the model using sensitivity analysis (SA) through artificial neural network (ANN). SA showed particulate matter (PM10) and ozone (O3) were the most significant input variables for missing data prediction model of CO. Three hidden nodes were the optimum number to develop the ANN model with the value of R2 equal to 0.5311. Both models (artificial neural network-carbon monoxide-all parameters (ANN-CO-AP) and artificial neural network-carbon monoxide-leave out (ANN-CO-LO)) showed high value of R2 (0.7639 and 0.5311) and low value of RMSE (0.2482 and 0.3506), respectively. These values indicated that the models might only employ the most significant input variables to represent the CO rather than using all input variables.

2021 ◽  
Vol 3 (2) ◽  
Author(s):  
Charles Gbenga Williams ◽  
Oluwapelumi O. Ojuri

AbstractAs a result of heterogeneity nature of soils and variation in its hydraulic conductivity over several orders of magnitude for various soil types from fine-grained to coarse-grained soils, predictive methods to estimate hydraulic conductivity of soils from properties considered more easily obtainable have now been given an appropriate consideration. This study evaluates the performance of artificial neural network (ANN) being one of the popular computational intelligence techniques in predicting hydraulic conductivity of wide range of soil types and compared with the traditional multiple linear regression (MLR). ANN and MLR models were developed using six input variables. Results revealed that only three input variables were statistically significant in MLR model development. Performance evaluations of the developed models using determination coefficient and mean square error show that the prediction capability of ANN is far better than MLR. In addition, comparative study with available existing models shows that the developed ANN and MLR in this study performed relatively better.


2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Abolghasem Daeichian ◽  
Rana Shahramfar ◽  
Elham Heidari

Abstract Lime is a significant material in many industrial processes, including steelmaking by blast furnace. Lime production through rotary kilns is a standard method in industries, yet it has depreciation, high energy consumption, and environmental pollution. A model of the lime production process can help to not only increase our knowledge and awareness but also can help reduce its disadvantages. This paper presents a black-box model by Artificial Neural Network (ANN) for the lime production process considering pre-heater, rotary kiln, and cooler parameters. To this end, actual data are collected from Zobahan Isfahan Steel Company, Iran, which consists of 746 data obtained in a duration of one year. The proposed model considers 23 input variables, predicting the amount of produced lime as an output variable. The ANN parameters such as number of hidden layers, number of neurons in each layer, activation functions, and training algorithm are optimized. Then, the sensitivity of the optimum model to the input variables is investigated. Top-three input variables are selected on the basis of one-group sensitivity analysis and their interactions are studied. Finally, an ANN model is developed considering the top-three most effective input variables. The mean square error of the proposed models with 23 and 3 inputs are equal to 0.000693 and 0.004061, respectively, which shows a high prediction capability of the two proposed models.


2019 ◽  
Vol 331 ◽  
pp. 336-345 ◽  
Author(s):  
Zebin Yang ◽  
Dennis K.J. Lin ◽  
Aijun Zhang

Author(s):  
Yi-Shu Chen ◽  
Dan Chen ◽  
Chao Shen ◽  
Ming Chen ◽  
Chao-Hui Jin ◽  
...  

Abstract Background The artificial neural network (ANN) emerged recently as a potent diagnostic tool, especially for complicated systemic diseases. This study aimed to establish a diagnostic model for the recognition of fatty liver disease (FLD) by virtue of the ANN. Methods A total of 7,396 pairs of gender- and age-matched subjects who underwent health check-ups at the First Affiliated Hospital, College of Medicine, Zhejiang University (Hangzhou, China) were enrolled to establish the ANN model. Indices available in health check-up reports were utilized as potential input variables. The performance of our model was evaluated through a receiver-operating characteristic (ROC) curve analysis. Other outcome measures included diagnostic accuracy, sensitivity, specificity, Cohen’s k coefficient, Brier score, and Hosmer-Lemeshow test. The Fatty Liver Index (FLI) and the Hepatic Steatosis Index (HSI), retrained using our training-group data with its original designated input variables, were used as comparisons in the capability of FLD diagnosis. Results Eight variables (age, gender, body mass index, alanine aminotransferase, aspartate aminotransferase, uric acid, total triglyceride, and fasting plasma glucose) were eventually adopted as input nodes of the ANN model. By applying a cut-off point of 0.51, the area under ROC curves of our ANN model in predicting FLD in the testing group was 0.908 [95% confidence interval (CI), 0.901–0.915]—significantly higher (P < 0.05) than that of the FLI model (0.881, 95% CI, 0.872–0.891) and that of the HSI model (0.885; 95% CI, 0.877–0.893). Our ANN model exhibited higher diagnostic accuracy, better concordance with ultrasonography results, and superior capability of calibration than the FLI model and the HSI model. Conclusions Our ANN system showed good capability in the diagnosis of FLD. It is anticipated that our ANN model will be of both clinical and epidemiological use in the future.


Author(s):  
Hadjira Maouz ◽  
◽  
Asma Adda ◽  
Salah Hanini ◽  
◽  
...  

The concentration of carbonyl is one of the most important properties contributing to the detection of the thermal aging of polymer ethylene propylene diene monomer (EPDM). In this publication, an artificial neural network (ANN) model was developed to predict concentration of carbenyl during the thermal aging of EPDM using a database consisting of seven input variables. The best fitting training data was obtained with the architecture of (7 inputs neurons, 10 hidden neurons and 1 output neuron). A Levenberg Marquardt learning (LM) algorithm, hyperbolic tangent transfer function were used at the hidden and output layer respectively. The optimal ANN was obtained with a high correlation coefficient R= 0.995 and a very low root mean square error RMSE = 0.0148 mol/l during the generalization phase. The comparison between the experimental and calculated results show that the ANN model is able of predicted the concentration of carbonyl during the thermal aging of ethylene propylene diene monomer


2014 ◽  
Vol 17 (1) ◽  
pp. 56-74 ◽  
Author(s):  
Gurjeet Singh ◽  
Rabindra K. Panda ◽  
Marc Lamers

The reported study was undertaken in a small agricultural watershed, namely, Kapgari in Eastern India having a drainage area of 973 ha. The watershed was subdivided into three sub-watersheds on the basis of drainage network and land topography. An attempt was made to relate the continuously monitored runoff data from the sub-watersheds and the whole-watershed with the rainfall and temperature data using the artificial neural network (ANN) technique. The reported study also evaluated the bias in the prediction of daily runoff with shorter length of training data set using different resampling techniques with the ANN modeling. A 10-fold cross-validation (CV) technique was used to find the optimum number of hidden neurons in the hidden layer and to avoid neural network over-fitting during the training process for shorter length of data. The results illustrated that the ANN models developed with shorter length of training data set avoid neural network over-fitting during the training process, using a 10-fold CV method. Moreover, the biasness was investigated using the bootstrap resampling technique based ANN (BANN) for short length of training data set. In comparison with the 10-fold CV technique, the BANN is more efficient in solving the problems of the over-fitting and under-fitting during training of models for shorter length of data set.


Sign in / Sign up

Export Citation Format

Share Document