Comparative Analysis of Energy Poverty Prediction Models Using Machine Learning Algorithms

2021 ◽  
Vol 56 (5) ◽  
pp. 239-255
Author(s):  
Zhe Hong ◽  
In Kwon Park
2021 ◽  
Vol 40 ◽  
pp. 03007
Author(s):  
Ruby Hasan

In the last few years, cardiovascular diseases have emerged as one of the most common causes of deaths worldwide. The lifestyle changes, eating habits, working cultures etc, has significantly contributed to this alarming issue across the globe including the developed, underdeveloped and developing nations. Early detection of the initial signs of cardiovascular diseases and the continuous medical supervision can help in reducing rising number of patients and eventually the mortality rate. However with limited medical facilities and specialist doctors, it is difficult to continuously monitor the patients and provide consultations. Technological interventions are required to facilitate the patient monitoring and treatment. The healthcare data generated through various medical procedures and continuous patient monitoring can be utilized to develop efficient prediction models for cardiovascular diseases. The early prognosis of cardiovascular illnesses can aid in making decisions on life-style changes in high hazard sufferers and in turn lessen the complications, which may be an outstanding milestone inside the field of medicine. This paper studies some of the most widely used machine learning algorithms for heart disease prediction by using the medical data and historical information. The various techniques are discussed and a comparative analysis of the same is presented. This report compares five common strategies for predicting the chance of heart attack that have been published in the literature. KNN, Decision Tree, Gaussian Naive Bayes, Logistic Regression, and Random Forest are some of the approaches used. Further, the paper also highlights the advantages and disadvantages of using the various techniques for developing the prediction models.


2020 ◽  
Vol 20 (1) ◽  
Author(s):  
Matthijs Blankers ◽  
Louk F. M. van der Post ◽  
Jack J. M. Dekker

Abstract Background Accurate prediction models for whether patients on the verge of a psychiatric criseis need hospitalization are lacking and machine learning methods may help improve the accuracy of psychiatric hospitalization prediction models. In this paper we evaluate the accuracy of ten machine learning algorithms, including the generalized linear model (GLM/logistic regression) to predict psychiatric hospitalization in the first 12 months after a psychiatric crisis care contact. We also evaluate an ensemble model to optimize the accuracy and we explore individual predictors of hospitalization. Methods Data from 2084 patients included in the longitudinal Amsterdam Study of Acute Psychiatry with at least one reported psychiatric crisis care contact were included. Target variable for the prediction models was whether the patient was hospitalized in the 12 months following inclusion. The predictive power of 39 variables related to patients’ socio-demographics, clinical characteristics and previous mental health care contacts was evaluated. The accuracy and area under the receiver operating characteristic curve (AUC) of the machine learning algorithms were compared and we also estimated the relative importance of each predictor variable. The best and least performing algorithms were compared with GLM/logistic regression using net reclassification improvement analysis and the five best performing algorithms were combined in an ensemble model using stacking. Results All models performed above chance level. We found Gradient Boosting to be the best performing algorithm (AUC = 0.774) and K-Nearest Neighbors to be the least performing (AUC = 0.702). The performance of GLM/logistic regression (AUC = 0.76) was slightly above average among the tested algorithms. In a Net Reclassification Improvement analysis Gradient Boosting outperformed GLM/logistic regression by 2.9% and K-Nearest Neighbors by 11.3%. GLM/logistic regression outperformed K-Nearest Neighbors by 8.7%. Nine of the top-10 most important predictor variables were related to previous mental health care use. Conclusions Gradient Boosting led to the highest predictive accuracy and AUC while GLM/logistic regression performed average among the tested algorithms. Although statistically significant, the magnitude of the differences between the machine learning algorithms was in most cases modest. The results show that a predictive accuracy similar to the best performing model can be achieved when combining multiple algorithms in an ensemble model.


Author(s):  
Sandy C. Lauguico ◽  
◽  
Ronnie S. Concepcion II ◽  
Jonnel D. Alejandrino ◽  
Rogelio Ruzcko Tobias ◽  
...  

The arising problem on food scarcity drives the innovation of urban farming. One of the methods in urban farming is the smart aquaponics. However, for a smart aquaponics to yield crops successfully, it needs intensive monitoring, control, and automation. An efficient way of implementing this is the utilization of vision systems and machine learning algorithms to optimize the capabilities of the farming technique. To realize this, a comparative analysis of three machine learning estimators: Logistic Regression (LR), K-Nearest Neighbor (KNN), and Linear Support Vector Machine (L-SVM) was conducted. This was done by modeling each algorithm from the machine vision-feature extracted images of lettuce which were raised in a smart aquaponics setup. Each of the model was optimized to increase cross and hold-out validations. The results showed that KNN having the tuned hyperparameters of n_neighbors=24, weights='distance', algorithm='auto', leaf_size = 10 was the most effective model for the given dataset, yielding a cross-validation mean accuracy of 87.06% and a classification accuracy of 91.67%.


Sign in / Sign up

Export Citation Format

Share Document