Machine Learning Methods Performance Evaluation

Author(s):  
Zakoldaev D. A. et al.

In this paper, we describe an approach to air pollution modeling in data-incompleteness scenarios, where sensors cover the monitoring area only partially. The fundamental calculations and metrics underlying the machine learning modeling algorithms are presented. Moreover, the assessment indicators and metrics for evaluating the performance of machine learning methods are described. Based on the conducted analysis, conclusions on the most appropriate evaluation approaches are drawn.
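A minimal sketch of the kind of evaluation metrics commonly reported for such models (R2, RMSE, MAE) is given below; the function and variable names are illustrative and not taken from the paper.

```python
# Minimal sketch of common regression evaluation metrics (R2, RMSE, MAE),
# assuming NumPy arrays of observed and model-predicted pollutant
# concentrations; names are illustrative, not from the paper.
import numpy as np

def evaluate(observed: np.ndarray, predicted: np.ndarray) -> dict:
    residuals = observed - predicted
    ss_res = np.sum(residuals ** 2)
    ss_tot = np.sum((observed - observed.mean()) ** 2)
    return {
        "R2": 1.0 - ss_res / ss_tot,               # coefficient of determination
        "RMSE": np.sqrt(np.mean(residuals ** 2)),  # root mean squared error
        "MAE": np.mean(np.abs(residuals)),         # mean absolute error
    }

# Example usage with synthetic concentration values
obs = np.array([12.0, 18.5, 25.1, 30.4])
pred = np.array([11.2, 19.0, 24.0, 31.5])
print(evaluate(obs, pred))
```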

2001, Vol. 111 (3), pp. 471-477
Author(s):
R. Sivacoumar, A. D. Bhanarkar, S. K. Goyal, S. K. Gadkari, A. L. Aggarwal

2019, Vol. 8 (4), pp. 7818-7823

Software testing is a fundamental and essential step of the software development life cycle, used to identify defects in software and then fix them. For some systems, the reliability of data transmission, or the quality of proper processing, maintenance and retrieval of information on a server, can be tested. Accuracy is also a factor commonly used by the Joint Interoperability Test Command as a criterion for assessing interoperability. According to our review, this is the principal investigation of computer fault prediction and accuracy that focuses on the use of the PROMISE repository dataset. Several PROMISE dataset samples are compared between pseudo-code (Python) and actual software (WEKA), showing that software metrics and machine learning methods are effective for computer fault prediction and accuracy measurement.
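As an illustration of the workflow the abstract describes, the hedged sketch below trains a classifier on a PROMISE-style defect dataset and reports accuracy with scikit-learn; the file name, column names and model choice are assumptions, not the study's actual pipeline (which compares Python code against WEKA).

```python
# Hypothetical sketch of software fault prediction on a PROMISE-style defect
# dataset (a CSV of static code metrics with a binary 'defects' label).
# File and column names are assumptions for illustration only.
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

data = pd.read_csv("promise_defects.csv")         # assumed file name
X = data.drop(columns=["defects"])                # static code metrics
y = data["defects"].astype(int)                   # 1 = faulty module, 0 = clean

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y
)

model = RandomForestClassifier(n_estimators=200, random_state=42)
model.fit(X_train, y_train)

# Accuracy is the evaluation criterion emphasized in the abstract.
print("Accuracy:", accuracy_score(y_test, model.predict(X_test)))
```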


2020
Author(s):
Peer Nowack, Lev Konstantinovskiy, Hannah Gardiner, John Cant

Abstract. Air pollution is a key public health issue in urban areas worldwide. The development of low-cost air pollution sensors is consequently a major research priority. However, low-cost sensors often fail to attain sufficient measurement performance compared to state-of-the-art measurement stations, and typically require calibration procedures in expensive laboratory settings. As a result, there has been much debate about calibration techniques that could make their performance more reliable, while also developing calibration procedures that can be carried out without access to advanced laboratories. One repeatedly proposed strategy is low-cost sensor calibration through co-location with public measurement stations. The idea is that, using a regression function, the low-cost sensor signals can be calibrated against the station reference signal, so that the sensors can then be deployed separately with performance similar to that of the original stations. Here we test the idea of using machine learning algorithms for such regression tasks, using hourly averaged co-location data for nitrogen dioxide (NO2) and particulate matter of particle sizes smaller than 10 μm (PM10) at three different locations in the urban area of London, UK. Specifically, we compare the performance of Ridge regression, a linear statistical learning algorithm, to two non-linear algorithms in the form of Random Forest (RF) regression and Gaussian Process regression (GPR). We further benchmark the performance of all three machine learning methods against the more common Multiple Linear Regression (MLR). We obtain very good out-of-sample R2 scores (coefficient of determination) > 0.7, frequently exceeding 0.8, for the machine learning calibrated low-cost sensors. In contrast, the performance of MLR is more dependent on random variations in the sensor hardware and co-located signals, and is also more sensitive to the length of the co-location period. We find that, subject to certain conditions, GPR is typically the best-performing method in our calibration setting, followed by Ridge regression and RF regression. However, we also highlight several key limitations of the machine learning methods, which will be crucial to consider in any co-location calibration. In particular, none of the methods is able to extrapolate to pollution levels well outside those encountered at the training stage. Ultimately, this is one of the key limiting factors when sensors are deployed away from the co-location site itself. Consequently, we find that the linear Ridge method, which best mitigates such extrapolation effects, typically performs as well as, or even better than, GPR after sensor relocation. Overall, our results highlight the potential of co-location methods paired with machine learning calibration techniques to reduce the costs of air pollution measurements, subject to careful consideration of the co-location training conditions, the choice of calibration variables, and the features of the calibration algorithm.
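The following sketch illustrates the co-location calibration setup the abstract describes, assuming a CSV of hourly averaged low-cost sensor channels and a reference NO2 signal (all file and column names are hypothetical); it fits MLR, Ridge, RF and GPR with scikit-learn and reports the out-of-sample R2 for each. It is not the authors' code.

```python
# Illustrative sketch (not the authors' code) of co-location calibration:
# regress a reference station signal on low-cost sensor channels with MLR,
# Ridge, Random Forest and Gaussian Process regression, scoring each model
# by its out-of-sample R2. File and column names are assumptions.
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.linear_model import LinearRegression, RidgeCV
from sklearn.model_selection import train_test_split

df = pd.read_csv("colocation_hourly.csv")               # assumed file
X = df[["sensor_no2", "sensor_temp", "sensor_rh"]]       # low-cost signals (assumed)
y = df["reference_no2"]                                  # station reference (assumed)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

models = {
    "MLR": LinearRegression(),
    "Ridge": RidgeCV(alphas=[0.1, 1.0, 10.0]),
    "RF": RandomForestRegressor(n_estimators=300, random_state=0),
    "GPR": GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True),
}

for name, model in models.items():
    model.fit(X_train, y_train)
    # .score() returns the coefficient of determination R2 on held-out data
    print(f"{name}: out-of-sample R2 = {model.score(X_test, y_test):.3f}")
```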


2020
Author(s):  
Juan David Gutiérrez

Abstract
Background: Previous authors have shown the relationship between air pollution aerosols and meteorological variables and the occurrence of pneumonia. Forecasting the number of attended pneumonia cases may be useful to optimize the allocation of healthcare resources and support public health authorities in implementing emergency plans to face an increase in patients. The purpose of this study is to implement four machine-learning methods to forecast the number of attended pneumonia cases in the five largest cities of Colombia by using air pollution aerosol, meteorological and admission data.
Methods: The number of attended pneumonia cases in the five most populated Colombian cities between January 2009 and December 2019 was provided by public health authorities. Air pollution aerosol and meteorological data were obtained from remote sensors. Four machine-learning methods were implemented for each city. We selected the machine-learning method with the best performance in each city and applied two techniques to identify the variables most relevant to the forecasts produced by the best-performing models.
Results: According to the R2 metric, random forest was the machine-learning method with the best performance for Bogotá, Medellín and Cali, whereas for Barranquilla the best performance was obtained from Bayesian adaptive regression trees, and for Cartagena extreme gradient boosting performed best. The most important variables for the forecasts were related to the admission data.
Conclusions: The results of this study suggest that machine learning can be used to efficiently forecast the number of attended pneumonia cases, and can therefore be a useful decision-making tool for public health authorities.
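A hedged sketch of one such forecasting setup is shown below: a random forest regressor trained on lagged admission counts plus aerosol and meteorological covariates, with feature importances used to rank predictors. The data file, column names and lag choices are assumptions for illustration only.

```python
# Hedged sketch of a forecasting setup of the kind described above: a random
# forest regressor on lagged case counts plus aerosol and meteorological
# covariates, with feature importances used to rank predictor relevance.
# The data file, column names and lag choices are assumptions.
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

df = pd.read_csv("bogota_monthly.csv")                   # assumed file
df["cases_lag1"] = df["pneumonia_cases"].shift(1)        # previous-period attendances
df["cases_lag2"] = df["pneumonia_cases"].shift(2)
df = df.dropna()

features = ["cases_lag1", "cases_lag2", "aerosol_optical_depth",
            "temperature", "precipitation"]              # assumed covariates
train, test = df.iloc[:-12], df.iloc[-12:]               # hold out the last year

model = RandomForestRegressor(n_estimators=500, random_state=1)
model.fit(train[features], train["pneumonia_cases"])

print("Out-of-sample R2:", model.score(test[features], test["pneumonia_cases"]))
print(dict(zip(features, model.feature_importances_.round(3))))
```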

