A Comparative Analysis of Sentiment Classification Based on Deep and Traditional Ensemble Machine Learning Models

Long-term unemployment has significant societal impact and is of particular concerns for policymakers with regard to economic growth and public finances. This paper constructs advanced ensemble machine learning models to predict citizens’ risks of becoming long-term unemployed using data collected from European public authorities for employment service. The proposed model achieves 81.2% accuracy on identifying citizens with high risks of long-term unemployment. This paper also examines how to dissect black-box machine learning models by offering explanations at both a local and global level using SHAP, a state-of-the-art model-agnostic approach to explain factors that contribute to long-term unemployment. Lastly, this paper addresses an under-explored question when applying machine learning in the public domain, that is, the inherent bias in model predictions. The results show that popular models such as gradient boosted trees may produce unfair predictions against senior age groups and immigrants. Overall, this paper sheds light on the recent increasing shift for governments to adopt machine learning models to profile and prioritize employment resources to reduce the detrimental effects of long-term unemployment and improve public welfare.

Download Full-text

Comparative Analysis of Machine Learning Models for the Prediction of Pedestrian Crash Severity: Focused on Balancing Pedestrian Crash Dataset

Journal of Korean Society for Geospatial Information System ◽

10.7319/kogsis.2021.29.2.003 ◽

2021 ◽

Vol 29 (2) ◽

pp. 3-15

Author(s):

Hojun Lee ◽

Sugie Lee

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Crash Severity ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Efficient Breast Cancer Prediction Using Ensemble Machine Learning Models

2019 4th International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT) ◽

10.1109/rteict46194.2019.9016968 ◽

2019 ◽

Cited By ~ 1

Author(s):

Naveen ◽

R. K. Sharma ◽

Anil Ramachandran Nair

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Models ◽

Cancer Prediction ◽

Ensemble Machine Learning ◽

Machine Learning Models

Download Full-text

A Comprehensive Comparative Analysis of Machine Learning Models for Predicting Heating and Cooling Loads

Current Approaches in Science and Technology Research Vol. 6 ◽

10.9734/bpi/castr/v6/2602f ◽

2021 ◽

pp. 77-92

Author(s):

Eslam Mohammed Abdelkader ◽

Abobakr Al-Sakkaf ◽

Reem Ahmed

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Learning Models ◽

Heating And Cooling ◽

Cooling Loads ◽

Machine Learning Models

Download Full-text

A Comparative Analysis of Machine Learning Algorithms in Design Process of Adaptive Traffic Signal Control System

Journal of Physics Conference Series ◽

10.1088/1742-6596/2161/1/012054 ◽

2022 ◽

Vol 2161 (1) ◽

pp. 012054

Author(s):

R M Savithramma ◽

R Sumathi ◽

H S Sudhira

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Regression Tree ◽

Gradient Boosting ◽

Signal Control ◽

Traffic Signal Control ◽

Suitable Model ◽

Traffic Classification ◽

Learning Models ◽

Machine Learning Models

Abstract In recent decades machine learning technology has proved its efficiency in most sectors by making human life easier. With this popularity and efficiency, it is applied to design traffic signal control systems to mitigate traffic congestion and distribute waiting delays. Hence, many researchers around the world are working to address this issue. As a part of the solution, this article presents a comparative analysis of various machine learning models to come up with a suitable model for an isolated intersection. In this context, eight machine learning models including Linear Regression, Ridge, Lasso, Support Vector Regression, k-Nearest Neighbour, Decision Tree, Random Forest, and Gradient Boosting Regression Tree are selected. Shivakumara Swamiji Circle (SSC), one of the intersections in Tumakuru, Karnataka, India is selected as a case study area. Essential data is collected from SSC through videography. The selected models are developed to predict green time based on traffic classification and volume in Passenger Car Units (PCU) for each phase on the PyCharm platform. The models are evaluated based on various performance metrics. Results revealed that all the selected models predict green splits with 91% accuracy using traffic classification as input, whereas, models showed 85% accuracy with PCU as input. And also, Gradient Boosting Regression Tree is the best suitable model for the selected intersection, whereas, Decision Tree is not referred model for this application.

Download Full-text