A Comparative Analysis of Sentiment Classification Based on Deep and Traditional Ensemble Machine Learning Models

Author(s):  
Mahammed Kamruzzaman ◽  
Mohammed Hossain ◽  
Md. Rashidul Islam Imran ◽  
Sagor Chandro Bakchy
2020 ◽  
Vol 214 ◽  
pp. 01023
Author(s):  
Linan (Frank) Zhao

Long-term unemployment has significant societal impact and is of particular concerns for policymakers with regard to economic growth and public finances. This paper constructs advanced ensemble machine learning models to predict citizens’ risks of becoming long-term unemployed using data collected from European public authorities for employment service. The proposed model achieves 81.2% accuracy on identifying citizens with high risks of long-term unemployment. This paper also examines how to dissect black-box machine learning models by offering explanations at both a local and global level using SHAP, a state-of-the-art model-agnostic approach to explain factors that contribute to long-term unemployment. Lastly, this paper addresses an under-explored question when applying machine learning in the public domain, that is, the inherent bias in model predictions. The results show that popular models such as gradient boosted trees may produce unfair predictions against senior age groups and immigrants. Overall, this paper sheds light on the recent increasing shift for governments to adopt machine learning models to profile and prioritize employment resources to reduce the detrimental effects of long-term unemployment and improve public welfare.


2022 ◽  
Vol 2161 (1) ◽  
pp. 012054
Author(s):  
R M Savithramma ◽  
R Sumathi ◽  
H S Sudhira

Abstract In recent decades machine learning technology has proved its efficiency in most sectors by making human life easier. With this popularity and efficiency, it is applied to design traffic signal control systems to mitigate traffic congestion and distribute waiting delays. Hence, many researchers around the world are working to address this issue. As a part of the solution, this article presents a comparative analysis of various machine learning models to come up with a suitable model for an isolated intersection. In this context, eight machine learning models including Linear Regression, Ridge, Lasso, Support Vector Regression, k-Nearest Neighbour, Decision Tree, Random Forest, and Gradient Boosting Regression Tree are selected. Shivakumara Swamiji Circle (SSC), one of the intersections in Tumakuru, Karnataka, India is selected as a case study area. Essential data is collected from SSC through videography. The selected models are developed to predict green time based on traffic classification and volume in Passenger Car Units (PCU) for each phase on the PyCharm platform. The models are evaluated based on various performance metrics. Results revealed that all the selected models predict green splits with 91% accuracy using traffic classification as input, whereas, models showed 85% accuracy with PCU as input. And also, Gradient Boosting Regression Tree is the best suitable model for the selected intersection, whereas, Decision Tree is not referred model for this application.


2022 ◽  
Vol 8 ◽  
pp. 612-618
Author(s):  
Pavel Matrenin ◽  
Murodbek Safaraliev ◽  
Stepan Dmitriev ◽  
Sergey Kokin ◽  
Anvari Ghulomzoda ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document