Comparison of individual, ensemble and integrated ensemble machine learning methods to predict China’s SME credit risk in supply chain finance

2016 ◽  
Vol 28 (S1) ◽  
pp. 41-50 ◽  
Author(s):  
You Zhu ◽  
Chi Xie ◽  
Gang-Jin Wang ◽  
Xin-Guo Yan
Entropy ◽  
2016 ◽  
Vol 18 (5) ◽  
pp. 195 ◽  
Author(s):  
You Zhu ◽  
Chi Xie ◽  
Gang-Jin Wang ◽  
Xin-Guo Yan

2019 ◽  
Vol 19 (292) ◽  
Author(s):  
Nan Hu ◽  
Jian Li ◽  
Alexis Meyer-Cirkel

We compared the predictive performance of a series of machine learning and traditional methods for monthly CDS spreads, using firms’ accounting-based, market-based and macroeconomics variables for a time period of 2006 to 2016. We find that ensemble machine learning methods (Bagging, Gradient Boosting and Random Forest) strongly outperform other estimators, and Bagging particularly stands out in terms of accuracy. Traditional credit risk models using OLS techniques have the lowest out-of-sample prediction accuracy. The results suggest that the non-linear machine learning methods, especially the ensemble methods, add considerable value to existent credit risk prediction accuracy and enable CDS shadow pricing for companies missing those securities.


2019 ◽  
Vol 22 (03) ◽  
pp. 1950021 ◽  
Author(s):  
Huei-Wen Teng ◽  
Michael Lee

Machine learning has successful applications in credit risk management, portfolio management, automatic trading, and fraud detection, to name a few, in the domain of finance technology. Reformulating and solving these topics adequately and accurately is problem specific and challenging along with the availability of complex and voluminous data. In credit risk management, one major problem is to predict the default of credit card holders using real dataset. We review five machine learning methods: the [Formula: see text]-nearest neighbors decision trees, boosting, support vector machine, and neural networks, and apply them to the above problem. In addition, we give explicit Python scripts to conduct analysis using a dataset of 29,999 instances with 23 features collected from a major bank in Taiwan, downloadable in the UC Irvine Machine Learning Repository. We show that the decision tree performs best among others in terms of validation curves.


2020 ◽  
Vol 1625 ◽  
pp. 012024
Author(s):  
D Prayogo ◽  
D I Santoso ◽  
D Wijaya ◽  
T Gunawan ◽  
J A Widjaja

Informatica ◽  
2020 ◽  
Vol 44 (3) ◽  
Author(s):  
Ramzi Saifan ◽  
Khaled Sharif ◽  
Mohammad Abu-Ghazaleh ◽  
Mohammad Abdel-Majeed

Buildings ◽  
2022 ◽  
Vol 12 (1) ◽  
pp. 46
Author(s):  
Obuks Augustine Ejohwomu ◽  
Olakekan Shamsideen Oshodi ◽  
Majeed Oladokun ◽  
Oyegoke Teslim Bukoye ◽  
Nwabueze Emekwuru ◽  
...  

Exposure of humans to high concentrations of PM2.5 has adverse effects on their health. Researchers estimate that exposure to particulate matter from fossil fuel emissions accounted for 18% of deaths in 2018—a challenge policymakers argue is being exacerbated by the increase in the number of extreme weather events and rapid urbanization as they tinker with strategies for reducing air pollutants. Drawing on a number of ensemble machine learning methods that have emerged as a result of advancements in data science, this study examines the effectiveness of using ensemble models for forecasting the concentrations of air pollutants, using PM2.5 as a representative case. A comprehensive evaluation of the ensemble methods was carried out by comparing their predictive performance with that of other standalone algorithms. The findings suggest that hybrid models provide useful tools for PM2.5 concentration forecasting. The developed models show that machine learning models are efficient in predicting air particulate concentrations, and can be used for air pollution forecasting. This study also provides insights into how climatic factors influence the concentrations of pollutants found in the air.


Sign in / Sign up

Export Citation Format

Share Document