Using Small Business Banking Data for Explainable Credit Risk Scoring

2020 ◽ Vol 34 (08) ◽ pp. 13396-13401
Author(s): Wei Wang ◽ Christopher Lesner ◽ Alexander Ran ◽ Marko Rukonic ◽ Jason Xue ◽ ...

Machine learning applied to financial transaction records can predict how likely a small business is to repay a loan. For this purpose we compared a traditional scorecard credit risk model against various machine learning models and found that XGBoost with monotonic constraints outperformed the scorecard model by 7% in the K-S statistic. To deploy such a machine learning model in production for loan application risk scoring, it must comply with lending industry regulations that require lenders to provide understandable and specific reasons for credit decisions. We therefore also developed a loan decision explanation technique based on the ideas of weight of evidence (WoE) and SHAP. Our research was carried out using a historical dataset of tens of thousands of loans and millions of associated financial transactions. The credit risk scoring model based on XGBoost with monotonic constraints and SHAP explanations described in this paper has been deployed by QuickBooks Capital to assess incoming loan applications since July 2019.
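
A minimal sketch of the general technique described above, assuming a hypothetical set of transaction-derived features and simulated labels (the actual QuickBooks Capital features and data are not public). It shows how XGBoost's monotone_constraints parameter enforces directionally sensible risk scores, how a K-S statistic can be computed from the resulting scores, and how SHAP values yield per-application reasons:

```python
# Sketch: monotonically constrained XGBoost risk model with SHAP explanations.
# Features, labels, and constraint signs are illustrative assumptions.
import numpy as np
import xgboost as xgb
import shap
from scipy.stats import ks_2samp
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 5000
# Hypothetical features aggregated from a business's transaction history.
avg_monthly_inflow = rng.normal(50_000, 15_000, n)
overdraft_ratio = rng.uniform(0, 0.6, n)
months_of_history = rng.uniform(6, 60, n)
X = np.column_stack([avg_monthly_inflow, overdraft_ratio, months_of_history])

# Simulated default labels: more inflow and history lower risk, overdrafts raise it.
z = -0.00005 * avg_monthly_inflow + 5 * overdraft_ratio - 0.05 * months_of_history
y = (rng.random(n) < 1 / (1 + np.exp(-z))).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

# monotone_constraints: -1 = risk must not increase with the feature, +1 = must not decrease.
model = xgb.XGBClassifier(
    n_estimators=300, max_depth=4, learning_rate=0.05,
    monotone_constraints=(-1, 1, -1), eval_metric="auc",
)
model.fit(X_tr, y_tr)

# K-S statistic: separation between the score distributions of defaulters and non-defaulters.
scores = model.predict_proba(X_te)[:, 1]
print("K-S:", ks_2samp(scores[y_te == 1], scores[y_te == 0]).statistic)

# SHAP values give per-application, per-feature contributions that can be mapped
# to specific, understandable reasons for a credit decision.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_te)
print("Feature contributions for first application:", shap_values[0])
```

Monotonic constraints keep the score's response to each feature in a single direction, which both stabilizes the model and makes the SHAP-based reasons easier to defend under lending regulations.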

2018 ◽ Vol 211 ◽ pp. 17009
Author(s): Natalia Espinoza Sepulveda ◽ Jyoti Sinha

The development of technologies for the maintenance industry has taken on an important role in meeting its demanding challenges. One of these challenges is to predict defects in machines, if any, as early as possible so that machine downtime can be managed. Vibration-based condition monitoring (VCM) is well known for this purpose but requires human experience and expertise. Machine learning models using intelligent systems and pattern recognition appear to be the future avenue for machine fault detection without human expertise, and several such studies have been published in the literature. This paper likewise presents a machine learning model for the classification and detection of different machine faults. Here, time-domain and frequency-domain features derived from the measured machine vibration data are used separately to develop machine learning models with the artificial neural network method. The effectiveness of the time-domain and frequency-domain feature-based models is compared when they are applied to an experimental rig. The paper presents the proposed machine learning models and their performance in terms of the observations and results.
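
As an illustration of the general approach (not the authors' rig data or network architecture), the following sketch extracts simple time-domain features (RMS, kurtosis, skewness, crest factor) and coarse frequency-domain features (band-averaged FFT magnitudes) from simulated vibration records and trains a small neural network on each feature set separately:

```python
# Sketch: time-domain vs. frequency-domain vibration features with a small ANN.
# The simulated signals and the two machine conditions are illustrative only.
import numpy as np
from scipy.stats import kurtosis, skew
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

fs = 10_000        # sampling rate (Hz), assumed
n_points = 4096    # samples per vibration record

def time_features(sig):
    rms = np.sqrt(np.mean(sig ** 2))
    crest = np.max(np.abs(sig)) / rms
    return np.array([rms, kurtosis(sig), skew(sig), crest])

def freq_features(sig, n_bands=16):
    spectrum = np.abs(np.fft.rfft(sig))
    bands = np.array_split(spectrum, n_bands)      # coarse spectral bands
    return np.array([band.mean() for band in bands])

rng = np.random.default_rng(1)

def make_record(faulty):
    t = np.arange(n_points) / fs
    sig = np.sin(2 * np.pi * 50 * t) + 0.1 * rng.normal(size=n_points)
    if faulty:
        sig += 0.5 * np.sin(2 * np.pi * 100 * t)   # 2x harmonic as a hypothetical fault symptom
    return sig

labels = rng.integers(0, 2, 400)
records = [make_record(f) for f in labels]
X_time = np.array([time_features(r) for r in records])
X_freq = np.array([freq_features(r) for r in records])

for name, X in [("time-domain", X_time), ("frequency-domain", X_freq)]:
    X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.3, random_state=0)
    clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=2000, random_state=0)
    clf.fit(X_tr, y_tr)
    print(name, "accuracy:", clf.score(X_te, y_te))
```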


Data is the most crucial component of a successful ML system. Once a machine learning model is developed, it becomes obsolete over time because new input data is generated every second. To keep predictions accurate, we need a way to keep models up to date. Our research work involves finding a mechanism that can automatically retrain a model with new data. It also explores the possibilities of automating machine learning processes. We started this project by training and testing our model with conventional machine learning methods. The outcome was then compared with the outcome of experiments conducted using AutoML methods such as TPOT, which helped us identify an efficient technique for retraining our models. These techniques can be used in areas where people do not deal with the actual working of an ML model but only require the outputs of ML processes.
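
A minimal sketch of the comparison described above, on a synthetic dataset: a conventional model is trained once, while a TPOT pipeline is searched automatically and can simply be re-fit (or re-exported) whenever new data accumulates. The dataset and retraining trigger are placeholders, not the ones used in this work:

```python
# Sketch: conventional model vs. TPOT AutoML pipeline on a synthetic dataset.
# The data, split, and retraining trigger are placeholders for this project's setup.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from tpot import TPOTClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

# Conventional approach: a model chosen and tuned by hand, fit once.
baseline = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)
print("baseline accuracy:", baseline.score(X_te, y_te))

# AutoML approach: TPOT searches for a pipeline automatically; the same call can
# be re-run on freshly accumulated data to retrain without manual tuning.
tpot = TPOTClassifier(generations=5, population_size=20, random_state=0, verbosity=0)
tpot.fit(X_tr, y_tr)
print("TPOT accuracy:", tpot.score(X_te, y_te))
tpot.export("retrained_pipeline.py")   # exported pipeline can be scheduled for retraining
```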


2016 ◽ Vol 7 (2) ◽ pp. 43-71
Author(s): Sangeeta Lal ◽ Neetu Sardana ◽ Ashish Sureka

Logging is an important yet tough decision for OSS developers. Machine learning models are useful in improving several steps of OSS development, including logging. Several recent studies propose machine learning models to predict logged code constructs. The prediction performance of these models is limited by the class-imbalance problem, since the number of logged code constructs is small compared to that of non-logged code constructs. No previous study analyzes the class-imbalance problem for logged code construct prediction. The authors first analyze the performance of J48, RF, and SVM classifiers for catch-block and if-block logged code construct prediction on imbalanced datasets. Second, the authors propose LogIm, an ensemble and threshold-based machine learning model. Third, the authors evaluate the performance of LogIm on three open-source projects. On average, the LogIm model improves the performance of the baseline classifiers, J48, RF, and SVM, by 7.38%, 9.24%, and 4.6% for catch-block logging prediction, and by 12.11%, 14.95%, and 19.13% for if-block logging prediction.
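
The abstract does not detail LogIm's internals, so the following is only a generic sketch of the two ingredients it names, an ensemble of the three baseline classifiers and a tuned decision threshold, applied to a synthetic imbalanced dataset (DecisionTreeClassifier stands in for J48/C4.5):

```python
# Generic sketch of ensemble averaging plus threshold moving for imbalanced
# logging prediction; not LogIm's actual design, which the abstract does not detail.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier   # stand-in for J48 (C4.5)

# Imbalanced data: logged constructs (class 1) are much rarer than non-logged ones.
X, y = make_classification(n_samples=3000, n_features=20, weights=[0.95, 0.05],
                           random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, stratify=y, test_size=0.3,
                                            random_state=0)

models = [DecisionTreeClassifier(random_state=0),
          RandomForestClassifier(random_state=0),
          SVC(probability=True, random_state=0)]
for m in models:
    m.fit(X_tr, y_tr)

# Ensemble: average the predicted minority-class probabilities of the base classifiers.
proba = np.mean([m.predict_proba(X_val)[:, 1] for m in models], axis=0)

# Threshold moving: choose the cut-off that maximizes F1 on validation data
# instead of the default 0.5, which under-predicts the rare (logged) class.
thresholds = np.linspace(0.05, 0.95, 19)
best = max(thresholds, key=lambda t: f1_score(y_val, (proba >= t).astype(int)))
print("best threshold:", best, "F1:", f1_score(y_val, (proba >= best).astype(int)))
```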


Significance: It required arguably the single largest computational effort for a machine learning model to date, and it is capable of producing text at times indistinguishable from the work of a human author. This has generated considerable excitement about potentially transformative business applications, as well as concerns about the system's weaknesses and possible misuse. Impacts: Stereotypes and biases in machine learning models will become increasingly problematic as they are adopted by businesses and governments. The use of flawed AI tools that result in embarrassing failures risks cuts to public funding for AI research. Academia and industry face pressure to advance research into explainable AI, but progress is slow.


Aerospace ◽ 2021 ◽ Vol 8 (9) ◽ pp. 236
Author(s): Junghyun Kim ◽ Kyuman Lee

Obtaining reliable wind information is critical for efficiently managing air traffic and airport operations. Wind forecasting has been considered one of the most challenging tasks in the aviation industry. Recently, with the advent of artificial intelligence, many machine learning techniques have been widely used to address a variety of complex phenomena in wind predictions. In this paper, we propose a hybrid framework that combines a machine learning model with Kalman filtering for a wind nowcasting problem in the aviation industry. More specifically, this study has three objectives: (1) compare the performance of machine learning models (i.e., Gaussian process, multi-layer perceptron, and long short-term memory (LSTM) network) to identify the most appropriate model for wind predictions; (2) combine the machine learning model selected in step (1) with an unscented Kalman filter (UKF) to improve the fidelity of the model; and (3) perform Monte Carlo simulations to quantify uncertainties arising from the modeling process. Results show that short-term time-series wind datasets are best predicted by the LSTM network compared to the other machine learning models, and that the UKF-aided LSTM (UKF-LSTM) approach outperforms the LSTM network alone, especially when long-term wind forecasting needs to be considered.
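
A minimal sketch of the LSTM nowcasting step on a synthetic wind-speed series; the window length, network size, and data are assumptions, and the UKF correction stage of the hybrid framework is only indicated in a closing comment:

```python
# Sketch: LSTM nowcasting of a synthetic wind-speed series. Window length, network
# size, and data are assumptions; the UKF fusion step is only noted in a comment.
import numpy as np
import tensorflow as tf

rng = np.random.default_rng(2)
t = np.arange(5000)
# Synthetic wind speed (m/s): slow periodic trend plus gust-like noise.
wind = 8 + 2 * np.sin(2 * np.pi * t / 720) + 0.5 * rng.normal(size=t.size)

window = 30   # predict the next value from the previous 30 observations
X = np.array([wind[i:i + window] for i in range(len(wind) - window)])[..., None]
y = wind[window:]

model = tf.keras.Sequential([
    tf.keras.layers.LSTM(32, input_shape=(window, 1)),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.fit(X[:-500], y[:-500], epochs=5, batch_size=64, verbose=0)

pred = model.predict(X[-500:], verbose=0).ravel()
print("test RMSE (m/s):", np.sqrt(np.mean((pred - y[-500:]) ** 2)))
# In the hybrid UKF-LSTM scheme, one-step predictions like `pred` would be fused
# with incoming wind measurements by an unscented Kalman filter to refine the forecast.
```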

