Comparing Performance of Machine Learning Algorithms for Default Risk Prediction in Peer to Peer Lending

TEM Journal ◽

10.18421/tem101-16 ◽

2021 ◽

pp. 133-143

Author(s):

Yanka Aleksandrova

Keyword(s):

Machine Learning ◽

Default Risk ◽

Credit Scoring ◽

Learning Algorithms ◽

Peer To Peer ◽

Machine Learning Algorithms ◽

Ensemble Classifiers ◽

Peer Lending ◽

Official Site ◽

Heterogeneous Ensemble

The purpose of this research is to evaluate several popular machine learning algorithms for credit scoring in peer-to-peer lending. The dataset used to fit the models is extracted from the official site of Lending Club. Several models are implemented, including single classifiers (logistic regression, decision tree, multilayer perceptron), homogeneous ensembles (XGBoost, GBM, Random Forest), and heterogeneous ensemble classifiers such as Stacked Ensembles. The results show that ensemble classifiers outperform single ones, with Stacked Ensemble and XGBoost performing best.


Improved prediction of clay soil expansion using machine learning algorithms and meta-heuristic dichotomous ensemble classifiers

Geoscience Frontiers ◽

10.1016/j.gsf.2021.101296 ◽

2021 ◽

pp. 101296

Author(s):

E.U. Eyo ◽

S.J. Abbey ◽

T.T. Lawrence ◽

F.K. Tetteh

Keyword(s):

Machine Learning ◽

Clay Soil ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Ensemble Classifiers


A Machine Learning Approach for Micro-Credit Scoring

Risks ◽

10.3390/risks9030050 ◽

2021 ◽

Vol 9 (3) ◽

pp. 50

Author(s):

Apostolos Ampountolas ◽

Titus Nyarko Nde ◽

Paresh Date ◽

Corina Constantinescu

Keyword(s):

Machine Learning ◽

Random Forest ◽

Interest Rates ◽

Credit Scoring ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Learning Approach ◽

Machine Learning Approach ◽

Credit History ◽

Lending Institutions

In micro-lending markets, the lack of recorded credit history is a significant impediment to assessing individual borrowers’ creditworthiness and therefore to deciding fair interest rates. This research compares various machine learning algorithms on real micro-lending data to test their efficacy at classifying borrowers into various credit categories. We demonstrate that off-the-shelf multi-class classifiers such as random forest algorithms can perform this task very well, using readily available data about customers (such as age, occupation, and location). This gives micro-lending institutions around the developing world an inexpensive and reliable means of assessing creditworthiness in the absence of credit history or central credit databases.


Societal bias reinforcement through machine learning: a credit scoring perspective

AI and Ethics ◽

10.1007/s43681-020-00026-z ◽

2020 ◽

Author(s):

Bertrand K. Hassani

Keyword(s):

Machine Learning ◽

Credit Scoring ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Social Bias ◽

Social Biases ◽

The Way

Do machine learning and AI ensure that social biases thrive? This paper aims to analyze this issue. Because algorithms are informed by data, if the data are corrupted from a social bias perspective, good machine learning algorithms will learn from the data provided and reproduce the learnt patterns in their predictions, whether the intended task is classification or regression. In other words, the way society behaves, whether positively or negatively, would necessarily be reflected by the models. In this paper, we analyze how social biases are transmitted from the data into banks' loan approvals by predicting either the gender or the ethnicity of the customers using the exact same information provided by customers through their applications.


Credit Scoring Using Machine Learning by Combing Social Network Information: Evidence from Peer-to-Peer Lending

Information ◽

10.3390/info10120397 ◽

2019 ◽

Vol 10 (12) ◽

pp. 397 ◽

Cited By ~ 1

Author(s):

Beibei Niu ◽

Jinzheng Ren ◽

Xiaotao Li

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Social Network ◽

Financial Institutions ◽

Learning Algorithm ◽

Credit Scoring ◽

Peer To Peer ◽

Machine Learning Algorithms ◽

Loan Default ◽

Network Information

Financial institutions use credit scoring to evaluate potential loan default risks. However, insufficient credit information limits the capacity of peer-to-peer (P2P) lending platforms to build effective credit scoring. In recent years, many types of data have been used for credit scoring to compensate for the lack of credit history data. Whether social network information can strengthen financial institutions’ predictive power has received much attention in industry and academia. The aim of this study is to test the reliability of social network information in predicting loan default. We extract borrowers’ social network information from mobile phones and then use logistic regression to test the relationship between social network information and loan default. Three machine learning algorithms (random forest, AdaBoost, and LightGBM) were constructed to demonstrate the predictive performance of social network information. The logistic regression results show a statistically significant correlation between social network information and loan default. The machine learning results show that social network information can significantly improve loan default prediction performance. The experimental results suggest that social network information is valuable for credit scoring.


Using Machine Learning Algorithms to create a Credit Scoring Model for mobile money users

2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI) ◽

10.1109/saci51354.2021.9465561 ◽

2021 ◽

Author(s):

Monica Charles Mhina ◽

Fabrice Labeau

Keyword(s):

Machine Learning ◽

Credit Scoring ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Mobile Money ◽

Scoring Model ◽

Credit Scoring Model


INTELLIGENT COOPERATIVE WEB CACHING POLICIES FOR MEDIA OBJECTS BASED ON J48 DECISION TREE AND NAÏVE BAYES SUPERVISED MACHINE LEARNING ALGORITHMS IN STRUCTURED PEER-TO-PEER SYSTEMS

Journal of Information and Communication Technology ◽

10.32890/jict2016.15.2.5 ◽

2016 ◽

Vol 15 (2) ◽

Author(s):

Hamidah Ibrahim ◽

Waheed Yasin ◽

Nur Izura Udzir ◽

Nor Asilah Wati Abdul Hamid

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Learning Algorithms ◽

Peer To Peer ◽

Web Caching ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Peer To Peer Systems ◽

Media Objects ◽

J48 Decision Tree


Supplemental Material for One Model to Rule Them All? Using Machine Learning Algorithms to Determine the Number of Factors in Exploratory Factor Analysis

Psychological Methods ◽

10.1037/met0000262.supp ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Number Of Factors


Forecasting US movies box office performances in Turkey using machine learning algorithms

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189120 ◽

2020 ◽

Vol 39 (5) ◽

pp. 6579-6590

Author(s):

Sandy Çağlıyor ◽

Başar Öztayşi ◽

Selime Sezgin

Keyword(s):

Machine Learning ◽

Global Economy ◽

Learning Algorithms ◽

Forecast Model ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

High Stakes ◽

Box Office ◽

Industry Forecast ◽

The Impact

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performance, and the industry still struggles to predict box office performance in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. A dataset of 1559 movies is constructed from various sources. First, the independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristics. The number of attendances is discretized into three classes. Four popular machine learning algorithms (artificial neural networks, decision tree regression, gradient boosting trees, and random forest) are employed, and the impact of each variable group is observed by comparing the performance of the models. Then the number of target classes is increased to five and eight, and the results are compared with previously developed models in the literature.
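The discretization step mentioned in this abstract can be sketched as follows. The abstract does not say how the class boundaries were chosen, so this minimal sketch assumes equal-frequency (quantile) binning; the function names and the attendance figures are hypothetical and invented for illustration only.

```python
from bisect import bisect_right


def quantile_cut_points(values, n_classes):
    """Return n_classes - 1 cut points that split values into roughly equal-sized bins.

    Assumption: equal-frequency binning. The abstract does not state the rule used.
    """
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(k * n) // n_classes] for k in range(1, n_classes)]


def discretize(value, cut_points):
    """Map a value to a class index between 0 and len(cut_points)."""
    return bisect_right(cut_points, value)


# Hypothetical attendance counts, not taken from the study's dataset.
attendance = [1200, 560000, 34000, 89000, 2500, 410000, 15000, 73000, 980000]

# Three target classes first, then a finer five-class target.
cuts3 = quantile_cut_points(attendance, 3)
labels3 = [discretize(a, cuts3) for a in attendance]

cuts5 = quantile_cut_points(attendance, 5)
labels5 = [discretize(a, cuts5) for a in attendance]

print(cuts3, labels3)
print(cuts5, labels5)
```

Raising the number of classes only changes `n_classes`, which mirrors the move from three target classes to five and eight described in the abstract.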


Intelligent system of English composition scoring model based on improved machine learning algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189235 ◽

2020 ◽

pp. 1-11

Author(s):

Jie Liu ◽

Lin Lin ◽

Xiufang Liang

Keyword(s):

Machine Learning ◽

Evaluation System ◽

Intelligent System ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Assessment System ◽

English Composition ◽

Region Extraction ◽

Constraint Model

Online English teaching systems place certain requirements on intelligent scoring, and the most difficult stage of intelligent scoring in an English test is scoring the composition with an intelligent model. To improve the intelligence of English composition scoring, this study combines machine learning algorithms with intelligent image recognition technology, and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, to verify whether the proposed algorithm model meets the requirements of the group text, that is, to verify the feasibility of the algorithm, the performance of the model is analyzed through designed experiments. Moreover, the basic conditions for composition scoring are input into the model as a constraint model. The results show that the proposed algorithm has a certain practical effect and that it can be applied to English assessment systems and online homework evaluation systems.


The Unlearnable Checkerboard Pattern

Communications of the Blyth Institute ◽

10.33014/issn.2640-5652.1.2.holloway.1 ◽

2019 ◽

Vol 1 (2) ◽

pp. 78-80

Author(s):

Eric Holloway

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Checkerboard Pattern ◽

Simple Task

Detecting some patterns is a simple task for humans, but nearly impossible for current machine learning algorithms. Here, the "checkerboard" pattern is examined, where human prediction nears 100% and machine prediction drops significantly below 50%.
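To make the checkerboard pattern concrete, here is a minimal sketch under stated assumptions: an 8x8 grid whose label is the parity of the two coordinates, evaluated against single-coordinate threshold rules. The grid size, the rule family, and the function names are hypothetical choices for illustration; this is not the experiment reported in the paper, and it only shows why such simple rules stay at chance level on this pattern.

```python
SIZE = 8  # Assumed grid size, chosen for illustration.


def checkerboard_label(x, y):
    """Label of a cell: 1 on one colour of the board, 0 on the other."""
    return (x + y) % 2


cells = [(x, y) for x in range(SIZE) for y in range(SIZE)]


def threshold_accuracy(axis, threshold):
    """Accuracy of the rule 'predict 1 when the chosen coordinate >= threshold'."""
    correct = sum(
        int(cell[axis] >= threshold) == checkerboard_label(*cell) for cell in cells
    )
    return correct / len(cells)


# Try every threshold on each axis. Each row or column holds equally many
# cells of both labels, so a rule that looks at one coordinate cannot do
# better or worse than chance.
accuracies = [threshold_accuracy(axis, t) for axis in (0, 1) for t in range(SIZE + 1)]
print(accuracies)
```

A learner that can only combine such axis-aligned splits greedily sees no improvement from any single split here, which is one intuition for why the pattern is hard to pick up.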
