A COST-SENSITIVE LOGISTIC REGRESSION CREDIT SCORING MODEL BASED ON MULTI-OBJECTIVE OPTIMIZATION APPROACH

2019 ◽  
Vol 26 (2) ◽  
pp. 405-429 ◽  
Author(s):  
Feng Shen ◽  
Run Wang ◽  
Yu Shen

Credit scoring is an important process for peer-to-peer (P2P) lending companies, as it determines whether loan applicants are likely to default. Most credit scoring models aim to minimize the classification error rate, which implies that all classification errors bear the same cost; in reality, however, credit scoring is strongly cost-sensitive, because different errors carry different costs. This paper therefore proposes a new cost-sensitive logistic regression credit scoring model based on a multi-objective optimization approach, with two objectives in the cost-sensitive logistic regression process. The cost-sensitive logistic regression parameters are solved using a multiple objective particle swarm optimization (MOPSO) algorithm. In the empirical analysis, the proposed model was applied to the credit scoring of a well-known Chinese P2P company. Compared with other common credit scoring models, the proposed model effectively reduced type II error rates and total classification error costs, and improved the AUC, the F1 values (the harmonic mean of Recall and Precision), and the G-means. The proposed model was also compared with other multi-objective optimization algorithms to further demonstrate that MOPSO is the best approach for cost-sensitive logistic regression credit scoring models.
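The evaluation measures named in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the labelling convention (1 = default as the positive class), the reading of the type II error rate as the share of defaulters predicted as non-defaulters, and the cost weights `cost_fn` and `cost_fp` are all assumptions chosen for illustration.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN, assuming 1 = default (positive) and 0 = non-default."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def cost_sensitive_report(y_true, y_pred, cost_fn=5.0, cost_fp=1.0):
    """Summarise a classifier with cost-aware measures.

    cost_fn and cost_fp are hypothetical unit costs: a missed default is
    assumed to be five times as costly as rejecting a good applicant.
    """
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    # G-mean is the geometric mean of recall and specificity.
    g_mean = math.sqrt(recall * specificity)
    # Assumed reading of "type II error": defaulters classified as non-defaulters.
    type_ii_rate = fn / (tp + fn) if tp + fn else 0.0
    total_cost = cost_fn * fn + cost_fp * fp
    return {
        "f1": f1,
        "g_mean": g_mean,
        "type_ii_rate": type_ii_rate,
        "total_cost": total_cost,
    }


# Toy labels, invented for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 1, 0]
report = cost_sensitive_report(y_true, y_pred)
```

With unequal costs, two classifiers with the same error rate can differ sharply in `total_cost`, which is the motivation the abstract gives for moving away from plain error-rate minimization.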

2021 ◽  
Vol 73 (7) ◽  
pp. 41-44
Author(s):  
Y.S. Zhieru

The final stage of constructing a logistic regression model is checking its validity and testing it on real data. The degree of validity of a logistic regression model is evidenced by its ability to correctly classify borrowers, that is, to distinguish "good" borrowers from "bad" borrowers.


Author(s):  
Wirot Yotsawat ◽  
Pakaket Wattuya ◽  
Anongnart Srivihok

Several credit-scoring models have been developed using ensemble classifiers in order to improve the accuracy of assessment. However, among the ensemble models, little attention has been paid to tuning the hyper-parameters of base learners, although these are crucial to constructing ensemble models. This study proposes an improved credit scoring model based on the extreme gradient boosting (XGB) classifier using Bayesian hyper-parameter optimization (XGB-BO). The model comprises two steps. First, data pre-processing is used to handle missing values and scale the data. Second, Bayesian hyper-parameter optimization is applied to tune the hyper-parameters of the XGB classifier, which is then trained with the tuned values. The model is evaluated on four widely used public datasets, i.e., the German, Australian, Lending Club, and Polish datasets. Several state-of-the-art classification algorithms are implemented for predictive comparison with the proposed method. The proposed model showed promising results, with an improvement in accuracy of 4.10%, 3.03%, and 2.76% on the German, Lending Club, and Australian datasets, respectively. The proposed model outperformed commonly used techniques, e.g., decision tree, support vector machine, neural network, logistic regression, random forest, and bagging, according to the evaluation results. The experimental results confirmed that the XGB-BO model is suitable for assessing the creditworthiness of applicants.


2021 ◽  
Vol 14 (1) ◽  
pp. 130
Author(s):  
Sunghyon Kyeong ◽  
Daehee Kim ◽  
Jinho Shin

The credit scoring model is one of the most important decision-making tools for the sustainability of banking systems. This study is the first to examine whether it can be improved by using system log data that are stored extensively for system operation. We used the log data recorded by the mobile application system of KakaoBank, a leading internet bank used by more than 14 million people in Korea. After generating candidate variables from KakaoBank’s log data, we created a credit scoring model by utilizing variables with high information values and logistic regression, the most common method for developing credit scoring models in financial institutions. To test our hypothesis on the improvement of credit scoring model performance, we performed an independent sample t-test using the simulation results of repeated model development and performance measurement based on randomly sampled data. Consequently, the discrimination power of the proposed model using logistic regression (neural network) compared to the credit bureau-based model significantly improved by 1.84 (2.22) percentage points based on the Kolmogorov–Smirnov statistics. The results of this study suggest that a bank can utilize the accumulated log data inside the bank to improve decision-making systems, including credit scoring, at a low cost.
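Two quantities mentioned in this abstract, the information value of a binned variable and the Kolmogorov–Smirnov statistic of a score, can be sketched in a few lines of Python. This is a generic textbook formulation, not the study's code: the function names, the binning, and the toy numbers are hypothetical, and `information_value` assumes every bin contains at least one good and one bad account.

```python
import math


def information_value(good_counts, bad_counts):
    """IV = sum over bins of (good share - bad share) * ln(good share / bad share).

    Assumes each bin has at least one good and one bad observation.
    """
    total_good = sum(good_counts)
    total_bad = sum(bad_counts)
    iv = 0.0
    for g, b in zip(good_counts, bad_counts):
        share_good = g / total_good
        share_bad = b / total_bad
        iv += (share_good - share_bad) * math.log(share_good / share_bad)
    return iv


def ks_statistic(scores_good, scores_bad):
    """Largest absolute gap between the empirical CDFs of the two score samples."""
    thresholds = sorted(set(scores_good) | set(scores_bad))
    ks = 0.0
    for t in thresholds:
        cdf_good = sum(s <= t for s in scores_good) / len(scores_good)
        cdf_bad = sum(s <= t for s in scores_bad) / len(scores_bad)
        ks = max(ks, abs(cdf_good - cdf_bad))
    return ks


# Toy data, invented for illustration only.
iv = information_value(good_counts=[80, 20], bad_counts=[20, 80])
ks = ks_statistic(scores_good=[0.2, 0.4, 0.6, 0.8], scores_bad=[0.1, 0.2, 0.3, 0.5])
```

A KS value of 0 means the score distributions of good and bad accounts coincide, and 1 means they are fully separated, so an improvement reported in percentage points refers to this gap widening.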


2018 ◽  
Vol 1 (1) ◽  
pp. 43-56
Author(s):  
Rio Hendriadi ◽  
Anne Putri ◽  
Dona Amelia ◽  
Rany Syafrina

Objective – This research designs and develops a credit scoring model for a conventional bank in order to determine individual loans. The research takes place at PT BPR Sungai Puar, Kabupaten Agam. The model is intended to evaluate the credit risk of BPR Sungai Puar. Design/methodology – The data are secondary, taken from the BPR Sungai Puar database, and are analyzed with two tools: discriminant analysis and logistic regression. Results – The results are presented in the form of a model and credit scoring perfection for PT BPR Sungai Puar, Kabupaten Agam. Keywords – Credit Scoring Model, Conventional Banks, Individual Loan


2019 ◽  
Vol 16 (8) ◽  
pp. 3514-3518
Author(s):  
Kamya Eria ◽  
Preethi Subramanian

Credit scoring plays a vital role in assessing the creditworthiness of loan applicants, thus speeding up the approval process. Credit score models, however, rely on the accuracy of classification models for their performance. This accuracy depends not only on the choice of data mining process; it is also heavily influenced by the quality of the data. Although no technique can be favored over the others, logistic regression has been widely employed in industry for its comprehensive simplicity. This study proposes a SEMMA-based credit scoring model developed with an improved Logistic Regression (LR) model. The improvements consist of excluding irrelevant features and adjusting the partition ratios. The model has been compared with the predominant models and produced outstanding results with minimal credit decision errors.


2011 ◽  
Vol 17 (1) ◽  
pp. 22-41 ◽  
Author(s):  
Xundi Diao ◽  
Heng Li ◽  
Saixing Zeng ◽  
Vivian Wy Tam ◽  
Hongling Guo

Shortening a project's duration will increase the cost and decrease the quality. Previous literature was mainly related to project planning and control, focusing on the cost-time tradeoff. However, limited research has addressed project quality using mathematical methodologies. This paper proposes a tradeoff problem on time-cost-quality performance. A computer-based Pareto multi-objective optimization approach is used to solve the tradeoff problem. The approach can help search for a set close to the true Pareto-optimal set without requiring any information on the stakeholders’ preferences for time, cost and quality. Based on the developed approach, decision-making becomes easier, guided by the sorted non-dominated solutions and project preferences.
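The notion of non-dominated solutions in a time-cost-quality tradeoff can be illustrated with a minimal Python sketch. It only filters a given list of candidate plans and does not reproduce the paper's search procedure; the `Plan` type and the sample plans are hypothetical, and the sketch assumes time and cost are minimized while quality is maximized.

```python
from typing import List, NamedTuple


class Plan(NamedTuple):
    """A hypothetical project plan scored on the three objectives."""
    name: str
    time: float     # lower is better
    cost: float     # lower is better
    quality: float  # higher is better


def dominates(a: Plan, b: Plan) -> bool:
    """True if a is at least as good as b on every objective and better on one."""
    no_worse = a.time <= b.time and a.cost <= b.cost and a.quality >= b.quality
    strictly_better = a.time < b.time or a.cost < b.cost or a.quality > b.quality
    return no_worse and strictly_better


def pareto_front(plans: List[Plan]) -> List[Plan]:
    """Keep only the plans that no other plan dominates."""
    return [p for p in plans if not any(dominates(q, p) for q in plans)]


# Sample plans, invented for illustration only.
plans = [
    Plan("A", time=10, cost=100, quality=0.9),
    Plan("B", time=12, cost=90, quality=0.8),
    Plan("C", time=12, cost=110, quality=0.8),  # dominated by A
    Plan("D", time=8, cost=150, quality=0.7),
]
front = pareto_front(plans)
front_names = [p.name for p in front]
```

No stakeholder weights are needed to compute the front; preferences only enter afterwards, when one plan is picked from the non-dominated set.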

