A COST-SENSITIVE LOGISTIC REGRESSION CREDIT SCORING MODEL BASED ON MULTI-OBJECTIVE OPTIMIZATION APPROACH

Credit scoring is an important process for peer-to-peer (P2P) lending companies as it determines whether loan applicants are likely to default. The aim of most credit scoring models is to minimize the classification error rate, which implies that all classification errors bear the same cost; however, in reality, there is a significant cost-sensitive problem in credit scoring methods. Therefore, in this paper, a new cost-sensitive logistic regression credit scoring model based on a multi-objective optimization approach is proposed that has two objectives in the cost-sensitive logistic regression process. The cost-sensitive logistic regression parameters are solved using a multiple objective particle swarm optimization (MOPSO) algorithm. In the empirical analysis, the proposed model was applied to the credit scoring of a Chinese famous P2P company, from which it was found that compared with other common credit scoring models, the proposed model was able to effectively reduce type II error rates and total classification error costs, and improve the AUC, the F1 values (reconciliation average of Recall and Precision), and the G-means. The proposed model was compared with other multi-objective optimization algorithms to further demonstrate that MOPSO is the best approach for cost-sensitive logistic regression credit scoring models.

Download Full-text

A multi-objective optimization approach for exploring the cost and makespan trade-off in additive manufacturing

European Journal of Operational Research ◽

10.1016/j.ejor.2021.10.020 ◽

2021 ◽

Author(s):

F. Tevhide Altekin ◽

Yossi Bukchin

Keyword(s):

Additive Manufacturing ◽

Optimization Approach ◽

Multi Objective Optimization ◽

Trade Off ◽

Multi Objective ◽

The Cost

Download Full-text

Application of Improved SMOTE Algorithm in Logistic Regression Credit Scoring Model

Hans Journal of Data Mining ◽

10.12677/hjdm.2021.112006 ◽

2021 ◽

Vol 11 (02) ◽

pp. 50-58

Author(s):

芷慧许

Keyword(s):

Logistic Regression ◽

Credit Scoring ◽

Scoring Model ◽

Credit Scoring Model

Download Full-text

A Hybrid Credit Scoring Model Using Neural Networks and Logistic Regression

Advances in Intelligent Information Hiding and Multimedia Signal Processing - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-13-9714-1_27 ◽

2019 ◽

pp. 251-258 ◽

Cited By ~ 1

Author(s):

Lkhagvadorj Munkhdalai ◽

Jong Yun Lee ◽

Keun Ho Ryu

Keyword(s):

Neural Networks ◽

Logistic Regression ◽

Credit Scoring ◽

Scoring Model ◽

Credit Scoring Model

Download Full-text

Methodology for the validation of the credit scoring model of the retail portfolio

10.18411/lj-05-2021-265 ◽

2021 ◽

Vol 73 (7) ◽

pp. 41-44

Author(s):

Y.S. Zhieru

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Final Stage ◽

Logistic Regression Model ◽

Credit Scoring ◽

Real Data ◽

Scoring Model ◽

Credit Scoring Model

The final stage of constructing a logistic regression model is checking its validity and testing it on real data. The degree of validity of a logistic regression model is evidenced by its ability to correctly classify borrowers, the model's ability to distinguish "good" borrowers from "bad" borrowers.

Download Full-text

Technology credit scoring model with fuzzy logistic regression

Applied Soft Computing ◽

10.1016/j.asoc.2016.02.025 ◽

2016 ◽

Vol 43 ◽

pp. 150-158 ◽

Cited By ~ 41

Author(s):

So Young Sohn ◽

Dong Ha Kim ◽

Jin Hee Yoon

Keyword(s):

Logistic Regression ◽

Credit Scoring ◽

Scoring Model ◽

Fuzzy Logistic Regression ◽

Credit Scoring Model

Download Full-text

Improved credit scoring model using XGBoost with Bayesian hyper-parameter optimization

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i6.pp5477-5487 ◽

2021 ◽

Vol 11 (6) ◽

pp. 5477

Author(s):

Wirot Yotsawat ◽

Pakaket Wattuya ◽

Anongnart Srivihok

Keyword(s):

Parameter Optimization ◽

Missing Values ◽

Credit Scoring ◽

Gradient Boosting ◽

Support Vector ◽

Scoring Model ◽

Ensemble Models ◽

Proposed Model ◽

Extreme Gradient Boosting ◽

Credit Scoring Model

<span>Several credit-scoring models have been developed using ensemble classifiers in order to improve the accuracy of assessment. However, among the ensemble models, little consideration has been focused on the hyper-parameters tuning of base learners, although these are crucial to constructing ensemble models. This study proposes an improved credit scoring model based on the extreme gradient boosting (XGB) classifier using Bayesian hyper-parameters optimization (XGB-BO). The model comprises two steps. Firstly, data pre-processing is utilized to handle missing values and scale the data. Secondly, Bayesian hyper-parameter optimization is applied to tune the hyper-parameters of the XGB classifier and used to train the model. The model is evaluated on four widely public datasets, i.e., the German, Australia, lending club, and Polish datasets. Several state-of-the-art classification algorithms are implemented for predictive comparison with the proposed method. The results of the proposed model showed promising results, with an improvement in accuracy of 4.10%, 3.03%, and 2.76% on the German, lending club, and Australian datasets, respectively. The proposed model outperformed commonly used techniques, e.g., decision tree, support vector machine, neural network, logistic regression, random forest, and bagging, according to the evaluation results. The experimental results confirmed that the XGB-BO model is suitable for assessing the creditworthiness of applicants.</span>

Download Full-text

Can System Log Data Enhance the Performance of Credit Scoring?—Evidence from an Internet Bank in Korea

Sustainability ◽

10.3390/su14010130 ◽

2021 ◽

Vol 14 (1) ◽

pp. 130

Author(s):

Sunghyon Kyeong ◽

Daehee Kim ◽

Jinho Shin

Keyword(s):

Decision Making ◽

Logistic Regression ◽

Credit Scoring ◽

Model Performance ◽

Sampled Data ◽

Discrimination Power ◽

Log Data ◽

Scoring Model ◽

Internet Bank ◽

Credit Scoring Model

The credit scoring model is one of the most important decision-making tools for the sustainability of banking systems. This study is the first to examine whether it can be improved by using system log data that are stoed extensively for system operation. We used the log data recorded by the mobile application system of KakaoBank, a leading internet bank used by more than 14 million people in Korea. After generating candidate variables from KakaoBank’s log data, we created a credit scoring model by utilizing variables with high information values and logistic regression, the most common method for developing credit scoring models in financial institutions. To prove our hypothesis on the improvement of credit scoring model performance, we performed an independent sample t-test using the simulation results of repeated model development and performance measurement based on randomly sampled data. Consequently, the discrimination power of the proposed model using logistic regression (neural network) compared to the credit bureau-based model significantly improved by 1.84 (2.22) percentage points based on the Kolmogorov–Smirnov statistics. The results of this study suggest that a bank can utilize the accumulated log data inside the bank to improve decision-making systems, including credit scoring, at a low cost.

Download Full-text

Design and Development of Credit Scoring Model for Conventional Banks for Individual Borrowing Case Study on PT BPR Sungai Puar District Agam

Journal of Accounting Research, Organization and Economics ◽

10.24815/jaroe.v1i1.10749 ◽

2018 ◽

Vol 1 (1) ◽

pp. 43-56

Author(s):

Rio Hendriadi ◽

Anne Putri ◽

Dona Amelia ◽

Rany Syafrina

Keyword(s):

Logistic Regression ◽

Discriminant Analysis ◽

Credit Risk ◽

Credit Scoring ◽

Scoring Model ◽

Secondary Sources ◽

Conventional Banks ◽

Credit Scoring Model ◽

Conventional Bank

Objective – This research is conducted to design and to develop credit scoring model on conventional bank in order to determine individual loan, the research takes place in PT BPR Sungai Puar, Kabupaten Agam. This model tries to evaluate the credit risk of BPR Sungai Puar.Design/methodology – The data are considered as secondary sources as they are taken from BPR Sungai Puar database by classifying them into two analysis tools including discriminant analysis and logistic regression. Results – The resuts are presentes inform of model and credit scoring perfection on PT BPR Sungai Puar Kabupaten Agam.Keywords Credit Scoring Model, Conventional Banks, Individual Loan

Download Full-text

Decision Support Credit Scoring Model to Improve Loan Default Prediction in Financial Institutions

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.8316 ◽

2019 ◽

Vol 16 (8) ◽

pp. 3514-3518

Author(s):

Kamya Eria ◽

Preethi Subramanian

Keyword(s):

Logistic Regression ◽

Credit Scoring ◽

Vital Role ◽

Quality Of Data ◽

Loan Default ◽

Scoring Model ◽

Credit Score ◽

Default Prediction ◽

Credit Scoring Model

Credit scoring plays a vital role in assessing the creditworthiness of loan applicants thus speeding up the approval process. Credit score models however rely on the accuracy of classification models for their performance. This accuracy performance depends not only on the choice of data mining process; it is heavily influenced by the quality of data as well. Although no techniques can be favored over the other, it has been evidenced that logistic regression has been widely employed as an industrial technique for its comprehensive simplicity. This study proposes a SEMMA-based credit scoring model developed with an improved Logistic Regression (LR) model. Improvements are by exclusion of irrelevant features and adjusting the partition ratios. The model has been compared with the predominant models and proved to contain outstanding results with minimal credit decision errors.

Download Full-text

A PARETO MULTI-OBJECTIVE OPTIMIZATION APPROACH FOR SOLVING TIME-COST-QUALITY TRADEOFF PROBLEMS

Technological and Economic Development of Economy ◽

10.3846/13928619.2011.553988 ◽

2011 ◽

Vol 17 (1) ◽

pp. 22-41 ◽

Cited By ~ 8

Author(s):

Xundi Diao ◽

Heng Li ◽

Saixing Zeng ◽

Vivian Wy Tam ◽

Hongling Guo

Keyword(s):

Project Planning ◽

Optimization Approach ◽

Time Cost ◽

Multi Objective Optimization ◽

Quality Performance ◽

Multi Objective ◽

Pareto Optimal Set ◽

Computer Based ◽

The Cost ◽

Optimal Set

Speeding up a project's duration will definitely increase the cost and decrease the quality. The previous literatures were mainly related to project planning and controlling which mainly focus on cost-time tradeoff. However, limited researches have been referred to project quality based on mathematical methodologies. This paper proposes a tradeoff problem on time-cost-quality performance. A computer-based Pareto multi-objective optimization approach is utilized for solving the tradeoff problems. The approach can help searching near the reality Pareto-optimal set while not receiving any information on the stakeholders’ preference for time, cost and quality. Based on the developed approach, decision-making can become easy according to the sorted non-dominated solutions and project preferences.

Download Full-text