A Novel Imbalanced Data Classification Approach Based on Logistic Regression and Fisher Discriminant

Mathematical Problems in Engineering ◽

10.1155/2015/945359 ◽

2015 ◽

Vol 2015 ◽

pp. 1-12

Author(s):

Baofeng Shi ◽

Jing Wang ◽

Junyan Qi ◽

Yanqiu Cheng

Keyword(s):

Logistic Regression ◽

Imbalanced Data ◽

Data Classification ◽

Classification Approach ◽

Fisher Discriminant ◽

Proposed Model ◽

Imbalanced Data Classification ◽

Customer Classification ◽

Key Indicators

We introduce an imbalanced data classification approach based on logistic regression significance discrimination and the Fisher discriminant. First, a key indicator extraction model based on logistic regression significance discrimination and correlation analysis is derived to extract features for customer classification. Second, a customer scoring model is established by linear weighting, with the weights obtained from the Fisher discriminant. Third, a customer rating model is constructed in which the number of customers across the ratings follows a normal distribution. The proposed model and the classical SVM classification method are evaluated in terms of their ability to correctly classify customers as default or nondefault. Empirical results on data from 2157 customers in financial engineering suggest that the proposed approach performs better than the SVM model on imbalanced data classification. Moreover, our approach helps banks and bond investors locate qualified customers.
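The scoring step in this abstract can be illustrated with the textbook two-class Fisher discriminant, which picks weights proportional to the inverse within-class scatter matrix times the difference of the class means. The sketch below is not the authors' implementation: the function names, the toy indicator values, and the choice to normalise the weights so their absolute values sum to 1 are all assumptions made for illustration.

```python
# Minimal sketch of Fisher-discriminant weights for a linear customer score.
# All names and data are hypothetical; the abstract does not give the
# paper's exact weighting or normalisation.

def mean_vector(rows):
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]

def scatter_matrix(rows, mean):
    """Sum of outer products of the deviations from the class mean."""
    d = len(mean)
    s = [[0.0] * d for _ in range(d)]
    for r in rows:
        diff = [r[j] - mean[j] for j in range(d)]
        for a in range(d):
            for b in range(d):
                s[a][b] += diff[a] * diff[b]
    return s

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def fisher_weights(nondefault, default):
    """w proportional to S_w^{-1} (mean_nondefault - mean_default)."""
    m_good, m_bad = mean_vector(nondefault), mean_vector(default)
    s_good = scatter_matrix(nondefault, m_good)
    s_bad = scatter_matrix(default, m_bad)
    d = len(m_good)
    s_w = [[s_good[i][j] + s_bad[i][j] for j in range(d)] for i in range(d)]
    w = solve(s_w, [m_good[j] - m_bad[j] for j in range(d)])
    total = sum(abs(v) for v in w)  # assumed normalisation
    return [v / total for v in w]

def score(customer, weights):
    """Linear weighted score: higher means closer to the nondefault class."""
    return sum(w * x for w, x in zip(weights, customer))

# Toy, imbalanced sample: six nondefault customers, two default customers,
# each described by two already-standardised indicators.
nondefault = [[0.9, 0.8], [0.8, 0.7], [0.7, 0.9],
              [0.85, 0.75], [0.75, 0.85], [0.8, 0.8]]
default = [[0.3, 0.4], [0.2, 0.3]]

weights = fisher_weights(nondefault, default)
print([round(w, 3) for w in weights])
print(round(score([0.6, 0.5], weights), 3))
```

Because the weights depend only on class means and within-class scatter, the small default class still contributes its own mean to the projection direction, which is one reason a discriminant of this kind is a natural candidate for imbalanced samples.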

A novel imbalanced data classification approach for suicidal ideation detection on social media

Computing ◽

10.1007/s00607-021-00984-0 ◽

2021

Author(s):

Mohamed Ali Ben Hassine ◽

Safa Abdellatif ◽

Sadok Ben Yahia

Keyword(s):

Social Media ◽

Suicidal Ideation ◽

Imbalanced Data ◽

Data Classification ◽

Classification Approach ◽

Imbalanced Data Classification

A Novel Model for Imbalanced Data Classification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6145 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6680-6687

Author(s):

Jian Yin ◽

Chunjing Gan ◽

Kaiqi Zhao ◽

Xuan Lin ◽

Zhe Quan ◽

...

Keyword(s):

Imbalanced Data ◽

Data Classification ◽

Classification Performance ◽

Classification Model ◽

Proposed Model ◽

Imbalanced Data Classification ◽

Public Datasets ◽

Distribution Cost ◽

Novel Model

Recently, imbalanced data classification has received much attention because of its wide range of applications. Existing studies have attempted to improve classification performance by considering factors such as the imbalanced distribution, cost-sensitive learning, data space improvement, and ensemble learning. Nevertheless, most existing methods address only some of these aspects. In this work, we propose a novel imbalanced data classification model that considers all of them. To evaluate the proposed model, we conducted experiments on 14 public datasets. The results show that our model outperforms state-of-the-art methods in terms of recall, G-mean, F-measure, and AUC.
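The four metrics named in this abstract have standard definitions, and a small sketch shows why they are preferred to plain accuracy on imbalanced data. The code below uses those standard definitions (G-mean as the geometric mean of recall and specificity, F-measure as the harmonic mean of precision and recall, AUC as the fraction of positive-negative pairs ranked correctly). The labels, scores, and the 0.5 threshold are made-up toy values, not results from the paper.

```python
import math

def confusion_counts(y_true, y_pred):
    """Return (tp, fn, fp, tn), with label 1 as the minority/positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fn, fp, tn

def safe_div(num, den):
    return num / den if den else 0.0

def recall(tp, fn, fp, tn):
    return safe_div(tp, tp + fn)

def g_mean(tp, fn, fp, tn):
    specificity = safe_div(tn, tn + fp)
    return math.sqrt(recall(tp, fn, fp, tn) * specificity)

def f_measure(tp, fn, fp, tn):
    precision = safe_div(tp, tp + fp)
    rec = recall(tp, fn, fp, tn)
    return safe_div(2 * precision * rec, precision + rec)

def auc(y_true, scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return safe_div(wins, len(pos) * len(neg))

# Toy example: 2 positives among 10 samples (hypothetical scores).
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.4, 0.6, 0.3, 0.2, 0.1, 0.35, 0.05, 0.15, 0.25]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

counts = confusion_counts(y_true, y_pred)
accuracy = (counts[0] + counts[3]) / len(y_true)
print("accuracy ", accuracy)
print("recall   ", recall(*counts))
print("G-mean   ", round(g_mean(*counts), 4))
print("F-measure", f_measure(*counts))
print("AUC      ", auc(y_true, scores))
```

On this toy sample the accuracy is 0.8 even though only half of the minority class is found, which is the gap that recall, G-mean, and F-measure are meant to expose.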

Confusion-Matrix-Based Kernel Logistic Regression for Imbalanced Data Classification

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2017.2682249 ◽

2017 ◽

Vol 29 (9) ◽

pp. 1806-1819

Author(s):

Miho Ohsaki ◽

Peng Wang ◽

Kenji Matsuda ◽

Shigeru Katagiri ◽

Hideyuki Watanabe ◽

...

Keyword(s):

Logistic Regression ◽

Confusion Matrix ◽

Imbalanced Data ◽

Data Classification ◽

Kernel Logistic Regression ◽

Imbalanced Data Classification

Radial-Based Undersampling for imbalanced data classification

Pattern Recognition ◽

10.1016/j.patcog.2020.107262 ◽

2020 ◽

Vol 102 ◽

pp. 107262

Author(s):

Michał Koziarski

Keyword(s):

Imbalanced Data ◽

Data Classification ◽

Imbalanced Data Classification

Research of Medical High-Dimensional Imbalanced Data Classification Ensemble Feature Selection Algorithm with Random Forest

2017 International Conference on Smart Grid and Electrical Automation (ICSGEA) ◽

10.1109/icsgea.2017.158 ◽

2017

Author(s):

Min Zhu ◽

Bo Su ◽

Gangmin Ning

Keyword(s):

Feature Selection ◽

Random Forest ◽

Imbalanced Data ◽

Data Classification ◽

High Dimensional ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Imbalanced Data Classification

Data reduction and stacking for imbalanced data classification

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-179335 ◽

2019 ◽

Vol 37 (6) ◽

pp. 7239-7249

Author(s):

Ireneusz Czarnowski ◽

Piotr Jędrzejowicz

Keyword(s):

Data Reduction ◽

Imbalanced Data ◽

Data Classification ◽

Imbalanced Data Classification

An Under-Sampling Method with Support Vectors in Multi-class Imbalanced Data Classification

2019 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA) ◽

10.1109/skima47702.2019.8982391 ◽

2019

Author(s):

Md. Yasir Arafat ◽

Sabera Hoque ◽

Shuxiang Xu ◽

Dewan Md. Farid

Keyword(s):

Sampling Method ◽

Imbalanced Data ◽

Data Classification ◽

Support Vectors ◽

Imbalanced Data Classification

Imbalanced data classification using complementary fuzzy support vector machine techniques and SMOTE

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc.2017.8122737 ◽

2017

Author(s):

Ratchakoon Pruengkarn ◽

Kok Wai Wong ◽

Chun Che Fung

Keyword(s):

Support Vector Machine ◽

Imbalanced Data ◽

Data Classification ◽

Support Vector ◽

Fuzzy Support Vector Machine ◽

Imbalanced Data Classification

Imbalanced data classification algorithm based on boosting and cascade model

2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/icsmc.2012.6378183 ◽

2012

Author(s):

Xiaolong Zhang ◽

Chao Cheng

Keyword(s):

Imbalanced Data ◽

Data Classification ◽

Classification Algorithm ◽

Cascade Model ◽

Imbalanced Data Classification

UFFDFR: Undersampling framework with denoising, fuzzy c-means clustering, and representative sample selection for imbalanced data classification

Information Sciences ◽

10.1016/j.ins.2021.07.053 ◽

2021

Author(s):

Ming Zheng ◽

Tong Li ◽

Xiaoyao Zheng ◽

Qingying Yu ◽

Chuanming Chen ◽

...

Keyword(s):

Representative Sample ◽

Sample Selection ◽

Imbalanced Data ◽

Data Classification ◽

Fuzzy C Means ◽

Imbalanced Data Classification ◽

Fuzzy C Means Clustering
