A Novel Imbalanced Data Classification Method Based on Weakly Supervised Learning for Fault Diagnosis

Author(s):  
Hui Liu ◽  
Zhenyu Liu ◽  
Weiqiang Jia ◽  
Donghao Zhang ◽  
Jianrong Tan
2013 ◽  
Vol 443 ◽  
pp. 741-745
Author(s):  
Hu Li ◽  
Peng Zou ◽  
Wei Hong Han ◽  
Rong Ze Xia

Many real world data is imbalanced, i.e. one category contains significantly more samples than other categories. Traditional classification methods take different categories equally and are often ineffective. Based on the comprehensive analysis of existing researches, we propose a new imbalanced data classification method based on clustering. The method clusters both majority class and minority class at first. Then, clustered minority class will be over-sampled by SMOTE while clustered majority class be under-sampled randomly. Through clustering, the proposed method can avoid the loss of useful information while resampling. Experiments on several UCI datasets show that the proposed method can effectively improve the classification results on imbalanced data.


Sign in / Sign up

Export Citation Format

Share Document