2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
Xin Wang ◽  
Yue Yang ◽  
Mingsong Chen ◽  
Qin Wang ◽  
Qin Qin ◽  
...  

Aiming at low classification accuracy of imbalanced datasets, an oversampling algorithm—AGNES-SMOTE (Agglomerative Nesting-Synthetic Minority Oversampling Technique) based on hierarchical clustering and improved SMOTE—is proposed. Its key procedures include hierarchically cluster majority samples and minority samples, respectively; divide minority subclusters on the basis of the obtained majority subclusters; select “seed sample” based on the sampling weight and probability distribution of minority subcluster; and restrict the generation of new samples in a certain area by centroid method in the sampling process. The combination of AGNES-SMOTE and SVM (Support Vector Machine) is presented to deal with imbalanced datasets classification. Experiments on UCI datasets are conducted to compare the performance of different algorithms mentioned in the literature. Experimental results indicate AGNES-SMOTE excels in synthesizing new samples and improves SVM classification performance on imbalanced datasets.


2017 ◽  
Vol 34 (3) ◽  
pp. 427-443 ◽  
Author(s):  
Haydemar Núñez ◽  
Luis Gonzalez-Abril ◽  
Cecilio Angulo

2009 ◽  
Vol 29 (4) ◽  
pp. 1064-1067 ◽  
Author(s):  
Ming LIU ◽  
Ting-ing WANG ◽  
Xiao-an HUANG ◽  
Rui LIU

Sign in / Sign up

Export Citation Format

Share Document