A Study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks

Author(s):  
Shihua Liu ◽  
Hao Zhang ◽  
Xianghua Liu

A two-stage clustering framework and a clustering algorithm for mixed attribute data, based on density peaks and the Goodall distance, are proposed. First, the subset of numerical attributes of the dataset is clustered, and the result is mapped to a one-dimensional categorical attribute and added to the subset of categorical attributes. Finally, the new dataset is clustered by the density peaks clustering algorithm to obtain the final result. Experiments on three commonly used UCI datasets show that this algorithm can effectively realize mixed attribute clustering and produces better clustering results than the traditional K-prototypes algorithm does. The clustering accuracy on the Acute, Heart and Credit datasets is 17%, 24%, and 21% higher on average, respectively, than that of K-prototypes.
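The two-stage procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say which algorithm clusters the numerical attributes, so the sketch reuses density peaks with Euclidean distance for stage one; the Goodall variant is a simplified frequency-based form (matches on rarer values count as more similar), which may differ from the one used in the paper; and the helper names (`cutoff`, `density_peaks`, `goodall_matrix`, `two_stage`), the quantile `q`, and the toy data are all hypothetical.

```python
import math
from collections import Counter

def cutoff(dist, q=0.3):
    """Cutoff distance d_c, taken as the q-quantile of pairwise distances (q is assumed)."""
    n = len(dist)
    pairs = sorted(dist[i][j] for i in range(n) for j in range(i + 1, n))
    return pairs[min(int(q * len(pairs)), len(pairs) - 1)]

def density_peaks(dist, k, q=0.3):
    """Minimal density peaks clustering on a precomputed n x n distance matrix."""
    n = len(dist)
    dc = cutoff(dist, q)
    # Local density rho with a Gaussian kernel.
    rho = [sum(math.exp(-(dist[i][j] / dc) ** 2) for j in range(n) if j != i)
           for i in range(n)]
    order = sorted(range(n), key=lambda i: -rho[i])  # descending density
    delta = [0.0] * n
    parent = [-1] * n
    # The densest point gets the largest distance in the matrix as its delta.
    delta[order[0]] = max(max(row) for row in dist)
    for rank in range(1, n):
        i = order[rank]
        # delta: distance to the nearest point that ranks higher in density.
        j = min(order[:rank], key=lambda h: dist[i][h])
        delta[i], parent[i] = dist[i][j], j
    # Centers: the densest point plus the k-1 points with the largest rho * delta.
    rest = sorted(order[1:], key=lambda i: -rho[i] * delta[i])
    centers = [order[0]] + rest[:k - 1]
    labels = [-1] * n
    for c, i in enumerate(centers):
        labels[i] = c
    # Remaining points inherit the label of their nearest denser neighbor.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels

def euclidean_matrix(rows):
    return [[math.dist(a, b) for b in rows] for a in rows]

def goodall_matrix(rows):
    """Simplified Goodall-style distance (assumed variant): a match on value x
    scores 1 - f(f-1)/(n(n-1)), where f is the frequency of x; a mismatch scores 0."""
    n, m = len(rows), len(rows[0])
    freq = [Counter(r[a] for r in rows) for a in range(m)]

    def sim(x, y):
        total = 0.0
        for a in range(m):
            if x[a] == y[a]:
                f = freq[a][x[a]]
                total += 1.0 - f * (f - 1) / (n * (n - 1))
        return total / m

    return [[0.0 if i == j else 1.0 - sim(rows[i], rows[j]) for j in range(n)]
            for i in range(n)]

def two_stage(numeric, categorical, k):
    # Stage 1: cluster the numerical attributes only.
    stage1 = density_peaks(euclidean_matrix(numeric), k)
    # Map the stage-1 result to one extra categorical attribute.
    merged = [list(c) + [f"c{l}"] for c, l in zip(categorical, stage1)]
    # Stage 2: cluster the extended categorical data with density peaks.
    return density_peaks(goodall_matrix(merged), k)

# Hypothetical toy data: two well-separated groups of three records each.
numeric = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2],
           [10.0, 10.1], [10.2, 10.0], [10.1, 10.2]]
categorical = [["a", "x"], ["a", "x"], ["a", "y"],
               ["b", "z"], ["b", "z"], ["b", "w"]]
labels = two_stage(numeric, categorical, k=2)
print(labels)
```

On this toy input the first three records and the last three records are expected to fall into separate clusters. The sketch does not handle a zero cutoff distance, which can occur when many records are identical, and real data would need a more careful choice of `q` and of the number of clusters.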

