Discovering Similarity Across Heterogeneous Features
2020 ◽
Vol 16
(4)
◽
pp. 63-83
Keyword(s):
The analysis of both continuous and categorical attributes generating a heterogeneous mix of attributes poses challenges in data clustering. Traditional clustering techniques like k-means clustering work well when applied to small homogeneous datasets. However, as the data size becomes large, it becomes increasingly difficult to find meaningful and well-formed clusters. In this paper, the authors propose an approach that utilizes a combined similarity function, which looks at similarity across numeric and categorical features and employs this function in a clustering algorithm to identify similarity between data objects. The findings indicate that the proposed approach handles heterogeneous data better by forming well-separated clusters.
2019 ◽
Vol 8
(4)
◽
pp. 1657-1664
2019 ◽
Vol 27
(04)
◽
pp. 669-688
◽
2017 ◽
2018 ◽
Vol 27
(3)
◽
pp. 317-329
◽
2018 ◽
Vol 6
(2)
◽
pp. 176-183
2014 ◽
Vol 543-547
◽
pp. 1934-1938