Heuristic Approaches in Clustering Problems
Clustering is an approach used in data mining to classify objects in parallel with similarities or separate according to dissimilarities. The aim of clustering is to decrease the amount of data by grouping similar data items together. There are different methods to cluster. One of the most popular techniques is K-means algorithm and widely used in literature to solve clustering problem is discussed. Although it is a simple and fast algorithm, there are two main drawbacks. One of them is that, in minimizing problems, solution may trap into local minimum point since objective function is not convex. Since the clustering is an NP-hard problem and to avoid converging to a local minimum point, several heuristic algorithms applied to clustering analysis. The heuristic approaches are a good way to reach solution in a short time. Five approaches are mentioned briefly in the chapter and given some directions for details. For an example, particle swarm optimization approach was used for clustering problem. In example, iris dataset including 3 clusters and 150 data was used.