Two novel fuzzy clustering methods for solving data clustering problems

Data mining is the general methodology for retrieving useful information from big data. Clustering analysis is a mathematical method of classification for unsupervised machine learning. It can be adopted for data classification in Data mining. This paper combines the clustering process by fuzzy way and then deduces a special clustering algorithm with fast fuzzy c-means (FFCM) method. In summary, the paper illustrates the adoption of a series of fuzzy clustering methods in Data Mining. These methods have improved the computational efficiency with learning as the convergence speed is fast. The methodology of this paper presents significantly meaningful for information retrieval of big data.

Download Full-text

Fuzzy Clustering Methods with Rényi Relative Entropy and Cluster Size

Mathematics ◽

10.3390/math9121423 ◽

2021 ◽

Vol 9 (12) ◽

pp. 1423

Author(s):

Javier Bonilla ◽

Daniel Vélez ◽

Javier Montero ◽

J. Tinguaro Rodríguez

Keyword(s):

Relative Entropy ◽

Fuzzy Clustering ◽

Cluster Size ◽

Computational Study ◽

Gaussian Kernel ◽

Clustering Methods ◽

Rényi Divergence ◽

Divergence Measures ◽

Fuzzy Clustering Methods ◽

Rényi Relative Entropy

In the last two decades, information entropy measures have been relevantly applied in fuzzy clustering problems in order to regularize solutions by avoiding the formation of partitions with excessively overlapping clusters. Following this idea, relative entropy or divergence measures have been similarly applied, particularly to enable that kind of entropy-based regularization to also take into account, as well as interact with, cluster size variables. Particularly, since Rényi divergence generalizes several other divergence measures, its application in fuzzy clustering seems promising for devising more general and potentially more effective methods. However, previous works making use of either Rényi entropy or divergence in fuzzy clustering, respectively, have not considered cluster sizes (thus applying regularization in terms of entropy, not divergence) or employed divergence without a regularization purpose. Then, the main contribution of this work is the introduction of a new regularization term based on Rényi relative entropy between membership degrees and observation ratios per cluster to penalize overlapping solutions in fuzzy clustering analysis. Specifically, such Rényi divergence-based term is added to the variance-based Fuzzy C-means objective function when allowing cluster sizes. This then leads to the development of two new fuzzy clustering methods exhibiting Rényi divergence-based regularization, the second one extending the first by considering a Gaussian kernel metric instead of the Euclidean distance. Iterative expressions for these methods are derived through the explicit application of Lagrange multipliers. An interesting feature of these expressions is that the proposed methods seem to take advantage of a greater amount of information in the updating steps for membership degrees and observations ratios per cluster. Finally, an extensive computational study is presented showing the feasibility and comparatively good performance of the proposed methods.

Download Full-text

Fuzzy geodemographics: a contribution from fuzzy clustering methods

Innovations In GIS 5 ◽

10.1201/b16831-20 ◽

1998 ◽

pp. 141-149 ◽

Cited By ~ 1

Keyword(s):

Fuzzy Clustering ◽

Clustering Methods ◽

Fuzzy Clustering Methods

Download Full-text

Some context fuzzy clustering methods for classification problems

Proceedings of the 2010 Symposium on Information and Communication Technology - SoICT '10 ◽

10.1145/1852611.1852619 ◽

2010 ◽

Cited By ~ 10

Author(s):

Bui Cong Cuong ◽

Le Hoang Son ◽

Hoang Thi Minh Chau

Keyword(s):

Fuzzy Clustering ◽

Clustering Methods ◽

Classification Problems ◽

Fuzzy Clustering Methods

Download Full-text

Data Clustering Algorithms Using Rough Sets

Handbook of Research on Computational Intelligence for Engineering, Science, and Business ◽

10.4018/978-1-4666-2518-1.ch012 ◽

2013 ◽

pp. 297-327 ◽

Cited By ~ 6

Author(s):

B.K. Tripathy ◽

Adhir Ghosh

Keyword(s):

Comparative Study ◽

Rough Set ◽

Fuzzy Clustering ◽

Fuzzy Set ◽

Rough Sets ◽

Data Clustering ◽

Clustering Algorithms ◽

Clustering Methods ◽

Future Studies ◽

Multiple Clusters

Developing Data Clustering algorithms have been pursued by researchers since the introduction of k-means algorithm (Macqueen 1967; Lloyd 1982). These algorithms were subsequently modified to handle categorical data. In order to handle the situations where objects can have memberships in multiple clusters, fuzzy clustering and rough clustering methods were introduced (Lingras et al 2003, 2004a). There are many extensions of these initial algorithms (Lingras et al 2004b; Lingras 2007; Mitra 2004; Peters 2006, 2007). The MMR algorithm (Parmar et al 2007), its extensions (Tripathy et al 2009, 2011a, 2011b) and the MADE algorithm (Herawan et al 2010) use rough set techniques for clustering. In this chapter, the authors focus on rough set based clustering algorithms and provide a comparative study of all the fuzzy set based and rough set based clustering algorithms in terms of their efficiency. They also present problems for future studies in the direction of the topics covered.

Download Full-text

Lung cancer detection by using artificial neural network and fuzzy clustering methods

2011 IEEE GCC Conference and Exhibition (GCC) ◽

10.1109/ieeegcc.2011.5752535 ◽

2011 ◽

Cited By ~ 20

Author(s):

Fatma Taher ◽

Rachid Sammouda

Keyword(s):

Neural Network ◽

Lung Cancer ◽

Artificial Neural Network ◽

Cancer Detection ◽

Fuzzy Clustering ◽

Clustering Methods ◽

Fuzzy Clustering Methods ◽

Artificial Neural ◽

Lung Cancer Detection

Download Full-text

Using fuzzy clustering methods for delineating urban housing submarkets

Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems - GIS '07 ◽

10.1145/1341012.1341031 ◽

2007 ◽

Cited By ~ 7

Author(s):

Sungsoon Hwang ◽

Jean-Claude Thill

Keyword(s):

Fuzzy Clustering ◽

Urban Housing ◽

Clustering Methods ◽

Fuzzy Clustering Methods ◽

Housing Submarkets

Download Full-text

Fuzzy Clustering Methods for Categorical Multivariate Data Based on q-Divergence

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2018.p0524 ◽

2018 ◽

Vol 22 (4) ◽

pp. 524-536 ◽

Cited By ~ 2

Author(s):

Tadafumi Kondo ◽

◽

Yuchi Kanzawa

Keyword(s):

Conventional Method ◽

Fuzzy Clustering ◽

Optimization Problems ◽

Clustering Algorithms ◽

Multivariate Data ◽

Clustering Methods ◽

Kl Divergence ◽

Fuzzy Clustering Methods ◽

Conventional Methods ◽

Vectorial Data

This paper presents two fuzzy clustering algorithms for categorical multivariate data based on q-divergence. First, this study shows that a conventional method for vectorial data can be explained as regularizing another conventional method using q-divergence. Second, based on the known results that Kullback-Leibler (KL)-divergence is generalized into the q-divergence, and two conventional fuzzy clustering methods for categorical multivariate data adopt KL-divergence, two fuzzy clustering algorithms for categorical multivariate data that are based on q-divergence are derived from two optimization problems built by extending the KL-divergence in these conventional methods to the q-divergence. Through numerical experiments using real datasets, the proposed methods outperform the conventional methods in term of clustering accuracy.

Download Full-text