scholarly journals Fuzzy Clustering in Assortative and Disassortative Networks

Author(s):  
Ryoichi Kojima ◽  
Roberto Legaspi ◽  
Toshiaki Murofushi ◽  
◽  

Despite the significance of assortativity as a property of networks that paves for the emergence of new structural types, surprisingly, there has been little research done on assortativity. Assortative networks are perhaps among the most prominent examples of complex networks believed to be governed by common phenomena, thereby producing structures far from random. Further, certain vertices possess high centrality and can be regarded as significant and influential vertices that can become cluster centers that connect with high membership to many of the surrounding vertices. We propose a fuzzy clustering method to meaningfully characterize assortative, as well as disassortative, networks by adapting the Bonacichi’s power centrality to seek the high degree centrality vertices to become cluster centers. Moreover, we leverage our novel modularity function to determine the optimal number of clusters, as well as the optimal membership among clusters. However, due to the difficulty of finding real-world assortative network datasets that come with ground truths, we evaluated our method using synthetic data but possibly bearing resemblance to real-world network datasets as they were generated by the Lancichinetti–Fortunato–Radicchi benchmark. Our results show our non-hierarchical method outperforms a known hierarchical fuzzy clustering method, and also performs better than a well-known membership-based modularity function. Our method proved to perform beyond satisfactory for both assortative and disassortative networks.

Author(s):  
Fariba Salehi ◽  
Mohammad Reza Keyvanpour ◽  
Arash Sharifi

Kursor ◽  
2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Annisa Eka Haryati ◽  
Sugiyarto Sugiyarto ◽  
Rizki Desi Arindra Putri

Multivariate statistics have related problems with large data dimensions. One method that can be used is principal component analysis (PCA). Principal component analysis (PCA) is a technique used to reduce data dimensions consisting of several dependent variables while maintaining variance in the data. PCA can be used to stabilize measurements in statistical analysis, one of which is cluster analysis. Fuzzy clustering is a method of grouping based on membership values ​​that includes fuzzy sets as a weighting basis for grouping. In this study, the fuzzy clustering method used is Fuzzy Subtractive Clustering (FSC) and Fuzzy C-Means (FCM) with a combination of the Minkowski Chebysev distance. The purpose of this study was to compare the cluster results obtained from the FSC and FCM using the DBI validity index. The results obtained indicate that the results of clustering using FCM are better than the FSC.


2021 ◽  
pp. 1-16
Author(s):  
Aikaterini Karanikola ◽  
Charalampos M. Liapis ◽  
Sotiris Kotsiantis

In short, clustering is the process of partitioning a given set of objects into groups containing highly related instances. This relation is determined by a specific distance metric with which the intra-cluster similarity is estimated. Finding an optimal number of such partitions is usually the key step in the entire process, yet a rather difficult one. Selecting an unsuitable number of clusters might lead to incorrect conclusions and, consequently, to wrong decisions: the term “optimal” is quite ambiguous. Furthermore, various inherent characteristics of the datasets, such as clusters that overlap or clusters containing subclusters, will most often increase the level of difficulty of the task. Thus, the methods used to detect similarities and the parameter selection of the partition algorithm have a major impact on the quality of the groups and the identification of their optimal number. Given that each dataset constitutes a rather distinct case, validity indices are indicators introduced to address the problem of selecting such an optimal number of clusters. In this work, an extensive set of well-known validity indices, based on the approach of the so-called relative criteria, are examined comparatively. A total of 26 cluster validation measures were investigated in two distinct case studies: one in real-world and one in artificially generated data. To ensure a certain degree of difficulty, both real-world and generated data were selected to exhibit variations and inhomogeneity. Each of the indices is being deployed under the schemes of 9 different clustering methods, which incorporate 5 different distance metrics. All results are presented in various explanatory forms.


2012 ◽  
Vol 11 (3) ◽  
pp. 396-398 ◽  
Author(s):  
Jian Wang ◽  
Na Zhao ◽  
Wei Du ◽  
Yang Zhao ◽  
Ye Qian ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document