scholarly journals Fuzzy C-Means Clustering Algorithm with Multiple Fuzzification Coefficients

Algorithms ◽  
2020 ◽  
Vol 13 (7) ◽  
pp. 158
Author(s):  
Tran Dinh Khang ◽  
Nguyen Duc Vuong ◽  
Manh-Kien Tran ◽  
Michael Fowler

Clustering is an unsupervised machine learning technique with many practical applications that has gathered extensive research interest. Aside from deterministic or probabilistic techniques, fuzzy C-means clustering (FCM) is also a common clustering technique. Since the advent of the FCM method, many improvements have been made to increase clustering efficiency. These improvements focus on adjusting the membership representation of elements in the clusters, or on fuzzifying and defuzzifying techniques, as well as the distance function between elements. This study proposes a novel fuzzy clustering algorithm using multiple different fuzzification coefficients depending on the characteristics of each data sample. The proposed fuzzy clustering method has similar calculation steps to FCM with some modifications. The formulas are derived to ensure convergence. The main contribution of this approach is the utilization of multiple fuzzification coefficients as opposed to only one coefficient in the original FCM algorithm. The new algorithm is then evaluated with experiments on several common datasets and the results show that the proposed algorithm is more efficient compared to the original FCM as well as other clustering methods.

2013 ◽  
Vol 300-301 ◽  
pp. 735-739 ◽  
Author(s):  
Li Jen Kao ◽  
Yo Ping Huang

Fuzzy C-Means (FCM) clustering algorithm can be used to classify hand gesture images in human-robot interaction application. However, FCM algorithm does not work well on those images in which noises exist. The noises or outliers make all the cluster centers towards to the center of all points. In this paper, a new FCM algorithm is proposed to detect the outliers and then make the outliers have no influence on centers calculation. The experiment shows that the new FCM algorithm can get more accurate centers than the traditional FCM algorithm.


2018 ◽  
Vol 29 (1) ◽  
pp. 497-514
Author(s):  
A.K. Naveena ◽  
N.K. Narayanan

Abstract The main intention of this research is to develop a novel ranking measure for content-based image retrieval system. Owing to the achievement of data retrieval, most commercial search engines still utilize a text-based search approach for image search by utilizing encompassing textual information. As the text information is, in some cases, noisy and even inaccessible, the drawback of such a recovery strategy is to the extent that it cannot depict the contents of images precisely, subsequently hampering the execution of image search. In order to improve the performance of image search, we propose in this work a novel algorithm for improving image search through a multi-kernel fuzzy c-means (MKFCM) algorithm. In the initial step of our method, images are retrieved using four-level discrete wavelet transform-based features and the MKFCM clustering algorithm. Next, the retrieved images are analyzed using fuzzy c-means clustering methods, and the rank of the results is adjusted according to the distance of a cluster from a query. To improve the ranking performance, we combine the retrieved result and ranking result. At last, we obtain the ranked retrieved images. In addition, we analyze the effects of different clustering methods. The effectiveness of the proposed methodology is analyzed with the help of precision, recall, and F-measures.


Author(s):  
Yanwei Zhao ◽  
Huijun Tang ◽  
Nan Su ◽  
Wanliang Wang

Design for product adaptability is one of the techniques used to provide customers with products that exactly meet their requirements. Clustering methods have been used extensively in the study of product adaptability design. Of the clustering methods, the fuzzy clustering method is the most widely in the design field. The three main kinds of fuzzy clustering methods are the transitive closure method, the dynamic direct method and the maximum tree method. The dynamic direct clustering method has been found to produce design solutions with the lowest cost. In this paper, a new approach for obtaining adaptable product designs using the clustering method is proposed. The method consists of three steps. Firstly, the extension distance formula is used to determine the distance between two products in a product database. The product design space and the distances between individuals are used as grouping criteria in this step. Secondly, the minimal distance between products is used to obtain the clustering index. Thirdly, the threshold value is used to divide the products in the database into groups. Customer demands and the results obtained from the adaptable function (based on the extension distance formula) are used to evaluate the fitness of the groups and their corresponding products. The product with the largest adaptable function value to demand ratio is selected product. In order to the show the advantage of using the extension-clustering method, both the extension-clustering method and the dynamic direct method are presented and compared. The comparison indicates that the extension-clustering method leads to quicker evaluations of design alternatives and results that more closely match customers’ demands. An example of the adaptable design of circular saws tools is used to demonstrate that with the extension-clustering design method a high variety of intelligent configurations can be obtained with significant rapidity.


Author(s):  
Kei Kitajima ◽  
Yasunori Endo ◽  
Yukihiro Hamasuna ◽  
◽  
◽  
...  

Clustering is a method of data analysis without the use of supervised data. Even-sized clustering based on optimization (ECBO) is a clustering algorithm that focuses on cluster size with the constraints that cluster sizes must be the same. However, this constraints makes ECBO inconvenient to apply in cases where a certain margin of cluster size is allowed. It is believed that this issue can be overcome by applying a fuzzy clustering method. Fuzzy clustering can represent the membership of data to clusters more flexible. In this paper, we propose a new even-sized clustering algorithm based on fuzzy clustering and verify its effectiveness through numerical examples.


Algorithms ◽  
2021 ◽  
Vol 14 (9) ◽  
pp. 258
Author(s):  
Tran Dinh Khang ◽  
Manh-Kien Tran ◽  
Michael Fowler

Clustering is an unsupervised machine learning method with many practical applications that has gathered extensive research interest. It is a technique of dividing data elements into clusters such that elements in the same cluster are similar. Clustering belongs to the group of unsupervised machine learning techniques, meaning that there is no information about the labels of the elements. However, when knowledge of data points is known in advance, it will be beneficial to use a semi-supervised algorithm. Within many clustering techniques available, fuzzy C-means clustering (FCM) is a common one. To make the FCM algorithm a semi-supervised method, it was proposed in the literature to use an auxiliary matrix to adjust the membership grade of the elements to force them into certain clusters during the computation. In this study, instead of using the auxiliary matrix, we proposed to use multiple fuzzification coefficients to implement the semi-supervision component. After deriving the proposed semi-supervised fuzzy C-means clustering algorithm with multiple fuzzification coefficients (sSMC-FCM), we demonstrated the convergence of the algorithm and validated the efficiency of the method through a numerical example.


2014 ◽  
Vol 986-987 ◽  
pp. 1579-1582
Author(s):  
Bao Zhen Feng

In current large-scale electronic circuit devices, failure data calibration capacity is not strong and it is difficult to be precise classification and intelligent judgment. It lacks of the necessary mechanisms to eliminate the error message, bringing troubles to fault detection. In order to avoid the above defect, this paper presents a fault detection method for large-scale electronic circuit based on fuzzy clustering algorithm. Firstly, the use of means clustering method, the fault information is made initial classification. Then, using the second fuzzy clustering method make fault information filtering in different categories, in order to achieve the fault data confirmation. Experimental results show that the proposed algorithm can effectively improve the accuracy of fault detection of large-scale electronic circuit.


Author(s):  
ANNETTE KELLER ◽  
FRANK KLAWONN

We introduce an objective function-based fuzzy clustering technique that assigns one influence parameter to each single data variable for each cluster. Our method is not only suited to detect structures or groups of data that are not uniformly distributed over the structure's single domains, but gives also information about the influence of individual variables on the detected groups. In addition, our approach can be seen as a generalization of the well-known fuzzy c-means clustering algorithm.


Author(s):  
Ye. V. Bodyanskiy ◽  
A. Yu. Shafronenko ◽  
I. N. Klymova

Context. The problems of big data clustering today is a very relevant area of artificial intelligence. This task is often found in many applications related to data mining, deep learning, etc. To solve these problems, traditional approaches and methods require that the entire data sample be submitted in batch form. Objective. The aim of the work is to propose a method of fuzzy probabilistic data clustering using evolutionary optimization of cat swarm, that would be devoid of the drawbacks of traditional data clustering approaches. Method. The procedure of fuzzy probabilistic data clustering using evolutionary algorithms, for faster determination of sample extrema, cluster centroids and adaptive functions, allowing not to spend machine resources for storing intermediate calculations and do not require additional time to solve the problem of data clustering, regardless of the dimension and the method of presentation for processing. Results. The proposed data clustering algorithm based on evolutionary optimization is simple in numerical implementation, is devoid of the drawbacks inherent in traditional fuzzy clustering methods and can work with a large size of input information processed online in real time. Conclusions. The results of the experiment allow to recommend the developed method for solving the problems of automatic clustering and classification of big data, as quickly as possible to find the extrema of the sample, regardless of the method of submitting the data for processing. The proposed method of online probabilistic fuzzy data clustering based on evolutionary optimization of cat swarm is intended for use in hybrid computational intelligence systems, neuro-fuzzy systems, in training artificial neural networks, in clustering and classification problems.


Author(s):  
Masayuki Higashi ◽  
◽  
Tadafumi Kondo ◽  
Yuchi Kanzawa

This study presents a fuzzy clustering algorithm for classifying spherical data based on q-divergence. First, it is shown that a conventional method for vectorial data is equivalent to the regularization of another conventional method using q-divergence. Next, based on the knowledge that q-divergence is a generalization of Kullback-Leibler (KL)-divergence and that there is a conventional fuzzy clustering method for classifying spherical data based on KL-divergence, a fuzzy clustering algorithm for spherical data is derived based on q-divergence. This algorithm uses an optimization problem built by extending KL-divergence in the conventional method to q-divergence. Finally, some numerical experiments are conducted to verify the proposed methods.


Author(s):  
PAWEL BRZEMINSKI ◽  
WITOLD PEDRYCZ

In our study we presented an effective method for clustering of Web pages. From flat HTML files we extracted keywords, formed feature vectors as representation of Web pages and applied them to a clustering method. We took advantage of the Fuzzy C-Means clustering algorithm (FCM). We demonstrated an organized and schematic manner of data collection. Various categories of Web pages were retrieved from ODP (Open Directory Project) in order to create our datasets. The results of clustering proved that the method performs well for all datasets. Finally, we presented a comprehensive experimental study examining: the behavior of the algorithm for different input parameters, internal structure of datasets and classification experiments.


Sign in / Sign up

Export Citation Format

Share Document