A fuzzy clustering algorithm based on hybrid surrogate model

Data clustering based on regression relationship is able to improve the validity and reliability of the engineering data mining results. Surrogate models are widely used to evaluate the regression relationship in the process of data clustering, but there is no single surrogate model that always performs the best for all the regression relationships. To solve this issue, a fuzzy clustering algorithm based on hybrid surrogate model is proposed in this work. The proposed algorithm is based on the framework of fuzzy c-means algorithm, in which the differences between the clusters are evaluated by the regression relationship instead of Euclidean distance. Several surrogate models are simultaneously utilized to evaluate the regression relationship through a weighting scheme. The clustering objective function is designed based on the prediction errors of multiple surrogate models, and an alternating optimization method is proposed to minimize it to obtain the memberships of data and the weights of surrogate models. The synthetic datasets are used to test single surrogate model-based fuzzy clustering algorithms to choose the surrogate models used in the proposed algorithm. It is found that support vector regression-based and response surface-based fuzzy clustering algorithms show competitive clustering performance, so support vector regression and response surface are used to construct the hybrid surrogate model in the proposed algorithm. The experimental results of synthetic datasets and engineering datasets show that the proposed algorithm can provide more competitive clustering performance compared with single surrogate model-based fuzzy clustering algorithms for the datasets with regression relationships.

Download Full-text

AN EXTENDED FUZZY CLUSTERING ALGORITHM AND ITS APPLICATION

Journal of Circuits System and Computers ◽

10.1142/s0218126695000175 ◽

1995 ◽

Vol 05 (02) ◽

pp. 239-259

Author(s):

SU HWAN KIM ◽

SEON WOOK KIM ◽

TAE WON RHEE

Keyword(s):

Fuzzy Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Main Memory ◽

Color Image Segmentation ◽

Occurrence Rate ◽

Secondary Memory ◽

Worst Case ◽

Memory Space ◽

Fuzzy Clustering Algorithm

For data analyses, it is very important to combine data with similar attribute values into a categorically homogeneous subset, called a cluster, and this technique is called clustering. Generally crisp clustering algorithms are weak in noise, because each datum should be assigned to exactly one cluster. In order to solve the problem, a fuzzy c-means, a fuzzy maximum likelihood estimation, and an optimal fuzzy clustering algorithms in the fuzzy set theory have been proposed. They, however, require a lot of processing time because of exhaustive iteration with an amount of data and their memberships. Especially large memory space results in the degradation of performance in real-time processing applications, because it takes too much time to swap between the main memory and the secondary memory. To overcome these limitations, an extended fuzzy clustering algorithm based on an unsupervised optimal fuzzy clustering algorithm is proposed in this paper. This algorithm assigns a weight factor to each distinct datum considering its occurrence rate. Also, the proposed extended fuzzy clustering algorithm considers the degree of importances of each attribute, which determines the characteristics of the data. The worst case is that the whole data has an uniformly normal distribution, which means the importance of all attributes are the same. The proposed extended fuzzy clustering algorithm has better performance than the unsupervised optimal fuzzy clustering algorithm in terms of memory space and execution time in most cases. For simulation the proposed algorithm is applied to color image segmentation. Also automatic target detection and multipeak detection are considered as applications. These schemes can be applied to any other fuzzy clustering algorithms.

Download Full-text

Study of Combined Fuzzy Clustering Algorithm Based on F-Statistics Hierarchy Clustering

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.29-32.802 ◽

2010 ◽

Vol 29-32 ◽

pp. 802-808

Author(s):

Min Min

Keyword(s):

Fuzzy Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Cluster Center ◽

Evaluation Data ◽

Fuzzy Clustering Algorithm ◽

Initial Cluster ◽

The Common ◽

Common Problems ◽

F Statistics

On analyzing the common problems in fuzzy clustering algorithms, we put forward the combined fuzzy clustering one, which will automatically generate a reasonable clustering numbers and initial cluster center. This clustering algorithm has been tested by real evaluation data of teaching designs. The result proves that the combined fuzzy clustering based on F-statistic is more effective.

Download Full-text

Service Partition Method Based on Particle Swarm Fuzzy Clustering

Wireless Communications and Mobile Computing ◽

10.1155/2021/7225552 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Hong Xia ◽

Qingyi Dong ◽

Hui Gao ◽

Yanping Chen ◽

ZhongMin Wang

Keyword(s):

Fuzzy Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Particle Swarm ◽

Cluster Center ◽

Fuzzy Clustering Algorithm ◽

Partition Method ◽

Service Data ◽

Optimal Cluster ◽

Better Than

It is difficult to accurately classify a service into specific service clusters for the multirelationships between services. To solve this problem, this paper proposes a service partition method based on particle swarm fuzzy clustering, which can effectively consider multirelationships between services by using a fuzzy clustering algorithm. Firstly, the algorithm for automatically determining the number of clusters is to determine the number of service clusters based on the density of the service core point. Secondly, the fuzzy c -means combined with particle swarm optimization algorithm to find the optimal cluster center of the service. Finally, the fuzzy clustering algorithm uses the improved Gram-cosine similarity to obtain the final results. Extensive experiments on real web service data show that our method is better than mainstream clustering algorithms in accuracy.

Download Full-text

Short-term Forecasting of PV Power Based on the Fuzzy Clustering Algorithm and Support Vector Machine in Smart Distribution Planning

2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) ◽

10.1109/iaeac.2018.8577853 ◽

2018 ◽

Author(s):

Li Shan ◽

Xin Peizhe ◽

Zou Guohui

Keyword(s):

Support Vector Machine ◽

Fuzzy Clustering ◽

Clustering Algorithm ◽

Support Vector ◽

Short Term ◽

Distribution Planning ◽

Fuzzy Clustering Algorithm ◽

Short Term Forecasting

Download Full-text

A Corporate Credit Rating Model Using Support Vector Domain Combined with Fuzzy Clustering Algorithm

Mathematical Problems in Engineering ◽

10.1155/2012/302624 ◽

2012 ◽

Vol 2012 ◽

pp. 1-20 ◽

Cited By ~ 7

Author(s):

Xuesong Guo ◽

Zhengwei Zhu ◽

Jia Shi

Keyword(s):

Artificial Intelligence ◽

Fuzzy Clustering ◽

Clustering Algorithm ◽

Credit Rating ◽

Computational Cost ◽

Support Vector ◽

Artificial Intelligence Techniques ◽

Corporate Credit ◽

Fuzzy Clustering Algorithm ◽

Corporate Credit Rating

Corporate credit-rating prediction using statistical and artificial intelligence techniques has received considerable attentions in the literature. Different from the thoughts of various techniques for adopting support vector machines as binary classifiers originally, a new method, based on support vector domain combined with fuzzy clustering algorithm for multiclassification, is proposed in the paper to accomplish corporate credit rating. By data preprocessing using fuzzy clustering algorithm, only the boundary data points are selected as training samples to accomplish support vector domain specification to reduce computational cost and also achieve better performance. To validate the proposed methodology, real-world cases are used for experiments, with results compared with conventional multiclassification support vector machine approaches and other artificial intelligence techniques. The results show that the proposed model improves the performance of corporate credit-rating with less computational consumption.

Download Full-text

Coarse-fine surrogate model driven multiobjective evolutionary fuzzy clustering algorithm with dual memberships for noisy image segmentation

Applied Soft Computing ◽

10.1016/j.asoc.2021.107778 ◽

2021 ◽

pp. 107778

Author(s):

Feng Zhao ◽

Feifan Liu ◽

Chaoqi Li ◽

Hanqiang Liu ◽

Rong Lan ◽

...

Keyword(s):

Image Segmentation ◽

Fuzzy Clustering ◽

Surrogate Model ◽

Clustering Algorithm ◽

Noisy Image ◽

Model Driven ◽

Fuzzy Clustering Algorithm

Download Full-text

A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance

International Journal of Ambient Computing and Intelligence ◽

10.4018/ijaci.2016070103 ◽

2016 ◽

Vol 7 (2) ◽

pp. 47-74 ◽

Cited By ~ 17

Author(s):

Duggirala Raja Kishor ◽

N.B. Venkateswarlu

Keyword(s):

Em Algorithm ◽

Expectation Maximization ◽

Execution Time ◽

Data Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Performance Criteria ◽

The Novel ◽

The Em Algorithm ◽

Synthetic Datasets

Expectation Maximization (EM) is a widely employed mixture model-based data clustering algorithm and produces exceptionally good results. However, many researchers reported that the EM algorithm requires huge computational efforts than other clustering algorithms. This paper presents an algorithm for the novel hybridization of EM and K-Means techniques for achieving better clustering performance (NovHbEMKM). This algorithm first performs K-Means and then using these results it performs EM and K-Means in the alternative iterations. Along with the NovHbEMKM, experiments are carried out with the algorithms for EM, EM using the results of K-Means and Cluster package of Purdue University. Experiments are carried out with datasets from UCI ML repository and synthetic datasets. Execution time, Clustering Fitness and Sum of Squared Errors (SSE) are computed as performance criteria. In all the experiments the proposed NovHbEMKM algorithm is taking less execution time by producing results with higher clustering fitness and lesser SSE than other algorithms including the Cluster package.

Download Full-text