clustering ensemble Latest Research Papers

Clustering ensemble via structured hypergraph learning

Information Fusion ◽

10.1016/j.inffus.2021.09.003 ◽

2022 ◽

Vol 78 ◽

pp. 171-179

Author(s):

Peng Zhou ◽

Xia Wang ◽

Liang Du ◽

Xuejun Li

Keyword(s):

Clustering Ensemble ◽

Hypergraph Learning

Random Sample Partition-Based Clustering Ensemble Algorithm for Big Data

10.1109/bigdata52589.2021.9671297 ◽

2021 ◽

Author(s):

Xueqin Du ◽

Yulin He ◽

Joshua Zhexue Huang

Keyword(s):

Big Data ◽

Random Sample ◽

Clustering Ensemble ◽

Ensemble Algorithm

Correction to: Clustering Ensemble Based on Sample’s Certainty

Cognitive Computation ◽

10.1007/s12559-021-09957-z ◽

2021 ◽

Author(s):

Xia Ji ◽

Shuaishuai Liu ◽

Peng Zhao ◽

Xuejun Li ◽

Qiong Liu

Keyword(s):

Clustering Ensemble

Improving Clustering Methods By Exploiting Richness Of Text Data

10.26686/wgtn.17019287.v1 ◽

2021 ◽

Author(s):

◽

Abdul Wahid

Keyword(s):

Evolutionary Algorithm ◽

State Of The Art ◽

Ensemble Methods ◽

Text Clustering ◽

Clustering Methods ◽

Clustering Method ◽

Clustering Ensemble ◽

Text Data ◽

Multi Objective ◽

User Queries

<p>Clustering is an unsupervised machine learning technique, which involves discovering different clusters (groups) of similar objects in unlabeled data and is generally considered to be a NP hard problem. Clustering methods are widely used in a verity of disciplines for analyzing different types of data, and a small improvement in clustering method can cause a ripple effect in advancing research of multiple fields. Clustering any type of data is challenging and there are many open research questions. The clustering problem is exacerbated in the case of text data because of the additional challenges such as issues in capturing semantics of a document, handling rich features of text data and dealing with the well known problem of the curse of dimensionality. In this thesis, we investigate the limitations of existing text clustering methods and address these limitations by providing five new text clustering methods--Query Sense Clustering (QSC), Dirichlet Weighted K-means (DWKM), Multi-View Multi-Objective Evolutionary Algorithm (MMOEA), Multi-objective Document Clustering (MDC) and Multi-Objective Multi-View Ensemble Clustering (MOMVEC). These five new clustering methods showed that the use of rich features in text clustering methods could outperform the existing state-of-the-art text clustering methods. The first new text clustering method QSC exploits user queries (one of the rich features in text data) to generate better quality clusters and cluster labels. The second text clustering method DWKM uses probability based weighting scheme to formulate a semantically weighted distance measure to improve the clustering results. The third text clustering method MMOEA is based on a multi-objective evolutionary algorithm. MMOEA exploits rich features to generate a diverse set of candidate clustering solutions, and forms a better clustering solution using a cluster-oriented approach. The fourth and the fifth text clustering method MDC and MOMVEC address the limitations of MMOEA. MDC and MOMVEC differ in terms of the implementation of their multi-objective evolutionary approaches. All five methods are compared with existing state-of-the-art methods. The results of the comparisons show that the newly developed text clustering methods out-perform existing methods by achieving up to 16\% improvement for some comparisons. In general, almost all newly developed clustering algorithms showed statistically significant improvements over other existing methods. The key ideas of the thesis highlight that exploiting user queries improves Search Result Clustering(SRC); utilizing rich features in weighting schemes and distance measures improves soft subspace clustering; utilizing multiple views and a multi-objective cluster oriented method improves clustering ensemble methods; and better evolutionary operators and objective functions improve multi-objective evolutionary clustering ensemble methods. The new text clustering methods introduced in this thesis can be widely applied in various domains that involve analysis of text data. The contributions of this thesis which include five new text clustering methods, will not only help researchers in the data mining field but also to help a wide range of researchers in other fields.</p>

Improving Clustering Methods By Exploiting Richness Of Text Data

10.26686/wgtn.17019287 ◽

2021 ◽

Author(s):

◽

Abdul Wahid

Keyword(s):

Evolutionary Algorithm ◽

State Of The Art ◽

Ensemble Methods ◽

Text Clustering ◽

Clustering Methods ◽

Clustering Method ◽

Clustering Ensemble ◽

Text Data ◽

Multi Objective ◽

User Queries

<p>Clustering is an unsupervised machine learning technique, which involves discovering different clusters (groups) of similar objects in unlabeled data and is generally considered to be a NP hard problem. Clustering methods are widely used in a verity of disciplines for analyzing different types of data, and a small improvement in clustering method can cause a ripple effect in advancing research of multiple fields. Clustering any type of data is challenging and there are many open research questions. The clustering problem is exacerbated in the case of text data because of the additional challenges such as issues in capturing semantics of a document, handling rich features of text data and dealing with the well known problem of the curse of dimensionality. In this thesis, we investigate the limitations of existing text clustering methods and address these limitations by providing five new text clustering methods--Query Sense Clustering (QSC), Dirichlet Weighted K-means (DWKM), Multi-View Multi-Objective Evolutionary Algorithm (MMOEA), Multi-objective Document Clustering (MDC) and Multi-Objective Multi-View Ensemble Clustering (MOMVEC). These five new clustering methods showed that the use of rich features in text clustering methods could outperform the existing state-of-the-art text clustering methods. The first new text clustering method QSC exploits user queries (one of the rich features in text data) to generate better quality clusters and cluster labels. The second text clustering method DWKM uses probability based weighting scheme to formulate a semantically weighted distance measure to improve the clustering results. The third text clustering method MMOEA is based on a multi-objective evolutionary algorithm. MMOEA exploits rich features to generate a diverse set of candidate clustering solutions, and forms a better clustering solution using a cluster-oriented approach. The fourth and the fifth text clustering method MDC and MOMVEC address the limitations of MMOEA. MDC and MOMVEC differ in terms of the implementation of their multi-objective evolutionary approaches. All five methods are compared with existing state-of-the-art methods. The results of the comparisons show that the newly developed text clustering methods out-perform existing methods by achieving up to 16\% improvement for some comparisons. In general, almost all newly developed clustering algorithms showed statistically significant improvements over other existing methods. The key ideas of the thesis highlight that exploiting user queries improves Search Result Clustering(SRC); utilizing rich features in weighting schemes and distance measures improves soft subspace clustering; utilizing multiple views and a multi-objective cluster oriented method improves clustering ensemble methods; and better evolutionary operators and objective functions improve multi-objective evolutionary clustering ensemble methods. The new text clustering methods introduced in this thesis can be widely applied in various domains that involve analysis of text data. The contributions of this thesis which include five new text clustering methods, will not only help researchers in the data mining field but also to help a wide range of researchers in other fields.</p>

Weighted Clustering Ensemble: A Review

Pattern Recognition ◽

10.1016/j.patcog.2021.108428 ◽

2021 ◽

pp. 108428

Author(s):

Mimi Zhang

Keyword(s):

Clustering Ensemble ◽

Weighted Clustering

A Bi-directional Fuzzy C-Means Clustering Ensemble Algorithm Considering Local Information

International Journal of Computational Intelligence Systems ◽

10.1007/s44196-021-00014-z ◽

2021 ◽

Vol 14 (1) ◽

Author(s):

Chunhua Ren ◽

Linfu Sun

Keyword(s):

Clustering Algorithms ◽

Real Data ◽

Local Information ◽

Data Sets ◽

Clustering Ensemble ◽

K Nearest Neighbors ◽

Fuzzy C Means ◽

Clustering Quality ◽

Fuzzy C Means Clustering ◽

Fcm Clustering

AbstractThe classic Fuzzy C-means (FCM) algorithm has limited clustering performance and is prone to misclassification of border points. This study offers a bi-directional FCM clustering ensemble approach that takes local information into account (LI_BIFCM) to overcome these challenges and increase clustering quality. First, various membership matrices are created after running FCM multiple times, based on the randomization of the initial cluster centers, and a vertical ensemble is performed using the maximum membership principle. Second, after each execution of FCM, multiple local membership matrices of the sample points are created using multiple K-nearest neighbors, and a horizontal ensemble is performed. Multiple horizontal ensembles can be created using multiple FCM clustering. Finally, the final clustering results are obtained by combining the vertical and horizontal clustering ensembles. Twelve data sets were chosen for testing from both synthetic and real data sources. The LI_BIFCM clustering performance outperformed four traditional clustering algorithms and three clustering ensemble algorithms in the experiments. Furthermore, the final clustering results has a weak correlation with the bi-directional cluster ensemble parameters, indicating that the suggested technique is robust.

A multi-level consensus function clustering ensemble

Soft Computing ◽

10.1007/s00500-021-06092-7 ◽

2021 ◽

Vol 25 (21) ◽

pp. 13147-13165

Author(s):

Kim-Hung Pho ◽

Hamidreza Akbarzadeh ◽

Hamid Parvin ◽

Samad Nejatian ◽

Hamid Alinejad-Rokny

Keyword(s):

Clustering Ensemble ◽

Consensus Function ◽

Multi Level

Hybrid genetic model for clustering ensemble

Knowledge-Based Systems ◽

10.1016/j.knosys.2021.107457 ◽

2021 ◽

pp. 107457

Author(s):

Wenlu Yang ◽

Yinghui Zhang ◽

Hongjun Wang ◽

Ping Deng ◽

Tianrui Li

Keyword(s):

Genetic Model ◽

Clustering Ensemble

From clustering to clustering ensemble selection: A review

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2021.104388 ◽

2021 ◽

Vol 104 ◽

pp. 104388

Author(s):

Keyvan Golalipour ◽

Ebrahim Akbari ◽

Seyed Saeed Hamidi ◽

Malrey Lee ◽

Rasul Enayatifar

Keyword(s):

Clustering Ensemble ◽

Ensemble Selection

clustering ensemble
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Clustering ensemble via structured hypergraph learning

Random Sample Partition-Based Clustering Ensemble Algorithm for Big Data

Correction to: Clustering Ensemble Based on Sample’s Certainty

Improving Clustering Methods By Exploiting Richness Of Text Data

Improving Clustering Methods By Exploiting Richness Of Text Data

Weighted Clustering Ensemble: A Review

A Bi-directional Fuzzy C-Means Clustering Ensemble Algorithm Considering Local Information

A multi-level consensus function clustering ensemble

Hybrid genetic model for clustering ensemble

From clustering to clustering ensemble selection: A review

Export Citation Format

clustering ensembleRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Clustering ensemble via structured hypergraph learning

Random Sample Partition-Based Clustering Ensemble Algorithm for Big Data

Correction to: Clustering Ensemble Based on Sample’s Certainty

Improving Clustering Methods By Exploiting Richness Of Text Data

Improving Clustering Methods By Exploiting Richness Of Text Data

Weighted Clustering Ensemble: A Review

A Bi-directional Fuzzy C-Means Clustering Ensemble Algorithm Considering Local Information

A multi-level consensus function clustering ensemble

Hybrid genetic model for clustering ensemble

From clustering to clustering ensemble selection: A review

clustering ensemble
Recently Published Documents