A novel text clustering algorithm treated attributes differently

Web text exists non-certain and non-structure contents ,and it is difficult to cluster the text by normal classification methods. We propose a web text clustering algorithm based on fuzzy set to increase the computing accuracy with the web text. After abstracting the key words of the text, we can look it as attributes and design the fuzzy algorithm to decide the membership of the words. The algorithm can improve the algorithm complexity of time and space, increase the robustness comparing to the normal algorithm. To test the accuracy and efficiency of the algorithm, we take the comparative experiment between pattern clustering and our algorithm. The experiment shows that our method has a better result.

Download Full-text

Short Text Clustering Algorithm with Feature Keyword Expansion

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.532-533.1716 ◽

2012 ◽

Vol 532-533 ◽

pp. 1716-1720 ◽

Cited By ~ 3

Author(s):

Chun Xia Jin ◽

Hai Yan Zhou ◽

Qiu Chan Bai

Keyword(s):

Clustering Algorithm ◽

Text Clustering ◽

Experimental Results ◽

Semantic Features ◽

Short Text ◽

Clustering Quality ◽

Short Text Clustering

To solve the problem of sparse keywords and similarity drift in short text segments, this paper proposes short text clustering algorithm with feature keyword expansion (STCAFKE). The method can realize short text clustering by expanding feature keyword based on HowNet and combining K-means algorithm and density algorithm. It may add the number of text keyword with feature keyword expansion and increase text semantic features to realize short text clustering. Experimental results show that this algorithm has increased the short text clustering quality on precision and recall.

Download Full-text

Short Text Clustering Algorithm Based on Frequent Closed Word Sets

2019 12th International Symposium on Computational Intelligence and Design (ISCID) ◽

10.1109/iscid.2019.10144 ◽

2019 ◽

Author(s):

Chunxia Jin ◽

Qiuchan Bai

Keyword(s):

Clustering Algorithm ◽

Text Clustering ◽

Short Text ◽

Short Text Clustering

Download Full-text

The Study of an Improved Text Clustering Algorithm for Self-Organizing Maps

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/428/1/012024 ◽

2020 ◽

Vol 428 ◽

pp. 012024

Author(s):

Baolong Zhang ◽

Zemin Hou

Keyword(s):

Clustering Algorithm ◽

Text Clustering ◽

Self Organizing Maps ◽

Self Organizing

Download Full-text

A Text Clustering Algorithms Based on Hidden Markov Model

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.135-136.1155 ◽

2011 ◽

Vol 135-136 ◽

pp. 1155-1158

Author(s):

Wei Li ◽

Mei An Li

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Transfer Matrix ◽

Clustering Algorithm ◽

Hidden Markov ◽

Probability Model ◽

Clustering Algorithms ◽

Text Clustering ◽

Global Perspective ◽

Abstract Structure

Based on the probability model of clustering algorithm constructs a model for each cluster, calculate probability of every text falls in different models to decide text belongs to which cluster, conveniently in global Angle represents abstract structure of clusters. In this paper combining the hidden Markov model and k - means clustering algorithm realize text clustering, first produces first clustering results by k - means algorithm, as the initial probability model of a hidden Markov model ,constructed probability transfer matrix prediction every step of clustering iteration, when subtraction value of two probability transfer matrix is 0, clustering end. This algorithm can in global perspective every cluster of document clustering process, to avoid the repetition of clustering process, effectively improve the clustering algorithm .

Download Full-text