Document Clustering Algorithm Based on Tree-Structured Growing Self-Organizing Feature Map

Author(s):  
Xiaoshen Zheng ◽  
Wenling Liu ◽  
Pilian He ◽  
Weidi Dai
IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 115914-115925 ◽  
Author(s):  
Ping Yang ◽  
Dan Wang ◽  
Zhuojun Wei ◽  
Xiaolin Du ◽  
Tong Li

2021 ◽  
Vol 12 (4) ◽  
pp. 169-185
Author(s):  
Saida Ishak Boushaki ◽  
Omar Bendjeghaba ◽  
Nadjet Kamel

Clustering is an important unsupervised analysis technique for big data mining. It finds its application in several domains including biomedical documents of the MEDLINE database. Document clustering algorithms based on metaheuristics is an active research area. However, these algorithms suffer from the problems of getting trapped in local optima, need many parameters to adjust, and the documents should be indexed by a high dimensionality matrix using the traditional vector space model. In order to overcome these limitations, in this paper a new documents clustering algorithm (ASOS-LSI) with no parameters is proposed. It is based on the recent symbiotic organisms search metaheuristic (SOS) and enhanced by an acceleration technique. Furthermore, the documents are represented by semantic indexing based on the famous latent semantic indexing (LSI). Conducted experiments on well-known biomedical documents datasets show the significant superiority of ASOS-LSI over five famous algorithms in terms of compactness, f-measure, purity, misclassified documents, entropy, and runtime.


2014 ◽  
Vol 556-562 ◽  
pp. 3945-3948
Author(s):  
Xin Qing Geng ◽  
Hong Yan Yang ◽  
Feng Mei Tao

This paper applies the dynamic self-organizing maps algorithm to determining the number of clustering. The text eigenvector is acquired based on the vector space model (VSM) and TF.IDF method. The number of clustering acquired by the dynamic self-organizing maps. The threshold GT control the network’s growth.Compared to the traditional fuzzy clustering algorithm, the present algorithm possesses higher precision. The example demonstrates the effectiveness of the present algorithm.


Sign in / Sign up

Export Citation Format

Share Document