A dirichlet multinomial mixture model-based approach for short text clustering

Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '14 ◽

10.1145/2623330.2623715 ◽

2014 ◽

Author(s):

Jianhua Yin ◽

Jianyong Wang

Keyword(s):

Mixture Model ◽

Text Clustering ◽

Model Based ◽

Short Text Clustering

Download Full-text

A Novel Short Text Clustering Model Based on Grey System Theory

Arabian Journal for Science and Engineering ◽

10.1007/s13369-019-04191-0 ◽

2019 ◽

Vol 45 (4) ◽

pp. 2865-2882

Author(s):

Hüseyin Fidan ◽

Mehmet Erkan Yuksel

Keyword(s):

System Theory ◽

Text Clustering ◽

Grey System Theory ◽

Grey System ◽

Model Based ◽

Clustering Model ◽

Short Text Clustering

Download Full-text

Short Text Clustering Using Generalized Dirichlet Multinomial Mixture Model

Recent Challenges in Intelligent Information and Database Systems - Communications in Computer and Information Science ◽

10.1007/978-981-16-1685-3_13 ◽

2021 ◽

pp. 149-161

Author(s):

Samar Hannachi ◽

Fatma Najar ◽

Nizar Bouguila

Keyword(s):

Mixture Model ◽

Text Clustering ◽

Short Text Clustering ◽

Generalized Dirichlet

Download Full-text

Short text clustering based on Pitman-Yor process mixture model

Applied Intelligence ◽

10.1007/s10489-017-1055-4 ◽

2017 ◽

Vol 48 (7) ◽

pp. 1802-1812 ◽

Author(s):

Jipeng Qiang ◽

Yun Li ◽

Yunhao Yuan ◽

Xindong Wu

Keyword(s):

Mixture Model ◽

Text Clustering ◽

Short Text Clustering

Download Full-text

GBTM: A Short Text Clustering Model Based on Word Pairing

Proceedings of 2017 the 7th International Workshop on Computer Science and Engineering ◽

10.18178/wcse.2017.06.062 ◽

2017 ◽

Keyword(s):

Text Clustering ◽

Model Based ◽

Clustering Model ◽

Short Text Clustering ◽

Download Full-text

Confronting Sparseness and High Dimensionality in Short Text Clustering via Feature Vector Projections

2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI) ◽

10.1109/ictai50040.2020.00129 ◽

2020 ◽

Author(s):

Leonidas Akritidis ◽

Miltiadis Alamaniotis ◽

Athanasios Fevgas ◽

Panayiotis Bozanis

Keyword(s):

Feature Vector ◽

Text Clustering ◽

High Dimensionality ◽

Short Text Clustering

Download Full-text

Short-Text Clustering using Statistical Semantics

Proceedings of the 24th International Conference on World Wide Web - WWW '15 Companion ◽

10.1145/2740908.2742474 ◽

2015 ◽

Author(s):

Sepideh Seifzadeh ◽

Ahmed K. Farahat ◽

Mohamed S. Kamel ◽

Fakhri Karray

Keyword(s):

Text Clustering ◽

Short Text Clustering

Download Full-text

Short Text Clustering Algorithms for Weibo Topic Detection

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.971-973.1747 ◽

2014 ◽

Vol 971-973 ◽

pp. 1747-1751 ◽

Author(s):

Lei Zhang ◽

Hai Qiang Chen ◽

Wei Jie Li ◽

Yan Zhao Liu ◽

Run Pu Wu

Keyword(s):

Text Analysis ◽

Semantic Information ◽

Clustering Algorithms ◽

Text Clustering ◽

Massive Data ◽

Topic Detection ◽

Clustering Methods ◽

Short Text Clustering ◽

Application Requirements

Text clustering is a popular research topic in the field of text mining, and now there are a lot of text clustering methods catering to different application requirements. Currently, Weibo data acquisition is through the API provided by big microblogging platforms. In this essay, we will discuss the algorithm of extracting popular topics posted by Weibo users by text clustering after massive data collection. Due to the fact that traditional text analysis may not be applicable to short texts used in Weibo, text clustering shall be carried out through combining multiple posts into long texts, based on their features (forwards, comments and followers, etc.). Either frequency-based or density-based short text clustering can deliver in most cases. The former is applicable to find hot topics from large Weibo short texts, and the latter is applicable to find abnormal contents. Both the two methods use semantic information to improve the accuracy of clustering. Besides, they improve the performance of clustering through the parallelism.

Download Full-text

A General Bio-inspired Method to Improve the Short-Text Clustering Task

Computational Linguistics and Intelligent Text Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-12116-6_56 ◽

2010 ◽

pp. 661-672 ◽

Author(s):

Diego Ingaramo ◽

Marcelo Errecalde ◽

Paolo Rosso

Keyword(s):

Text Clustering ◽

Short Text Clustering

Download Full-text

Research on Chinese Short Text Clustering Ensemble via Convolutional Neural Networks

Lecture Notes in Electrical Engineering - Artificial Intelligence in China ◽

10.1007/978-981-15-0187-6_74 ◽

2020 ◽

pp. 622-628

Author(s):

Haowen Wan ◽

Bo Ning ◽

Xiaoyu Tao ◽

Jianfei Long

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Text Clustering ◽

Clustering Ensemble ◽

Short Text Clustering

Download Full-text

Collapsed Gibbs Sampling of Beta-Liouville Multinomial for Short Text Clustering

Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices - Lecture Notes in Computer Science ◽

10.1007/978-3-030-79457-6_48 ◽

2021 ◽

pp. 564-571

Author(s):

Samar Hannachi ◽

Fatma Najar ◽

Koffi Eddy Ihou ◽

Nizar Bouguila

Keyword(s):

Gibbs Sampling ◽

Text Clustering ◽

Short Text Clustering ◽

Collapsed Gibbs Sampling

Download Full-text