The Development of Topic Model Based on Beta-Negative Binomial Process
. Topic Model is one of the important subfields in Data Mining, which has been developed very quickly and has been applicated in many fields in recent years. Many researchers have been engaged in this field. In this paper, we introduce the BNB process based on Beta and Negative Binomial distribution, using the hierarchical distribution instead of Dirichlet in LDA. And we give the expression of parameter estimation used by Gibbs sampling. Then, BNB process is applicated in the text topic classification. We design experiments to decide the numbers of topics and compare the BNB process with LDA. Experiment results show that the BNB process has better performance over LDA in English Dataset, but they have almost the same result in Chinese micro-blog topic classification. Finally we analyze the problem and give the idea in further research.