Partially collapsed Gibbs sampling for latent Dirichlet allocation

Resumo A formação de governos multipartidários potencializa o risco de assimetria de informação entre principals e agentes, de maneira que os conflitos do gabinete sobre políticas se refletem no comportamento dos partidos no parlamento. Diversos estudos demonstram que o controle mútuo entre os partidos integrantes do gabinete é uma forma de compensar a perda de informação inerente à delegação. Enquanto a literatura costuma focar na fase de formulação das políticas, analisando os governos formados no Brasil entre 1995 e 2014, argumento que existe um conjunto mais diversificado de estratégias que permitem aos partidos escrutinar as políticas implementadas por seus parceiros de gabinete. Fazendo uso de análise de redes e técnicas quantitativas de análise de texto (método Gibbs Sampling, algoritmo bayesiano derivado do Latent Dirichlet allocation – LDA) mostro que, nas situações em que os portfólios ministeriais são distribuídos para atores com distintas preferências sobre políticas, os partidos intensificam o uso dos Requerimentos de Informação (RIC) para monitorar os ministérios e políticas que lhes interessam. A estrutura das redes de controle intragabinete varia em função da saliência dos ministérios: os partidos responsáveis pelos portfólios com maior dotação orçamentária são os atores com maior grau de centralidade nas redes de monitoramento mútuo.

Download Full-text

Latent Dirichlet Allocation based on Gibbs Sampling for gene function prediction

2014 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology ◽

10.1109/cibcb.2014.6845514 ◽

2014 ◽

Cited By ~ 17

Author(s):

Pietro Pinoli ◽

Davide Chicco ◽

Marco Masseroli

Keyword(s):

Gibbs Sampling ◽

Gene Function ◽

Latent Dirichlet Allocation ◽

Function Prediction ◽

Gene Function Prediction ◽

Dirichlet Allocation

Download Full-text

A more time-efficient gibbs sampling algorithm based on SparseLDA for latent dirichlet allocation

Intelligent Data Analysis ◽

10.3233/ida-173609 ◽

2018 ◽

Vol 22 (6) ◽

pp. 1227-1257

Author(s):

Xiaotang Zhou ◽

Jihong Ouyang ◽

Ximing Li

Keyword(s):

Gibbs Sampling ◽

Latent Dirichlet Allocation ◽

Sampling Algorithm ◽

Gibbs Sampling Algorithm ◽

Dirichlet Allocation

Download Full-text

Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collapsed Gibbs Sampling Inference Process

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v8i5.pp3204-3213 ◽

2018 ◽

Vol 8 (5) ◽

pp. 3204 ◽

Cited By ~ 1

Author(s):

Bambang Subeno ◽

Retno Kusumaningrum ◽

Farikhin Farikhin

Keyword(s):

Maximum Likelihood ◽

Latent Dirichlet Allocation ◽

Minimum Description Length ◽

Probability Model ◽

Optimal Number ◽

Classification Model ◽

Training Models ◽

Average Accuracy ◽

Collapsed Gibbs Sampling ◽

Dirichlet Allocation

<span lang="EN-GB">Latent Dirichlet Allocation (LDA) is a probability model for grouping hidden topics in documents by the number of predefined topics. If conducted incorrectly, determining the amount of K topics will result in limited word correlation with topics. Too large or too small number of K topics causes inaccuracies in grouping topics in the formation of training models. This study aims to determine the optimal number of corpus topics in the LDA method using the maximum likelihood and Minimum Description Length (MDL) approach. The experimental process uses Indonesian news articles with the number of documents at 25, 50, 90, and 600; in each document, the numbers of words are 3898, 7760, 13005, and 4365. The results show that the maximum likelihood and MDL approach result in the same number of optimal topics. The optimal number of topics is influenced by alpha and beta parameters. In addition, the number of documents does not affect the computation times but the number of words does. Computational times for each of those datasets are 2.9721, 6.49637, 13.2967, and 3.7152 seconds. The optimisation model has resulted in many LDA topics as a classification model. This experiment shows that the highest average accuracy is 61% with alpha 0.1 and beta 0.001.</span>

Download Full-text

Indonesian text feature extraction using gibbs sampling and mean variational inference latent dirichlet allocation

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering ◽

10.1109/qir.2017.8168448 ◽

2017 ◽

Author(s):

P M Prihatini ◽

Ikgd Putra ◽

Iad Giriantari ◽

M Sudarma

Keyword(s):

Feature Extraction ◽

Gibbs Sampling ◽

Latent Dirichlet Allocation ◽

Variational Inference ◽

Text Feature ◽

Dirichlet Allocation

Download Full-text

MR-LDA

International Journal of Grid and High Performance Computing ◽

10.4018/ijghpc.2016100106 ◽

2016 ◽

Vol 8 (4) ◽

pp. 100-113 ◽

Cited By ~ 3

Author(s):

Xiongwen Pang ◽

Benshuai Wan ◽

Huifang Li ◽

Weiwei Lin

Keyword(s):

Text Mining ◽

Gibbs Sampling ◽

Efficient Method ◽

Sampling Method ◽

Latent Dirichlet Allocation ◽

Experimental Results ◽

Generation Process ◽

Effective Solution ◽

Topic Mining ◽

Dirichlet Allocation

Latent Dirichlet Allocation(LDA) is an efficient method of text mining,but applying LDA directly to Chinese micro-blog texts will not work well because micro-blogs are more social, brief, and closely related with each other. Based on LDA, this paper proposes a Micro-blog Relation LDA model (MR-LDA), which takes the relations between Chinese micro-blog documents and other Chinese micro-blog documents into consideration to help topic mining in micro-blog. The authors extend LDA in the following two points. First, they aggregate several Chinese micro-blogs as a single micro-blog document to solve the problem of short texts. Second, they model the generation process of Chinese micro-blogs more accurately by taking relationship between micro-blog documents into consideration. MR-LDA is more suitable to model Chinese micro-blog data. Gibbs sampling method is borrowed to inference the model. Experimental results on actual datasets show that MR-LDA model can offer an effective solution to text mining for Chinese micro-blog.

Download Full-text

Evaluation of Text Semantic Features using Latent Dirichlet Allocation Model

International Journal of Performability Engineering ◽

10.23940/ijpe.20.06.p15.968978 ◽

2020 ◽

Vol 16 (6) ◽

pp. 968

Author(s):

Zhou Chunjie ◽

Li Nao ◽

Zhang Chi ◽

Yang Xiaoyu

Keyword(s):

Latent Dirichlet Allocation ◽

Semantic Features ◽

Allocation Model ◽

Latent Dirichlet Allocation Model ◽

Dirichlet Allocation

Download Full-text

Similarity Detection Using Latent Semantic Analysis Algorithm

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i8.124 ◽

2018 ◽

Vol 6 (8) ◽

pp. 102

Author(s):

Priyanka R. Patil ◽

Shital A. Patil

Keyword(s):

Latent Semantic Analysis ◽

Latent Dirichlet Allocation ◽

Semantic Analysis ◽

Mining Method ◽

Research Papers ◽

Information Measures ◽

Automated Software ◽

Day By Day ◽

Ways Of Life ◽

Dirichlet Allocation

Similarity View is an application for visually comparing and exploring multiple models of text and collection of document. Friendbook finds ways of life of clients from client driven sensor information, measures the closeness of ways of life amongst clients, and prescribes companions to clients if their ways of life have high likeness. Roused by demonstrate a clients day by day life as life records, from their ways of life are separated by utilizing the Latent Dirichlet Allocation Algorithm. Manual techniques can't be utilized for checking research papers, as the doled out commentator may have lacking learning in the exploration disciplines. For different subjective views, causing possible misinterpretations. An urgent need for an effective and feasible approach to check the submitted research papers with support of automated software. A method like text mining method come to solve the problem of automatically checking the research papers semantically. The proposed method to finding the proper similarity of text from the collection of documents by using Latent Dirichlet Allocation (LDA) algorithm and Latent Semantic Analysis (LSA) with synonym algorithm which is used to find synonyms of text index wise by using the English wordnet dictionary, another algorithm is LSA without synonym used to find the similarity of text based on index. LSA with synonym rate of accuracy is greater when the synonym are consider for matching.

Download Full-text

Partially collapsed Gibbs sampling for latent Dirichlet allocation

Fast collapsed gibbs sampling for latent dirichlet allocation

GLDA: Parallel Gibbs Sampling for Latent Dirichlet Allocation on GPU

Mecanismos de alinhamento de preferências em governos multipartidários: controle de políticas públicas no presidencialismo brasileiro

Latent Dirichlet Allocation based on Gibbs Sampling for gene function prediction

A more time-efficient gibbs sampling algorithm based on SparseLDA for latent dirichlet allocation

Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collapsed Gibbs Sampling Inference Process

Indonesian text feature extraction using gibbs sampling and mean variational inference latent dirichlet allocation

MR-LDA

Evaluation of Text Semantic Features using Latent Dirichlet Allocation Model

Similarity Detection Using Latent Semantic Analysis Algorithm

Export Citation Format