Topic Analysis of Indonesian Comment Text Using the Latent Dirichlet Allocation

Allocation Model ◽

Topic Analysis ◽

Related Text ◽

The Empirical Analysis ◽

Large Corpus ◽

Extraction of topics from large text corpuses helps improve Software Engineering (SE) processes. Latent Dirichlet Allocation (LDA) represents one of the algorithmic tools to understand, search, exploit, and summarize a large corpus of data (documents), and it is often used to perform such analysis. However, calibration of the models is computationally expensive, especially if iterating over a large number of topics. Our goal is to create a simple formula allowing analysts to estimate the number of topics, so that the top X topics include the desired proportion of documents under study. We derived the formula from the empirical analysis of three SE-related text corpuses. We believe that practitioners can use our formula to expedite LDA analysis. The formula is also of interest to theoreticians, as it suggests that different SE text corpuses have similar underlying properties.

Trend Topic Analysis using Latent Dirichlet Allocation (LDA) (Study Case: Denpasar People’s Complaints Online Website)

Jurnal Ilmiah Teknik Elektro Komputer dan Informatika ◽

10.26555/jiteki.v5i1.13088 ◽

2019 ◽

Vol 5 (1) ◽

Author(s):

Aulia Rizki Destarani ◽

Isnandar Slamet ◽

Sri Subanti

Keyword(s):

Topic Analysis ◽

Study Case ◽

Research topic analysis of the European Sport Management Quarterly: Topic modeling with Latent Dirichlet Allocation(LDA)

Korean Journal of Sport Science ◽

10.24985/kjss.2019.30.4.775 ◽

2019 ◽

Vol 30 (4) ◽

pp. 775-788

Author(s):

최미화 ◽

백주해 ◽

편도영 ◽

Hyungil Kwon

Keyword(s):

Topic Modeling ◽

Sport Management ◽

Research Topic ◽

Topic Analysis ◽

Topic Analysis of the Research Domain in Knowledge Organization: A Latent Dirichlet Allocation Approach

KNOWLEDGE ORGANIZATION ◽

10.5771/0943-7444-2018-2-170 ◽

2018 ◽

Vol 45 (2) ◽

pp. 170-183 ◽

Cited By ~ 3

Author(s):

Soohyung Joo ◽

Inkyung Choi ◽

and Namjoo Choi

Keyword(s):

Knowledge Organization ◽

Topic Analysis ◽

Research Domain ◽

Allocation Approach ◽

Topic analysis of Road safety inspections using latent dirichlet allocation: A case study of roadside safety in Irish main roads

Accident Analysis & Prevention ◽

10.1016/j.aap.2019.07.021 ◽

2019 ◽

Vol 131 ◽

pp. 336-349 ◽

Cited By ~ 3

Author(s):

Carlos Roque ◽

João Lourenço Cardoso ◽

Thomas Connell ◽

Govert Schermers ◽

Roland Weber

Keyword(s):

Road Safety ◽

Topic Analysis ◽

Roadside Safety ◽

Dirichlet Allocation ◽

Safety Inspections

GLOBAL FINANCIAL CRISIS AND TRADE PAPERS: TOPIC ANALYSIS VIA LATENT DIRICHLET ALLOCATION MODEL

Current Research in Social Sciences ◽

10.30613/curesosc.931149 ◽

2021 ◽

Author(s):

Halil ŞİMDİ ◽

Büşra GARİP

Keyword(s):

Financial Crisis ◽

Global Financial Crisis ◽

Allocation Model ◽

Topic Analysis ◽

Speeding up calibration of latent Dirichlet allocation model to improve topic analysis in software engineering

10.32920/ryerson.14665455 ◽

2021 ◽

Author(s):

Jorge Arturo Lopez

Keyword(s):

Software Engineering ◽

Simple Formula ◽

Allocation Model ◽

Topic Analysis ◽

Related Text ◽

The Empirical Analysis ◽

Large Corpus ◽

Extraction of topics from large text corpuses helps improve Software Engineering (SE) processes. Latent Dirichlet Allocation (LDA) represents one of the algorithmic tools to understand, search, exploit, and summarize a large corpus of data (documents), and it is often used to perform such analysis. However, calibration of the models is computationally expensive, especially if iterating over a large number of topics. Our goal is to create a simple formula allowing analysts to estimate the number of topics, so that the top X topics include the desired proportion of documents under study. We derived the formula from the empirical analysis of three SE-related text corpuses. We believe that practitioners can use our formula to expedite LDA analysis. The formula is also of interest to theoreticians, as it suggests that different SE text corpuses have similar underlying properties.

Topic analysis of online reviews for two competitive products using latent Dirichlet allocation

Electronic Commerce Research and Applications ◽

10.1016/j.elerap.2018.04.003 ◽

2018 ◽

Vol 29 ◽

pp. 142-156 ◽

Cited By ~ 21

Author(s):

Wenxin Wang ◽

Yi Feng ◽

Wenqiang Dai

Keyword(s):

Online Reviews ◽

Topic Analysis ◽

Competitive Products ◽

Evaluation of Text Semantic Features using Latent Dirichlet Allocation Model

International Journal of Performability Engineering ◽

10.23940/ijpe.20.06.p15.968978 ◽

2020 ◽

Vol 16 (6) ◽

pp. 968

Author(s):

Zhou Chunjie ◽

Li Nao ◽

Zhang Chi ◽

Yang Xiaoyu

Keyword(s):

Semantic Features ◽

Allocation Model ◽

Similarity Detection Using Latent Semantic Analysis Algorithm

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i8.124 ◽

2018 ◽

Vol 6 (8) ◽

pp. 102

Author(s):

Priyanka R. Patil ◽

Shital A. Patil

Keyword(s):

Latent Semantic Analysis ◽

Semantic Analysis ◽

Mining Method ◽

Research Papers ◽

Information Measures ◽

Automated Software ◽

Day By Day ◽

Ways Of Life ◽

Similarity View is an application for visually comparing and exploring multiple models of text and collection of document. Friendbook finds ways of life of clients from client driven sensor information, measures the closeness of ways of life amongst clients, and prescribes companions to clients if their ways of life have high likeness. Roused by demonstrate a clients day by day life as life records, from their ways of life are separated by utilizing the Latent Dirichlet Allocation Algorithm. Manual techniques can't be utilized for checking research papers, as the doled out commentator may have lacking learning in the exploration disciplines. For different subjective views, causing possible misinterpretations. An urgent need for an effective and feasible approach to check the submitted research papers with support of automated software. A method like text mining method come to solve the problem of automatically checking the research papers semantically. The proposed method to finding the proper similarity of text from the collection of documents by using Latent Dirichlet Allocation (LDA) algorithm and Latent Semantic Analysis (LSA) with synonym algorithm which is used to find synonyms of text index wise by using the English wordnet dictionary, another algorithm is LSA without synonym used to find the similarity of text based on index. LSA with synonym rate of accuracy is greater when the synonym are consider for matching.