scholarly journals Adaptive Mixed Component LDA for Low Resource Topic Modeling

Author(s):  
Suzanna Sia ◽  
Kevin Duh
Author(s):  
Zarmeen Nasim

This research is an endeavor to combine deep-learning-based language modeling with classical topic modeling techniques to produce interpretable topics for a given set of documents in Urdu, a low resource language. The existing topic modeling techniques produce a collection of words, often un-interpretable, as suggested topics without integrat-ing them into a semantically correct phrase/sentence. The proposed approach would first build an accurate Part of Speech (POS) tagger for the Urdu Language using a publicly available corpus of many million sentences. Using semanti-cally rich feature extraction approaches including Word2Vec and BERT, the proposed approach, in the next step, would experiment with different clus-tering and topic modeling techniques to produce a list of potential topics for a given set of documents. Finally, this list of topics would be sent to a labeler module to produce syntactically correct phrases that will represent interpretable topics.


2016 ◽  
Vol 03 (02) ◽  
pp. 079-083
Author(s):  
Lawrence Mbuagbaw ◽  
Francisca Monebenimp ◽  
Bolaji Obadeyi ◽  
Grace Bissohong ◽  
Marie-Thérèse Obama ◽  
...  

2018 ◽  
Vol 4 (1) ◽  
pp. 295-313 ◽  
Author(s):  
Karley A Riffe

Faculty work now includes market-like behaviors that create research, teaching, and service opportunities. This study employs an embedded case study design to evaluate the extent to which faculty members interact with external organizations to mitigate financial constraints and how those relationships vary by academic discipline. The findings show a similar number of ties among faculty members in high- and low-resource disciplines, reciprocity between faculty members and external organizations, and an expanded conceptualization of faculty work.


Author(s):  
Maria A. Milkova

Nowadays the process of information accumulation is so rapid that the concept of the usual iterative search requires revision. Being in the world of oversaturated information in order to comprehensively cover and analyze the problem under study, it is necessary to make high demands on the search methods. An innovative approach to search should flexibly take into account the large amount of already accumulated knowledge and a priori requirements for results. The results, in turn, should immediately provide a roadmap of the direction being studied with the possibility of as much detail as possible. The approach to search based on topic modeling, the so-called topic search, allows you to take into account all these requirements and thereby streamline the nature of working with information, increase the efficiency of knowledge production, avoid cognitive biases in the perception of information, which is important both on micro and macro level. In order to demonstrate an example of applying topic search, the article considers the task of analyzing an import substitution program based on patent data. The program includes plans for 22 industries and contains more than 1,500 products and technologies for the proposed import substitution. The use of patent search based on topic modeling allows to search immediately by the blocks of a priori information – terms of industrial plans for import substitution and at the output get a selection of relevant documents for each of the industries. This approach allows not only to provide a comprehensive picture of the effectiveness of the program as a whole, but also to visually obtain more detailed information about which groups of products and technologies have been patented.


2020 ◽  
Vol 16 (2) ◽  
pp. 83-115
Author(s):  
Mira Kim ◽  
◽  
Hye Sun Hwang ◽  
Xu Li

Diabetes ◽  
2018 ◽  
Vol 67 (Supplement 1) ◽  
pp. 93-LB
Author(s):  
EDDY JEAN BAPTISTE ◽  
PHILIPPE LARCO ◽  
MARIE-NANCY CHARLES LARCO ◽  
JULIA E. VON OETTINGEN ◽  
EDDLYS DUBOIS ◽  
...  

2019 ◽  
Vol 58 (6) ◽  
pp. 197-207
Author(s):  
Juhae Baeck ◽  
Hyungil Kwon ◽  
Mihwa Choi ◽  
Yi-Hsiu Lin

Sign in / Sign up

Export Citation Format

Share Document