scholarly journals A Generalized Constraint Approach to Bilingual Dictionary Induction for Low-Resource Language Families

Author(s):  
Arbi Haza Nasution ◽  
Yohei Murakami ◽  
Toru Ishida
Author(s):  
Mairidan Wushouer ◽  
Donghui Lin ◽  
Toru Ishida ◽  
Yohei Murakami

Information ◽  
2020 ◽  
Vol 11 (2) ◽  
pp. 67 ◽  
Author(s):  
Donghui Lin ◽  
Yohei Murakami ◽  
Toru Ishida

The most challenging issue with low-resource languages is the difficulty of obtaining enough language resources. In this paper, we propose a language service framework for low-resource languages that enables the automatic creation and customization of new resources from existing ones. To achieve this goal, we first introduce a service-oriented language infrastructure, the Language Grid; it realizes new language services by supporting the sharing and combining of language resources. We then show the applicability of the Language Grid to low-resource languages. Furthermore, we describe how we can now realize the automation and customization of language services. Finally, we illustrate our design concept by detailing a case study of automating and customizing bilingual dictionary induction for low-resource Turkic languages and Indonesian ethnic languages.


Author(s):  
Mairidan Wushouer ◽  
Donghui Lin ◽  
Toru Ishida ◽  
Katsutoshi Hirayama

2020 ◽  
Vol 17 (1) ◽  
pp. 54-60
Author(s):  
B. S. Sowmya Lakshmi ◽  
B. R. Shambhavi

Visvesvaraya Technological University, Belagavi, Karnataka, India One of the promising resources to extract dictionaries are said to be parallel corpora. Majority of the substantial works are based on parallel corpora, whereas for the resource scarce language pairs building a parallel corpus is a challenging task. To prevail over this issue, researchers found comparable corpora could be an alternative to extract dictionary. Proposed approach is to extract dictionary for a low resource language pair English and Kannada using comparable corpora obtained from Wikipedia dumps and corpus received from Indian Language Corpus Initiative (ILCI). Dictionary constructed comprises of both translation and transliteration entities with term level associations from English to Kannada. Resultant dictionary is of size 77545 tokens with precision score of 0.79. Proposed work is independent of language and could be expanded to other language pairs.


Author(s):  
Arbi Haza Nasution ◽  
Yohei Murakami ◽  
Toru Ishida

Creating bilingual dictionary is the first crucial step in enriching low-resource languages. Especially for the closely related ones, it has been shown that the constraint-based approach is useful for inducing bilingual lexicons from two bilingual dictionaries via the pivot language. However, if there are no available machine-readable dictionaries as input, we need to consider manual creation by bilingual native speakers. To reach a goal of comprehensively create multiple bilingual dictionaries, even if we already have several existing machine-readable bilingual dictionaries, it is still difficult to determine the execution order of the constraint-based approach to reducing the total cost. Plan optimization is crucial in composing the order of bilingual dictionaries creation with the consideration of the methods and their costs. We formalize the plan optimization for creating bilingual dictionaries by utilizing Markov Decision Process (MDP) with the goal to get a more accurate estimation of the most feasible optimal plan with the least total cost before fully implementing the constraint-based bilingual lexicon induction. We model a prior beta distribution of bilingual lexicon induction precision with language similarity and polysemy of the topology as and parameters. It is further used to model cost function and state transition probability. We estimated the cost of all investment plans as a baseline for evaluating the proposed MDP-based approach with total cost as an evaluation metric. After utilizing the posterior beta distribution in the first batch of experiments to construct the prior beta distribution in the second batch of experiments, the result shows 61.5% of cost reduction compared to the estimated all investment plans and 39.4% of cost reduction compared to the estimated MDP optimal plan. The MDP-based proposal outperformed the baseline on the total cost.


2016 ◽  
Vol 03 (02) ◽  
pp. 079-083
Author(s):  
Lawrence Mbuagbaw ◽  
Francisca Monebenimp ◽  
Bolaji Obadeyi ◽  
Grace Bissohong ◽  
Marie-Thérèse Obama ◽  
...  

2018 ◽  
Vol 4 (1) ◽  
pp. 295-313 ◽  
Author(s):  
Karley A Riffe

Faculty work now includes market-like behaviors that create research, teaching, and service opportunities. This study employs an embedded case study design to evaluate the extent to which faculty members interact with external organizations to mitigate financial constraints and how those relationships vary by academic discipline. The findings show a similar number of ties among faculty members in high- and low-resource disciplines, reciprocity between faculty members and external organizations, and an expanded conceptualization of faculty work.


2020 ◽  
Vol 22 (1) ◽  
pp. 41-60
Author(s):  
Sungjee Choi ◽  
Inwoo Nam ◽  
Jaehwan Kim

Diabetes ◽  
2018 ◽  
Vol 67 (Supplement 1) ◽  
pp. 93-LB
Author(s):  
EDDY JEAN BAPTISTE ◽  
PHILIPPE LARCO ◽  
MARIE-NANCY CHARLES LARCO ◽  
JULIA E. VON OETTINGEN ◽  
EDDLYS DUBOIS ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document