Cross-language Phoneme Mapping for Low-resource Languages: An Exploration of Benefits and Trade-offs

Author(s):  
Nick K Chibuye ◽  
Todd Rosenstock ◽  
Brian DeRenzi
Symmetry ◽  
2019 ◽  
Vol 11 (2) ◽  
pp. 179 ◽  
Author(s):  
Chongchong Yu ◽  
Yunbing Chen ◽  
Yueqiao Li ◽  
Meng Kang ◽  
Shixuan Xu ◽  
...  

To rescue and preserve an endangered language, this paper studied an end-to-end speech recognition model based on sample transfer learning for the low-resource Tujia language. At the level of the Tujia international phonetic alphabet (IPA) label layer, using a Chinese corpus as an extension of the Tujia corpus can effectively mitigate the shortage of Tujia data: a cross-language corpus and an IPA dictionary unified across Chinese and Tujia were constructed. A convolutional neural network (CNN) and a bi-directional long short-term memory (BiLSTM) network were used to extract cross-language acoustic features and to train shared hidden layer weights for the Tujia and Chinese phonetic corpora. In addition, automatic speech recognition for the Tujia language was realized using an end-to-end method that consists of symmetric encoding and decoding. Furthermore, transfer learning was used to establish the model of the cross-language end-to-end Tujia language recognition system. The experimental results showed that the recognition error rate of the proposed model is 46.19%, which is 2.11% lower than that of the model trained only on Tujia language data. Therefore, this approach is feasible and effective.
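The idea of a unified IPA dictionary can be illustrated with a minimal sketch. It assumes each language's pronunciation dictionary is a mapping from a word to a list of IPA symbols; the function names, the word keys, and the symbol sequences below are hypothetical placeholders, not entries from the paper's corpus, and the paper's actual label construction may differ.

```python
from typing import Dict, List


def build_shared_label_set(*lexicons: Dict[str, List[str]]) -> Dict[str, int]:
    """Map every IPA symbol seen in any lexicon to a single integer label id."""
    symbols = sorted(
        {sym for lex in lexicons for phones in lex.values() for sym in phones}
    )
    # Ids start at 1; id 0 is left free (an assumption, e.g. for padding).
    return {sym: i + 1 for i, sym in enumerate(symbols)}


def encode(phones: List[str], label_ids: Dict[str, int]) -> List[int]:
    """Convert an IPA symbol sequence into label ids from the shared set."""
    return [label_ids[p] for p in phones]


# Placeholder entries for illustration only, NOT real dictionary data.
chinese_lexicon = {"zh_word_a": ["m", "a"], "zh_word_b": ["t", "a"]}
tujia_lexicon = {"tj_word_a": ["m", "i"], "tj_word_b": ["k", "a"]}

label_ids = build_shared_label_set(chinese_lexicon, tujia_lexicon)

# Symbols that occur in both lexicons share one label id, so training
# examples from either language update the same output units.
chinese_symbols = {s for phones in chinese_lexicon.values() for s in phones}
tujia_symbols = {s for phones in tujia_lexicon.values() for s in phones}
shared = chinese_symbols & tujia_symbols

print(label_ids)
print(sorted(shared))
print(encode(tujia_lexicon["tj_word_a"], label_ids))
```

Because both languages are encoded against one label inventory, a single output layer can be trained on the combined data, which is the property the abstract relies on when it treats the Chinese corpus as an extension of the Tujia corpus.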


Sensors ◽  
2021 ◽  
Vol 21 (24) ◽  
pp. 8313
Author(s):  
Łukasz Lepak ◽  
Kacper Radzikowski ◽  
Robert Nowak ◽  
Karol J. Piczak

Models for keyword spotting in continuous recordings can significantly improve the experience of navigating vast libraries of audio recordings. In this paper, we describe the development of such a keyword spotting system detecting regions of interest in Polish call centre conversations. Unfortunately, in spite of recent advancements in automatic speech recognition systems, human-level transcription accuracy reported on English benchmarks does not reflect the performance achievable in low-resource languages, such as Polish. Therefore, in this work, we shift our focus from complete speech-to-text conversion to acoustic similarity matching in the hope of reducing the demand for data annotation. As our primary approach, we evaluate Siamese and prototypical neural networks trained on several datasets of English and Polish recordings. While we obtain usable results in English, our models’ performance remains unsatisfactory when applied to Polish speech, both after mono- and cross-lingual training. This performance gap shows that generalisation with limited training resources is a significant obstacle for actual deployments in low-resource languages. As a potential countermeasure, we implement a detector using audio embeddings generated with a generic pre-trained model provided by Google. It has a much more favourable profile when applied in a cross-lingual setup to detect Polish audio patterns. Nevertheless, despite these promising results, its performance on out-of-distribution data is still far from stellar. This would indicate that, in spite of the richness of internal representations created by more generic models, such speech embeddings are not entirely malleable to cross-language transfer.
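Acoustic similarity matching with a prototypical approach can be sketched as nearest-prototype classification over embeddings. The sketch below assumes the embeddings have already been produced by some encoder; the two-dimensional vectors, the class names, and the helper functions are hypothetical and only illustrate the general technique, not the authors' system.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def prototype(embeddings: List[Vector]) -> List[float]:
    """Class prototype: the element-wise mean of the support embeddings."""
    n = len(embeddings)
    return [sum(dim) / n for dim in zip(*embeddings)]


def nearest_prototype(query: Vector, prototypes: Dict[str, Vector]) -> str:
    """Return the label whose prototype is closest in Euclidean distance."""
    return min(prototypes, key=lambda label: math.dist(query, prototypes[label]))


# Made-up 2-D embeddings standing in for encoder outputs (assumption).
support = {
    "keyword": [[1.0, 0.0], [0.8, 0.2]],
    "background": [[0.0, 1.0], [0.2, 0.8]],
}
prototypes = {label: prototype(vecs) for label, vecs in support.items()}

# A query segment is assigned to the class with the nearest prototype.
prediction = nearest_prototype([0.9, 0.2], prototypes)
print(prediction)
```

Only a few labelled support examples per class are needed at inference time, which is why this family of methods is attractive when annotation is scarce; the quality of the result still depends entirely on how well the encoder's embeddings transfer to the target language.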


2021 ◽  
Vol 6 (Suppl 5) ◽  
pp. e005341
Author(s):  
Sara Chamberlain ◽  
Priyanka Dutt ◽  
Anna Godfrey ◽  
Radharani Mitra ◽  
Amnesty Elizabeth LeFevre ◽  
...  

There has been exponential growth in the numbers of ‘digital development’ programmes seeking to leverage technology to solve systemic challenges. However, despite promising results and a shift from pilots to scale-ups, many have failed to realise their full potential. This paper reflects on lessons learnt from scaling and transitioning one of the largest mobile health programmes in the world to the Indian government. The complementary suite of services was designed by BBC Media Action to strengthen families’ reproductive, maternal, neonatal and child health behaviours. Mobile Academy was a training course to refresh frontline health workers’ (FLHWs) knowledge and improve their interpersonal communication skills. Mobile Kunji was a job aid to support FLHWs’ interactions with families. Kilkari delivered weekly audio information to families’ phones to reinforce FLHWs’ counselling. As of April 2019, when Mobile Academy and Kilkari were transitioned to the government, 206 000 FLHWs had graduated and Kilkari had reached 10 million subscribers. Lessons learnt include the following: (1) private sector business models are challenging in low-resource settings; (2) you may pilot ‘apples’ but scale ‘oranges’; (3) trade-offs are required between ideal solution design and affordability; (4) programme components should be reassessed before scaling; (5) operational viability at scale is a prerequisite for sustainability; (6) consider the true cost of open-source software; (7) taking informed consent in low-resource settings is challenging; (8) big data offer promise, but social norms and SIM change constrain use; (9) successful government engagements require significant capacity; (10) define governance structures and roadmaps up front.


2019 ◽  
Author(s):  
Constantine Lignos ◽  
Daniel Cohen ◽  
Yen-Chieh Lien ◽  
Pratik Mehta ◽  
W. Bruce Croft ◽  
...  

2015 ◽  
Vol 58 ◽  
pp. 83-100 ◽  
Author(s):  
Selena Gimenez-Ibanez ◽  
Marta Boter ◽  
Roberto Solano

Jasmonates (JAs) are essential signalling molecules that co-ordinate the plant response to biotic and abiotic challenges, as well as co-ordinating several developmental processes. Huge progress has been made over the last decade in understanding the components and mechanisms that govern JA perception and signalling. The bioactive form of the hormone, (+)-7-iso-jasmonyl-l-isoleucine (JA-Ile), is perceived by the COI1–JAZ co-receptor complex. JASMONATE ZIM DOMAIN (JAZ) proteins also act as direct repressors of transcriptional activators such as MYC2. In the emerging picture of JA-Ile perception and signalling, COI1 operates as an E3 ubiquitin ligase that upon binding of JA-Ile targets JAZ repressors for degradation by the 26S proteasome, thereby derepressing transcription factors such as MYC2, which in turn activate JA-Ile-dependent transcriptional reprogramming. It is noteworthy that MYCs and different spliced variants of the JAZ proteins are involved in a negative regulatory feedback loop, which suggests a model that rapidly turns the transcriptional JA-Ile responses on and off and thereby avoids a detrimental overactivation of the pathway. This chapter highlights the most recent advances in our understanding of JA-Ile signalling, focusing on the latest repertoire of new targets of JAZ proteins to control different sets of JA-Ile-mediated responses, novel mechanisms of negative regulation of JA-Ile signalling, and hormonal cross-talk at the molecular level that ultimately determines plant adaptability and survival.


2004 ◽  
Vol 20 (4) ◽  
pp. 349-357 ◽  
Author(s):  
Ahmed M. Abdel-Khalek ◽  
Joaquin Tomás-Sabádo ◽  
Juana Gómez-Benito

Summary: To construct a Spanish version of the Kuwait University Anxiety Scale (S-KUAS), the Arabic and English versions of the KUAS have been separately translated into Spanish. To check the comparability in terms of meaning, the two Spanish preliminary translations were thoroughly scrutinized vis-à-vis both the Arabic and English forms by several experts. Bilingual subjects served to explore the cross-language equivalence of the English and Spanish versions of the KUAS. The correlation between the total scores on both versions was .93, and the t value was .30 (n.s.), denoting good similarity. The Alphas and 4-week test-retest reliabilities were greater than .84, while the criterion-related validity was .70 against scores on the trait subscale of the STAI. These findings denote good reliability and validity of the S-KUAS. Factor analysis yielded three high-loaded factors of Behavioral/Subjective, Cognitive/Affective, and Somatic Anxiety, equivalent to the original Arabic version. Female (n = 210) undergraduates attained significantly higher mean scores than their male (n = 102) counterparts. For the combined group of males and females, the correlation between the total score on the S-KUAS and age was -.17 (p < .01). By and large, the findings of the present study provide evidence of the utility of the S-KUAS in assessing trait anxiety levels in the Spanish undergraduate context.


2012 ◽  
Vol 11 (3) ◽  
pp. 118-126 ◽  
Author(s):  
Olive Emil Wetter ◽  
Jürgen Wegge ◽  
Klaus Jonas ◽  
Klaus-Helmut Schmidt

In most work contexts, several performance goals coexist, and conflicts between them and trade-offs can occur. Our paper is the first to contrast a dual goal for speed and accuracy with a single goal for speed on the same task. The Sternberg paradigm (Experiment 1, n = 57) and the d2 test (Experiment 2, n = 19) were used as performance tasks. Speed measures and errors revealed in both experiments that dual as well as single goals increase performance by enhancing memory scanning. However, the single speed goal triggered a speed-accuracy trade-off, favoring speed over accuracy, whereas this was not the case with the dual goal. In difficult trials, dual goals slowed down scanning processes again so that errors could be prevented. This new finding is particularly relevant for security domains, where both aspects have to be managed simultaneously.


2007 ◽  
Vol 62 (9) ◽  
pp. 1073-1074 ◽  
Author(s):  
Kennon M. Sheldon ◽  
Melanie S. Sheldon ◽  
Charles P. Nichols
