Articulatory Features for Multilingual Phone Recognition

2021 ◽  
pp. 57-66
Author(s):  
K. E. Manjunath
2015 ◽  
Vol 137 (4) ◽  
pp. 2302-2302
Author(s):  
Ganesh Sivaraman ◽  
Vikramjit Mitra ◽  
Hosung Nam ◽  
Elliot Saltzman ◽  
Carol Espy-Wilson

2020 ◽  
pp. 43-50
Author(s):  
ALBINA A. DOBRININA ◽  

The paper considers some articulatory features of allophones of the vowel /i/ in the Altai-Kizhi dialect (spoken in the locality Ust-Kan, Altai) of the Altai language visualized by magnetic resonance imaging (MRI). The Altai-Kizhi is the central basic dialect of the Altai literary language. In Altai, each rural locality represents a unique dialect, whose relevance of studying was emphasized by V. V. Radlov. Speech sounds of the /i/-type in the dialects of the Altai language are realized mainly as front variants with different degrees of openness. In the written Altai speech, the symbol “и” is used to denote narrow front non-labialized vowel; some variants of the Altai vowel /i/ are central-back differing in this from the Russian vowel /i/. Experimental data on the territorial dialects of the Altai-Kizhi dialect, obtained from its 6 native speakers (d1-d6) taking into account variable inherent palate height, shows both the common articulation bases of native speakers (clearly-expressed frontness) and their differences (variable openness).


Author(s):  
Manjunath K. E. ◽  
Srinivasa Raghavan K. M. ◽  
K. Sreenivasa Rao ◽  
Dinesh Babu Jayagopi ◽  
V. Ramasubramanian

In this study, we evaluate and compare two different approaches for multilingual phone recognition in code-switched and non-code-switched scenarios. First approach is a front-end Language Identification (LID)-switched to a monolingual phone recognizer (LID-Mono), trained individually on each of the languages present in multilingual dataset. In the second approach, a common multilingual phone-set derived from the International Phonetic Alphabet (IPA) transcription of the multilingual dataset is used to develop a Multilingual Phone Recognition System (Multi-PRS). The bilingual code-switching experiments are conducted using Kannada and Urdu languages. In the first approach, LID is performed using the state-of-the-art i-vectors. Both monolingual and multilingual phone recognition systems are trained using Deep Neural Networks. The performance of LID-Mono and Multi-PRS approaches are compared and analysed in detail. It is found that the performance of Multi-PRS approach is superior compared to more conventional LID-Mono approach in both code-switched and non-code-switched scenarios. For code-switched speech, the effect of length of segments (that are used to perform LID) on the performance of LID-Mono system is studied by varying the window size from 500 ms to 5.0 s, and full utterance. The LID-Mono approach heavily depends on the accuracy of the LID system and the LID errors cannot be recovered. But, the Multi-PRS system by virtue of not having to do a front-end LID switching and designed based on the common multilingual phone-set derived from several languages, is not constrained by the accuracy of the LID system, and hence performs effectively on code-switched and non-code-switched speech, offering low Phone Error Rates than the LID-Mono system.


2015 ◽  
Vol 35 (45) ◽  
pp. 15015-15025 ◽  
Author(s):  
J. M. Correia ◽  
B. M. B. Jansma ◽  
M. Bonte

Sign in / Sign up

Export Citation Format

Share Document