Subword-Level Language Identification for Intra-Word Code-Switching

Language identification of intra-word code-switching for Arabic-English

Array ◽

10.1016/j.array.2021.100104 ◽

2021 ◽

pp. 100104

Author(s):

Caroline Sabty ◽

Islam Mohamed ◽

Özlem Çetinoğlu ◽

Slim Abdennadher

Keyword(s):

Language Identification ◽

Code Switching ◽

Word Code

Get full-text (via PubEx)

Applying Grapheme, Word, and Syllable Information for Language Identification in Code Switching Sentences

2011 International Conference on Asian Language Processing ◽

10.1109/ialp.2011.34 ◽

2011 ◽

Author(s):

Yin-Lai Yeong ◽

Tien-Ping Tan

Keyword(s):

Language Identification ◽

Code Switching

Get full-text (via PubEx)

Code-Switching Ubique Est - Language Identification and Part-of-Speech Tagging for Historical Mixed Text

10.18653/v1/w16-2105 ◽

2016 ◽

Cited By ~ 3

Author(s):

Sarah Schulz ◽

Mareike Keller

Keyword(s):

Language Identification ◽

Code Switching ◽

Part Of Speech Tagging ◽

Part Of Speech ◽

Speech Tagging

Get full-text (via PubEx)

Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System

10.3115/v1/w14-3908 ◽

2014 ◽

Cited By ~ 11

Author(s):

Gokul Chittaranjan ◽

Yogarshi Vyas ◽

Kalika Bali ◽

Monojit Choudhury

Keyword(s):

Language Identification ◽

Code Switching ◽

Shared Task ◽

Word Level

Get full-text (via PubEx)

Language identification on code-switching utterances using multiple cues

10.21437/interspeech.2008-223 ◽

2008 ◽

Author(s):

Dau-Cheng Lyu ◽

Ren-Yuan Lyu

Keyword(s):

Language Identification ◽

Code Switching ◽

Multiple Cues

Get full-text (via PubEx)

Language Identification by Using Syllable-Based Duration Classification on Code-Switching Speech

Chinese Spoken Language Processing - Lecture Notes in Computer Science ◽

10.1007/11939993_50 ◽

2006 ◽

pp. 475-484 ◽

Cited By ~ 2

Author(s):

Dau-cheng Lyu ◽

Ren-yuan Lyu ◽

Yuang-chin Chiang ◽

Chun-nan Hsu

Keyword(s):

Language Identification ◽

Code Switching

Get full-text (via PubEx)

Syntactic frames and single-word code-switching: A case study of Mandarin Chinese - Norwegian bilingualism

Studies in Language Companion Series - The Sociolinguistics of Grammar ◽

10.1075/slcs.154.08afa ◽

2014 ◽

pp. 153-170 ◽

Cited By ~ 1

Author(s):

Tor A. Åfarli ◽

Fufen Jin

Keyword(s):

Mandarin Chinese ◽

Code Switching ◽

Single Word ◽

Word Code ◽

Syntactic Frames

Get full-text (via PubEx)

Improving Transformer Based End-to-End Code-Switching Speech Recognition Using Language Identification

Applied Sciences ◽

10.3390/app11199106 ◽

2021 ◽

Vol 11 (19) ◽

pp. 9106

Author(s):

Zheying Huang ◽

Pei Wang ◽

Jian Wang ◽

Haoran Miao ◽

Ji Xu ◽

...

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Recurrent Neural Networks ◽

Short Range ◽

Language Identification ◽

Code Switching ◽

Attention Model ◽

Sequential Computation ◽

Attention System ◽

Connectionist Temporal Classification

A Recurrent Neural Networks (RNN) based attention model has been used in code-switching speech recognition (CSSR). However, due to the sequential computation constraint of RNN, there are stronger short-range dependencies and weaker long-range dependencies, which makes it hard to immediately switch languages in CSSR. Firstly, to deal with this problem, we introduce the CTC-Transformer, relying entirely on a self-attention mechanism to draw global dependencies and adopting connectionist temporal classification (CTC) as an auxiliary task for better convergence. Secondly, we proposed two multi-task learning recipes, where a language identification (LID) auxiliary task is learned in addition to the CTC-Transformer automatic speech recognition (ASR) task. Thirdly, we study a decoding strategy to combine the LID into an ASR task. Experiments on the SEAME corpus demonstrate the effects of the proposed methods, achieving a mixed error rate (MER) of 30.95%. It obtains up to 19.35% relative MER reduction compared to the baseline RNN-based CTC-Attention system, and 8.86% relative MER reduction compared to the baseline CTC-Transformer system.

Get full-text (via PubEx)