Exploring the use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414961 ◽

2021 ◽

Author(s):

Vishwas M. Shetty ◽

S. Umesh

Keyword(s):

Speech Recognition ◽

Indian Languages ◽

Low Resource ◽

Download Full-text

Signal Processing Cues to Improve Automatic Speech Recognition for Low Resource Indian Languages

10.21437/sltu.2018-6 ◽

2018 ◽

Author(s):

Arun Baby ◽

Karthik Pandia D S ◽

Hema A Murthy

Keyword(s):

Signal Processing ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Indian Languages ◽

Download Full-text

TDNN-based Multilingual Speech Recognition System for Low Resource Indian Languages

10.21437/interspeech.2018-2117 ◽

2018 ◽

Author(s):

Noor Fathima ◽

Tanvina Patel ◽

Mahima C ◽

Anuroop Iyengar

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Indian Languages ◽

Low Resource ◽

Multilingual Speech Recognition

Download Full-text

ISI ASR System for the Low Resource Speech Recognition Challenge for Indian Languages

10.21437/interspeech.2018-2473 ◽

2018 ◽

Author(s):

Jayadev Billa

Keyword(s):

Speech Recognition ◽

Indian Languages ◽

Low Resource ◽

Download Full-text

Improving the Performance of Transformer Based Low Resource Speech Recognition for Indian Languages

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9053808 ◽

2020 ◽

Author(s):

Vishwas M. Shetty ◽

Metilda Sagaya Mary N.J.

Keyword(s):

Speech Recognition ◽

Indian Languages ◽

Download Full-text

An Exploration towards Joint Acoustic Modeling for Indian Languages: IIIT-H Submission for Low Resource Speech Recognition Challenge for Indian Languages, INTERSPEECH 2018

10.21437/interspeech.2018-1584 ◽

2018 ◽

Author(s):

Hari Krishna ◽

Krishna Gurugubelli ◽

Vishnu Vidyadhara Raju V ◽

Anil Kumar Vuppala

Keyword(s):

Speech Recognition ◽

Acoustic Modeling ◽

Indian Languages ◽

Download Full-text

Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages

10.21437/sltu.2018-3 ◽

2018 ◽

Author(s):

Brij Mohan Lal Srivastava ◽

Sunayana Sitaram ◽

Rupesh Kumar Mehta ◽

Krishna Doss Mohan ◽

Pallavi Matani ◽

...

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Indian Languages ◽

Download Full-text

Active Learning Methods for Low Resource End-to-End Speech Recognition

10.21437/interspeech.2019-2316 ◽

2019 ◽

Author(s):

Karan Malhotra ◽

Shubham Bansal ◽

Sriram Ganapathy

Keyword(s):

Speech Recognition ◽

Active Learning ◽

Learning Methods ◽

Low Resource ◽

Download Full-text

A General Procedure for Improving Language Models in Low-Resource Speech Recognition

2019 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp48816.2019.9037726 ◽

2019 ◽

Author(s):

Qian Liu ◽

Wei-Qiang Zhang ◽

Jia Liu ◽

Yao Liu

Keyword(s):

Speech Recognition ◽

General Procedure ◽

Language Models ◽

Download Full-text

Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification

Circuits Systems and Signal Processing ◽

10.1007/s00034-021-01704-x ◽

2021 ◽

Author(s):

Joyanta Basu ◽

Soma Khan ◽

Rajib Roy ◽

Tapan Kumar Basu ◽

Swanirbhar Majumder

Keyword(s):

Language Identification ◽

Indian Languages ◽

Speech Corpus ◽

Download Full-text

Dynamic Acoustic Unit Augmentation with BPE-Dropout for Low-Resource End-to-End Speech Recognition

Sensors ◽

10.3390/s21093063 ◽

2021 ◽

Vol 21 (9) ◽

pp. 3063

Author(s):

Aleksandr Laptev ◽

Andrei Andrusenko ◽

Ivan Podluzhny ◽

Anton Mitrofanov ◽

Ivan Medennikov ◽

...

Keyword(s):

Speech Recognition ◽

Rapid Development ◽

Computational Cost ◽

Vocabulary Size ◽

Word Error Rate ◽

Low Resource ◽

Steady Improvement ◽

With the rapid development of speech assistants, adapting server-intended automatic speech recognition (ASR) solutions to a direct device has become crucial. For on-device speech recognition tasks, researchers and industry prefer end-to-end ASR systems as they can be made resource-efficient while maintaining a higher quality compared to hybrid systems. However, building end-to-end models requires a significant amount of speech data. Personalization, which is mainly handling out-of-vocabulary (OOV) words, is another challenging task associated with speech assistants. In this work, we consider building an effective end-to-end ASR system in low-resource setups with a high OOV rate, embodied in Babel Turkish and Babel Georgian tasks. We propose a method of dynamic acoustic unit augmentation based on the Byte Pair Encoding with dropout (BPE-dropout) technique. The method non-deterministically tokenizes utterances to extend the token’s contexts and to regularize their distribution for the model’s recognition of unseen words. It also reduces the need for optimal subword vocabulary size search. The technique provides a steady improvement in regular and personalized (OOV-oriented) speech recognition tasks (at least 6% relative word error rate (WER) and 25% relative F-score) at no additional computational cost. Owing to the BPE-dropout use, our monolingual Turkish Conformer has achieved a competitive result with 22.2% character error rate (CER) and 38.9% WER, which is close to the best published multilingual system.

Download Full-text