Data Augmentation and Teacher-Student Training for LF-MMI Based Robust Speech Recognition

Text, Speech, and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-030-00794-2_43 ◽

2018 ◽

pp. 403-410

Author(s):

Asadullah ◽

Tanel Alumäe

Keyword(s):

Speech Recognition ◽

Data Augmentation ◽

Robust Speech Recognition ◽

Teacher Student ◽

Student Training

Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) ◽

10.1109/asru.2017.8268911 ◽

2017 ◽

Author(s):

Wei-Ning Hsu ◽

Yu Zhang ◽

James Glass

Keyword(s):

Speech Recognition ◽

Data Augmentation ◽

Domain Adaptation ◽

Robust Speech Recognition ◽

Unsupervised Domain Adaptation ◽

Variational Autoencoder

Speech Enhancement Based on Teacher–Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2019.2940662 ◽

2019 ◽

Vol 27 (12) ◽

pp. 2080-2091 ◽

Author(s):

Yan-Hui Tu ◽

Jun Du ◽

Chin-Hui Lee

Keyword(s):

Deep Learning ◽

Speech Recognition ◽

Speech Enhancement ◽

Robust Speech Recognition ◽

Teacher Student ◽

Noise Robust Speech Recognition

Data augmentation using generative adversarial networks for robust speech recognition

Speech Communication ◽

10.1016/j.specom.2019.08.006 ◽

2019 ◽

Vol 114 ◽

pp. 1-9 ◽

Author(s):

Yanmin Qian ◽

Hu Hu ◽

Tian Tan

Keyword(s):

Speech Recognition ◽

Data Augmentation ◽

Generative Adversarial Networks ◽

Robust Speech Recognition ◽

Adversarial Networks

Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition

2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP) ◽

10.1109/iscslp.2018.8706651 ◽

2018 ◽

Author(s):

Peiyao Sheng ◽

Zhuolin Yang ◽

Hu Hu ◽

Tian Tan ◽

Yanmin Qian

Keyword(s):

Speech Recognition ◽

Data Augmentation ◽

Generative Adversarial Networks ◽

Robust Speech Recognition ◽

Adversarial Networks

Speech Recognition for Task Domains with Sparse Matched Training Data

Applied Sciences ◽

10.3390/app10186155 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6155

Author(s):

Byung Ok Kang ◽

Hyeong Bae Jeon ◽

Jeon Gue Park

Keyword(s):

Speech Recognition ◽

Active Learning ◽

Latent Variables ◽

Data Augmentation ◽

Integrated System ◽

Training Data ◽

Target Domain ◽

Teacher Student ◽

Speech Data ◽

Active Learning Method

We propose two approaches to speech recognition for task domains with sparse matched training data. The first is an active learning method that selects training data for the target domain from a general domain that already has a large amount of labeled speech data. This method uses attribute-disentangled latent variables: for the active learning process, we designed an integrated system consisting of a variational autoencoder, whose encoder infers latent variables with disentangled attributes from the input speech, and a classifier that selects training data with attributes matching the target domain. The second approach combines data augmentation methods for generating matched target-domain speech data with transfer learning methods based on teacher/student learning. To evaluate the proposed method, we experimented with various task domains with sparse matched training data. The experimental results show that the proposed method has qualitative characteristics suited to this purpose, outperforms random selection, and is comparable to using an equal amount of additional target-domain data.

Generative Adversarial Networks Based Data Augmentation for Noise Robust Speech Recognition

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2018.8462624 ◽

2018 ◽

Author(s):

Hu Hu ◽

Tian Tan ◽

Yanmin Qian

Keyword(s):

Speech Recognition ◽

Data Augmentation ◽

Generative Adversarial Networks ◽

Robust Speech Recognition ◽

Adversarial Networks ◽

Noise Robust Speech Recognition

A study on data augmentation of reverberant speech for robust speech recognition

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2017.7953152 ◽

2017 ◽

Author(s):

Tom Ko ◽

Vijayaditya Peddinti ◽

Daniel Povey ◽

Michael L. Seltzer ◽

Sanjeev Khudanpur

Keyword(s):

Speech Recognition ◽

Data Augmentation ◽

Robust Speech Recognition ◽

Reverberant Speech

Exploring Low-Dimensional Structures of Modulation Spectra for Robust Speech Recognition

10.21437/interspeech.2017-611 ◽

2017 ◽

Author(s):

Bi-Cheng Yan ◽

Chin-Hong Shih ◽

Shih-Hung Liu ◽

Berlin Chen

Keyword(s):

Speech Recognition ◽

Robust Speech Recognition ◽

Low Dimensional

Toward Robust Speech Recognition and Understanding

The Journal of VLSI Signal Processing Systems for Signal Image and Video Technology ◽

10.1007/s11265-005-4149-x ◽

2005 ◽

Vol 41 (3) ◽

pp. 245-254 ◽

Author(s):

Sadaoki Furui

Keyword(s):

Speech Recognition ◽

Robust Speech Recognition

Deep bidirectional neural networks for robust speech recognition under heavy background noise

Materials Today Proceedings ◽

10.1016/j.matpr.2021.02.640 ◽

2021 ◽

Author(s):

Jeevan Reddy Koya ◽

S.P. Venu Madhava Rao

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Background Noise ◽

Robust Speech Recognition
