Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition

Mapping Intimacies ◽

10.21437/interspeech.2019-2641 ◽

2019 ◽

Author(s):

Khoi-Nguyen C. Mac ◽

Xiaodong Cui ◽

Wei Zhang ◽

Michael Picheny

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Large Scale ◽

Deep Neural Network ◽

Acoustic Modeling

Download Full-text

Efficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network

Proceedings of the 2015 4th National Conference on Electrical, Electronics and Computer Engineering ◽

10.2991/nceece-15.2016.72 ◽

2016 ◽

Author(s):

Haitao Yao ◽

Maobo An ◽

Ji Xu ◽

Jian Liu

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Deep Neural Network ◽

Modeling Method ◽

Acoustic Modeling

Download Full-text

Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription

2013 IEEE Workshop on Automatic Speech Recognition and Understanding ◽

10.1109/asru.2013.6707758 ◽

2013 ◽

Author(s):

Hank Liao ◽

Erik McDermott ◽

Andrew Senior

Keyword(s):

Neural Network ◽

Large Scale ◽

Deep Neural Network ◽

Training Data ◽

Acoustic Modeling ◽

Supervised Training ◽

Video Transcription

Download Full-text

Deep Neural Network-based Speech Separation Combining with MVDR Beamformer for Automatic Speech Recognition System

2019 IEEE International Conference on Consumer Electronics (ICCE) ◽

10.1109/icce.2019.8662086 ◽

2019 ◽

Author(s):

Bong-Ki Lee ◽

Jaewoong Jeong

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network ◽

Recognition System ◽

Speech Recognition System ◽

Speech Separation ◽

Automatic Speech Recognition System

Download Full-text

Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition

10.21437/interspeech.2016-760 ◽

2016 ◽

Author(s):

Souvik Kundu ◽

Khe Chai Sim ◽

Mark J.F. Gales

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network ◽

Front End ◽

Download Full-text

Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition

10.21437/interspeech.2014-306 ◽

2014 ◽

Author(s):

Zhen Huang ◽

Jinyu Li ◽

Chao Weng ◽

Chin-Hui Lee

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network ◽

Cross Entropy ◽

Neural Network Training ◽

Objective Functions ◽

Network Training

Download Full-text

Joint acoustic factor learning for robust deep neural network based automatic speech recognition

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2016.7472634 ◽

2016 ◽

Author(s):

Souvik Kundu ◽

Gautam Mantena ◽

Yanmin Qian ◽

Tian Tan ◽

Marc Delcroix ◽

...

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network

Download Full-text

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies

IEEE Signal Processing Magazine ◽

10.1109/msp.2020.2969859 ◽

2020 ◽

Vol 37 (3) ◽

pp. 39-49

Author(s):

Xiaodong Cui ◽

Wei Zhang ◽

Ulrich Finkler ◽

George Saon ◽

Michael Picheny ◽

...

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network ◽

Acoustic Models ◽

Distributed Training ◽

Training Strategies

Download Full-text

Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network

Applied System Innovation ◽

10.3390/asi1030028 ◽

2018 ◽

Vol 1 (3) ◽

pp. 28 ◽

Author(s):

Jeih-weih Hung ◽

Jung-Shan Lin ◽

Po-Jen Wu

Keyword(s):

Neural Network ◽

Principal Component Analysis ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network ◽

Principal Component ◽

Component Analysis ◽

Robust Principal Component Analysis ◽

Speech Feature ◽

In recent decades, researchers have been focused on developing noise-robust methods in order to compensate for noise effects in automatic speech recognition (ASR) systems and enhance their performance. In this paper, we propose a feature-based noise-robust method that employs a novel data analysis technique—robust principal component analysis (RPCA). In the proposed scenario, RPCA is employed to process a noise-corrupted speech feature matrix, and the obtained sparse partition is shown to reveal speech-dominant characteristics. One apparent advantage of using RPCA for enhancing noise robustness is that no prior knowledge about the noise is required. The proposed RPCA-based method is evaluated with the Aurora-4 database and a task using a state-of-the-art deep neural network (DNN) architecture as the acoustic models. The evaluation results indicate that the newly proposed method can provide the original speech feature with significant recognition accuracy improvement, and can be cascaded with mean normalization (MN), mean and variance normalization (MVN), and relative spectral (RASTA)—three well-known and widely used feature robustness algorithms—to achieve better performance compared with the individual component method.

Download Full-text

Deep Neural Network for Automatic Speech Recognition from Indonesian Audio using Several Lexicon Types

2020 International Conference on Electrical Engineering and Informatics (ICELTICs) ◽

10.1109/iceltics50595.2020.9315538 ◽

2020 ◽

Author(s):

Taufik Fuadi Abidin ◽

Alim Misbullah ◽

Ridha Ferdhiana ◽

Muammar Zikri Aksana ◽

Laina Farsiah

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Deep Neural Network

Download Full-text

Deep neural network acoustic modeling for native and non-native Mandarin speech recognition

The 9th International Symposium on Chinese Spoken Language Processing ◽

10.1109/iscslp.2014.6936617 ◽

2014 ◽

Author(s):

Xin Chen ◽

Jian Cheng

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Deep Neural Network ◽

Acoustic Modeling ◽

Mandarin Speech Recognition

Download Full-text