Recent advances in LVCSR : A benchmark comparison of performances

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v7i6.pp3358-3368 ◽

2017 ◽

Vol 7 (6) ◽

pp. 3358

Author(s):

Rahhal Errattahi ◽

Asmaa El Hannani

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Markov Models ◽

Key Factors ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

Current State ◽

Speech Corpora ◽

Large Vocabulary Speech Recognition

Large Vocabulary Continuous Speech Recognition (LVCSR), which is characterized by a high variability of the speech, is the most challenging task in automatic speech recognition (ASR). Believing that the evaluation of ASR systems on relevant and common speech corpora is one of the key factors that help accelerating research, we present, in this paper, a benchmark comparison of the performances of the current state-of-the-art LVCSR systems over different speech recognition tasks. Furthermore, we put objectively into evidence the best performing technologies and the best accuracy achieved so far in each task. The benchmarks have shown that the Deep Neural Networks and Convolutional Neural Networks have proven their efficiency on several LVCSR tasks by outperforming the traditional Hidden Markov Models and Guaussian Mixture Models. They have also shown that despite the satisfying performances in some LVCSR tasks, the problem of large-vocabulary speech recognition is far from being solved in some others, where more research efforts are still needed.

Download Full-text

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2011.2134090 ◽

2012 ◽

Vol 20 (1) ◽

pp. 30-42 ◽

Cited By ~ 1386

Author(s):

G. E. Dahl ◽

Dong Yu ◽

Li Deng ◽

A. Acero

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Large Vocabulary ◽

Large Vocabulary Speech Recognition ◽

Context Dependent

Download Full-text

Exploiting sparseness in deep neural networks for large vocabulary speech recognition

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2012.6288897 ◽

2012 ◽

Author(s):

Dong Yu ◽

Frank Seide ◽

Gang Li ◽

Li Deng

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Large Vocabulary ◽

Large Vocabulary Speech Recognition

Download Full-text

A cluster-based multiple deep neural networks method for large vocabulary continuous speech recognition

2013 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2013.6638948 ◽

2013 ◽

Author(s):

Pan Zhou ◽

Cong Liu ◽

Qingfeng Liu ◽

Lirong Dai ◽

Hui Jiang

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary

Download Full-text

Application of pretrained deep neural networks to large vocabulary speech recognition

10.21437/interspeech.2012-10 ◽

2012 ◽

Author(s):

Navdeep Jaitly ◽

Patrick Nguyen ◽

Andrew Senior ◽

Vincent Vanhoucke

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Large Vocabulary ◽

Large Vocabulary Speech Recognition

Download Full-text

Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMS in acoustic modeling

2012 8th International Symposium on Chinese Spoken Language Processing ◽

10.1109/iscslp.2012.6423452 ◽

2012 ◽

Author(s):

Jia Pan ◽

Cong Liu ◽

Zhiguo Wang ◽

Yu Hu ◽

Hui Jiang

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Acoustic Modeling ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary

Download Full-text

Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks

10.21437/interspeech.2019-2629 ◽

2019 ◽

Author(s):

Muhammad Umar Farooq ◽

Farah Adeeba ◽

Sahar Rauf ◽

Sarmad Hussain

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Recognition System ◽

Speech Recognition System ◽

Large Vocabulary

Download Full-text

Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2019.8682487 ◽

2019 ◽

Author(s):

Shoukang Hu ◽

Max W. Y. Lam ◽

Xurong Xie ◽

Shansong Liu ◽

Jianwei Yu ◽

...

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Gaussian Process ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary

Download Full-text

Large-vocabulary speaker-independent continuous speech recognition with semi-continuous hidden Markov models

Proceedings of the workshop on Speech and Natural Language - HLT '89 ◽

10.3115/1075434.1075480 ◽

1989 ◽

Author(s):

X. D. Huang ◽

H. W. Hon ◽

K. F. Lee

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

Speaker Independent

Download Full-text

Speaker adaptation for large vocabulary speech recognition systems using speaker Markov models

10.1109/icassp.1989.266349 ◽

2003 ◽

Author(s):

G. Rigoll

Keyword(s):

Speech Recognition ◽

Markov Models ◽

Large Vocabulary ◽

Recognition Systems ◽

Large Vocabulary Speech Recognition

Download Full-text

A comparative study of continuous speech recognition using neural networks and hidden Markov models

[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing ◽

10.1109/icassp.1991.150353 ◽

1991 ◽

Author(s):

S. Renals ◽

D. McKelvie ◽

F. McInnes

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Comparative Study ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Continuous Speech ◽

Continuous Speech Recognition

Download Full-text