Research on Mongolian Acoustic Model Based on the Triphone DDBHMM

Triphone DDBHMM (Duration Distribution Based HMM) is presented as the acoustic model for Mongolian continuous speech recognition, and the Mongolian Acoustic Model is optimized by state-binding. The experiment made a comparison of the triphone DDBHMM, diphone DDBHMM, triphone HMM on HTK platform and analyzed their effects on the accuracy of acoustic layer. The experimental results have showed that Triphone DDBHMM significantly improves the recognition performance of continuous speech recognition in Mongolian.

Download Full-text

Single Stream DBN Model Based Triphone for Continuous Speech Recognition

Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007) ◽

10.1109/ism.workshops.2007.48 ◽

2007 ◽

Cited By ~ 2

Author(s):

Guoyun Lv ◽

Dongmei Jiang ◽

Rongchun Zhao

Keyword(s):

Speech Recognition ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Model Based ◽

Single Stream

Download Full-text

Robust Acoustic Model Training Against Phoneme Variations for Large Vocabulary Continuous Speech Recognition

Signal and Image Processing ◽

10.2316/p.2012.759-070 ◽

2012 ◽

Author(s):

Gil Ho Lee ◽

Nam Soo Kim

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

Model Training

Download Full-text

Acoustic model combinations for continuous speech recognition system

International Journal of Computational Systems Engineering ◽

10.1504/ijcsyse.2012.050231 ◽

2012 ◽

Vol 1 (2) ◽

pp. 79

Author(s):

R.K. Aggarwal ◽

Mayank Dave

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition

Download Full-text

Continuous speech recognition model based on CTC Technology

Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018) ◽

10.2991/ncce-18.2018.25 ◽

2018 ◽

Author(s):

Yumeng Wang ◽

Jianmin Zhao

Keyword(s):

Speech Recognition ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Recognition Model ◽

Model Based

Download Full-text

Demonstrated real‐time continuous speech recognition performance

The Journal of the Acoustical Society of America ◽

10.1121/1.2018073 ◽

1980 ◽

Vol 67 (S1) ◽

pp. S15-S15

Author(s):

Larry Bahler ◽

Steve Moshier ◽

Peter Brown ◽

James Baker

Keyword(s):

Speech Recognition ◽

Real Time ◽

Recognition Performance ◽

Continuous Speech ◽

Continuous Speech Recognition

Download Full-text

An investigation of continuous speech recognition performance from unconstrained speech utterances

The Journal of the Acoustical Society of America ◽

10.1121/1.409420 ◽

1994 ◽

Vol 95 (5) ◽

pp. 2878-2878

Author(s):

R. C. Rose ◽

B. H. Juang ◽

C. H. Lee ◽

L. Lee

Keyword(s):

Speech Recognition ◽

Recognition Performance ◽

Continuous Speech ◽

Continuous Speech Recognition

Download Full-text

Lightly Supervised Acoustic Model Training for Mandarin Continuous Speech Recognition

Intelligent Science and Intelligent Data Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-642-36669-7_88 ◽

2013 ◽

pp. 727-734

Author(s):

Xiangang Li ◽

Zaihu Pang ◽

Xihong Wu

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Model Training

Download Full-text

Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tsa.2005.857791 ◽

2006 ◽

Vol 14 (3) ◽

pp. 855-872 ◽

Cited By ~ 11

Author(s):

S. Watanabe ◽

A. Sako ◽

A. Nakamura

Keyword(s):

Speech Recognition ◽

Bayesian Estimation ◽

Acoustic Model ◽

Automatic Determination ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

Variational Bayesian ◽

Model Topology

Download Full-text

State clustering in hidden Markov model-based continuous speech recognition

Computer Speech & Language ◽

10.1006/csla.1994.1019 ◽

1994 ◽

Vol 8 (4) ◽

pp. 369-383 ◽

Cited By ~ 69

Author(s):

S.J. Young ◽

P.C. Woodland

Keyword(s):

Speech Recognition ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Model Based

Download Full-text

Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v6i2.2684 ◽

2020 ◽

Vol 6 (2) ◽

Author(s):

Vincent Elbert Budiman ◽

Andreas Widjaja

Keyword(s):

Speech Recognition ◽

Error Rate ◽

Language Model ◽

Beam Width ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Word Error Rate ◽

Testing Data ◽

Bahasa Indonesia

Here a development of an Acoustic and Language Model is presented. Low Word Error Rate is an early good sign of a good Language and Acoustic Model. Although there are still parameters other than Words Error Rate, our work focused on building Bahasa Indonesia with approximately 2000 common words and achieved the minimum threshold of 25% Word Error Rate. There were several experiments consist of different cases, training data, and testing data with Word Error Rate and Testing Ratio as the main comparison. The language and acoustic model were built using Sphinx4 from Carnegie Mellon University using Hidden Markov Model for the acoustic model and ARPA Model for the language model. The models configurations, which are Beam Width and Force Alignment, directly correlates with Word Error Rate. The configurations were set to 1e-80 for Beam Width and 1e-60 for Force Alignment to prevent underfitting or overfitting of the acoustic model. The goals of this research are to build continuous speech recognition in Bahasa Indonesia which has low Word Error Rate and to determine the optimum numbers of training and testing data which minimize the Word Error Rate.

Download Full-text