On Indian English Language Model for Continuous Speech Recognition

Mapping Intimacies ◽

10.1109/icetci53161.2021.9563623 ◽

2021 ◽

Author(s):

Xin Jin ◽

Min Miao ◽

Keliang Zhang ◽

Zongbo Zhang

Keyword(s):

Speech Recognition ◽

English Language ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition

Computer Speech & Language ◽

10.1016/j.csl.2013.04.003 ◽

2014 ◽

Vol 28 (1) ◽

pp. 141-162 ◽

Author(s):

Bert Réveil ◽

Kris Demuynck ◽

Jean-Pierre Martens

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Two Stage ◽

Large Vocabulary ◽

Mixed Language ◽

Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

Text, Speech and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-540-87391-4_51 ◽

2008 ◽

pp. 397-401 ◽

Author(s):

Piotr Majewski

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary

On Continuous Speech Recognition of Indian English

Proceedings of the 2018 International Conference on Algorithms, Computing and Artificial Intelligence - ACAI 2018 ◽

10.1145/3302425.3302489 ◽

2018 ◽

Author(s):

Xin Jin ◽

Keliang Zhang ◽

Xian Huang ◽

Min Miao

Keyword(s):

Speech Recognition ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

N‐best breadth search for large vocabulary continuous speech recognition using a long span language model

The Journal of the Acoustical Society of America ◽

10.1121/1.423450 ◽

1998 ◽

Vol 104 (3) ◽

pp. 1819-1819 ◽

Author(s):

DongSuk Yuk ◽

ChiWei Che ◽

Prabhu Raghavan ◽

Samir Chennoukh ◽

James Flanagan

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary

An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model

Proceedings of the workshop on Speech and Natural Language - HLT '91 ◽

10.3115/1075527.1075624 ◽

1992 ◽

Author(s):

Douglas B. Paul

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Stochastic Language

A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition

Computer Speech & Language ◽

10.1016/j.csl.2005.11.002 ◽

2007 ◽

Vol 21 (1) ◽

pp. 1-25 ◽

Author(s):

Xiaolong Li ◽

Yunxin Zhao

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

N Gram ◽

Memory Efficient

An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model

[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing ◽

10.1109/icassp.1992.225981 ◽

1992 ◽

Author(s):

D.B. Paul

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Stochastic Language

Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v6i2.2684 ◽

2020 ◽

Vol 6 (2) ◽

Author(s):

Vincent Elbert Budiman ◽

Andreas Widjaja

Keyword(s):

Speech Recognition ◽

Language Model ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Word Error Rate ◽

Testing Data ◽

Bahasa Indonesia

This work presents the development of an acoustic model and a language model for Bahasa Indonesia. A low Word Error Rate is an early indicator of a good language and acoustic model. Although there are parameters other than Word Error Rate, our work focused on building models for Bahasa Indonesia with approximately 2000 common words and achieved the minimum threshold of 25% Word Error Rate. Several experiments were run, consisting of different cases, training data, and testing data, with Word Error Rate and Testing Ratio as the main points of comparison. The language and acoustic models were built using Sphinx4 from Carnegie Mellon University, using a Hidden Markov Model for the acoustic model and an ARPA model for the language model. The model configurations, Beam Width and Force Alignment, directly correlate with Word Error Rate. They were set to 1e-80 for Beam Width and 1e-60 for Force Alignment to prevent underfitting or overfitting of the acoustic model. The goals of this research are to build a continuous speech recognizer for Bahasa Indonesia with a low Word Error Rate and to determine the optimum amounts of training and testing data that minimize the Word Error Rate.
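Since Word Error Rate is the main metric in the abstract above, a minimal sketch of how it is commonly computed may help: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the recognizer output, divided by the number of reference words. The function name `word_error_rate` and the sample sentences are assumptions made for illustration; this is not the scoring code used by the authors or by Sphinx4.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative sketch only: whitespace tokenization, no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts: one of four reference words is dropped.
    print(word_error_rate("saya makan nasi goreng", "saya makan nasi"))
```

Under this definition, one dropped word in a four-word reference gives a rate of 0.25, which corresponds to the 25% threshold mentioned in the abstract. Note that the rate can exceed 1.0 when the hypothesis contains many inserted words.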

A unified language model for large vocabulary continuous speech recognition of Turkish

Signal Processing ◽

10.1016/j.sigpro.2005.12.002 ◽

2006 ◽

Vol 86 (10) ◽

pp. 2844-2862 ◽

Author(s):

Ebru Arısoy ◽

Helin Dutağacı ◽

Levent M. Arslan

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary

An Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model.

10.21236/ada240745 ◽

1991 ◽

Author(s):

D. B. Paul

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Stochastic Language
