N-gram adaptation using Dirichlet class language model based on part-of-speech for speech recognition

2013 21st Iranian Conference on Electrical Engineering (ICEE) ◽

10.1109/iraniancee.2013.6599642 ◽

2013 ◽

Author(s):

Ali Hatami ◽

Ahmad Akbari ◽

Babak Nasersharif

Keyword(s):

Speech Recognition ◽

Language Model ◽

Model Based ◽

Part Of Speech ◽

N-gram Language Model Based on Multi-Word Expressions in Web Documents for Speech Recognition and Closed-Captioning

2012 International Conference on Asian Language Processing ◽

10.1109/ialp.2012.55 ◽

2012 ◽

Author(s):

Shinya Takahashi ◽

Tsuyoshi Morimoto

Keyword(s):

Speech Recognition ◽

Language Model ◽

Web Documents ◽

Closed Captioning ◽

Model Based ◽

Dynamic out-of-vocabulary word registration to language model for speech recognition

EURASIP Journal on Audio Speech and Music Processing ◽

10.1186/s13636-020-00193-1 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Norihide Kitaoka ◽

Bohan Chen ◽

Yuya Obashi

Keyword(s):

Speech Recognition ◽

Language Model ◽

Language Models ◽

Natural Occurrence ◽

Registration Method ◽

Acoustic Models ◽

Part Of Speech ◽

Vocabulary Word ◽

N Gram ◽

Abstract: We propose a method of dynamically registering out-of-vocabulary (OOV) words by assigning their pronunciations to pre-inserted OOV tokens, that is, by editing the pronunciations of those tokens. To do this, we add OOV tokens to an additional, partial copy of our corpus, either randomly or to part-of-speech (POS) tags in the selected utterances, when training the language model (LM) for speech recognition. This results in an LM containing OOV tokens, to which we can assign pronunciations. We also investigate the impact of acoustic complexity and the “natural” occurrence frequency of OOV words on the recognition of registered OOV words. The proposed OOV word registration method is evaluated on two modern automatic speech recognition (ASR) systems, Julius and Kaldi, using DNN-HMM acoustic models and N-gram language models (plus an additional evaluation using RNN re-scoring with Kaldi). Our experimental results show that, with the proposed OOV registration method, modern ASR systems can recognize OOV words without re-training the language model, that the acoustic complexity of OOV words affects OOV recognition, and that differences between the “natural” and the assigned occurrence frequencies of OOV words have little impact on the final recognition results.
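The corpus-augmentation step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `<OOV1>`-style token spelling, the `copy_fraction` parameter, and the choice to replace one random word per copied utterance are all assumptions made for illustration, and the POS-guided variant is not shown.

```python
import random

def augment_with_oov_tokens(corpus, oov_tokens, copy_fraction=0.5, seed=0):
    """Return the corpus plus a partial copy in which one word per
    utterance is replaced by a placeholder OOV token (hypothetical scheme)."""
    rng = random.Random(seed)
    candidates = [utt for utt in corpus if utt]      # skip empty utterances
    n_copy = int(len(candidates) * copy_fraction)
    augmented = []
    for utt in rng.sample(candidates, n_copy):
        new_utt = list(utt)                          # copy, keep original intact
        position = rng.randrange(len(new_utt))       # random insertion point
        new_utt[position] = rng.choice(oov_tokens)
        augmented.append(new_utt)
    return corpus + augmented

def register_oov_word(lexicon, surface_forms, token, word, pronunciation):
    """Assign a real word and its pronunciation to a pre-inserted token,
    leaving the trained language model untouched."""
    lexicon[token] = pronunciation
    surface_forms[token] = word

corpus = [["turn", "on", "the", "light"],
          ["play", "some", "music"],
          ["call", "my", "office"],
          ["what", "time", "is", "it"]]
training_text = augment_with_oov_tokens(corpus, ["<OOV1>", "<OOV2>"])

lexicon, surface_forms = {}, {}
register_oov_word(lexicon, surface_forms, "<OOV1>", "Kaldi", ["k", "aa", "l", "d", "iy"])
```

An N-gram model trained on `training_text` would then contain the placeholder tokens, and only the pronunciation lexicon changes when a new word is registered.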

Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213016500068 ◽

2016 ◽

Vol 25 (02) ◽

pp. 1650006

Author(s):

Aleksander Smywinski-Pohl ◽

Bartosz Ziółko

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Language Model ◽

Language Models ◽

Clustering Method ◽

Training Corpus ◽

Model Based ◽

N Gram ◽

In this paper we investigate the usefulness of morphosyntactic information, as well as clustering, in modeling Polish for automatic speech recognition. Because Polish is an inflectional language, we investigate the usefulness of an N-gram model based on morphosyntactic features. We show how individual types of features influence the model and which types are best suited for building a language model for automatic speech recognition. We compare the results of applying them with a class-based model that is automatically derived from the training corpus. We show that our approach to clustering performs significantly better than the frequently used SRI LM clustering method. However, this difference is apparent only for smaller corpora.
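For orientation, a class-based bigram model is commonly written with the factorization below. This is the standard textbook form, given here as an assumption about what "class-based" means; the abstract does not state the exact model the authors use. Here $c_i$ denotes the class (for example, a cluster or a morphosyntactic tag) assigned to word $w_i$.

```latex
P(w_i \mid w_{i-1}) \approx P(c_i \mid c_{i-1}) \, P(w_i \mid c_i)
```

Because the number of classes is much smaller than the vocabulary size, the class-transition term can be estimated from far less data than a word bigram.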

Evaluating spoken language model based on filler prediction model in speech recognition

10.21437/interspeech.2008-256 ◽

2008 ◽

Author(s):

Kengo Ohta ◽

Masatoshi Tsuchiya ◽

Seiichi Nakagawa

Keyword(s):

Speech Recognition ◽

Prediction Model ◽

Language Model ◽

Spoken Language ◽

Combination of random indexing based language model and n-gram language model for speech recognition

10.21437/interspeech.2013-525 ◽

2013 ◽

Author(s):

Dominique Fohr ◽

Odile Mella

Keyword(s):

Speech Recognition ◽

Language Model ◽

Random Indexing ◽

Flick: Japanese Input Method Editor Using N-Gram and Recurrent Neural Network Language Model Based Predictive Text Input

2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS) ◽

10.1109/sitis.2017.19 ◽

2017 ◽

Author(s):

Yukino Ikegami ◽

Yoshitaka Sakurai ◽

Ernesto Damiani ◽

Rainer Knauf ◽

Setsuo Tsuruta

Keyword(s):

Neural Network ◽

Recurrent Neural Network ◽

Language Model ◽

Input Method ◽

Model Based ◽

N Gram ◽

Network Language

Managed N-gram language model based on Hadoop framework and a Hbase tables

2014 9th International Conference on Informatics and Systems ◽

10.1109/infos.2014.7036678 ◽

2014 ◽

Author(s):

Tahani Mahmoud Allam ◽

Alsayed Abdelhameed Sallam ◽

Hatem M. Abdullkader

Keyword(s):

Language Model ◽

Model Based ◽

N Gram ◽

Hadoop Framework

A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition

Computer Speech & Language ◽

10.1016/j.csl.2005.11.002 ◽

2007 ◽

Vol 21 (1) ◽

pp. 1-25 ◽

Author(s):

Xiaolong Li ◽

Yunxin Zhao

Keyword(s):

Speech Recognition ◽

Language Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

N Gram ◽

Memory Efficient

Document-based Dirichlet class language model for speech recognition using document-based n-gram events

2014 IEEE Spoken Language Technology Workshop (SLT) ◽

10.1109/slt.2014.7078547 ◽

2014 ◽

Author(s):

Md. Akmal Haidar ◽

Douglas O'Shaughnessy

Keyword(s):

Speech Recognition ◽

Language Model ◽

Language Model Based on Word Order Sensitive Matrix Representation in Latent Semantic Analysis for Speech Recognition

2009 WRI World Congress on Computer Science and Information Engineering ◽

10.1109/csie.2009.353 ◽

2009 ◽

Author(s):

Welly Naptali ◽

Masatoshi Tsuchiya ◽

Seiichi Nakagawa

Keyword(s):

Speech Recognition ◽

Latent Semantic Analysis ◽

Semantic Analysis ◽

Matrix Representation ◽

Language Model ◽
