Contextual vector quantization for speech recognition with discrete hidden Markov model

Author(s):  
Qiang Huo ◽  
Chorkin Chan
2017 ◽  
Vol 23 (7) ◽  
pp. 6555-6558 ◽  
Author(s):  
Sukmawati Nur Endah ◽  
Priyo Sidik Sasongko ◽  
Helmie Arif Wibawa ◽  
Sutikno ◽  
Retno Kusumaningrum

2014 ◽  
Vol 2014 ◽  
pp. 1-10 ◽  
Author(s):  
Lokesh Selvaraj ◽  
Balakrishnan Ganesan

Enhancing speech recognition is the primary intention of this work. In this paper a novel speech recognition method based on vector quantization and improved particle swarm optimization (IPSO) is suggested. The suggested methodology contains four stages, namely, (i) denoising, (ii) feature mining (iii), vector quantization, and (iv) IPSO based hidden Markov model (HMM) technique (IP-HMM). At first, the speech signals are denoised using median filter. Next, characteristics such as peak, pitch spectrum, Mel frequency Cepstral coefficients (MFCC), mean, standard deviation, and minimum and maximum of the signal are extorted from the denoised signal. Following that, to accomplish the training process, the extracted characteristics are given to genetic algorithm based codebook generation in vector quantization. The initial populations are created by selecting random code vectors from the training set for the codebooks for the genetic algorithm process and IP-HMM helps in doing the recognition. At this point the creativeness will be done in terms of one of the genetic operation crossovers. The proposed speech recognition technique offers 97.14% accuracy.


2016 ◽  
Vol 7 (2) ◽  
pp. 76-82
Author(s):  
Hugeng Hugeng ◽  
Edbert Hansel

We have built an application of speech recognition for Indonesian geography dictionary based on Android operating system, named GAIA. This application uses a smartphone as a device to receive input in the form of a spoken word from a user. The approach used in recognition is Hidden Markov Model which is contained in the Pocketsphinx library. The phonemes used are Indonesian phonemes’ rule. The advantage of this application is that it can be used without internet access. In the application testing, word detection is done with four conditions to determine the level of accuracy. The four conditions are near silent, near noisy, far silent, and far noisy. From the testing and analysis conducted, it can be concluded that GAIA application can be built as a speech recognition application on Android for Indonesian geography dictionary; with the results in the near silent condition accuracy of word recognition reaches an average of 52.87%, in the near noisy reaches an average of 14.5%, in the far silent condition reaches an average of 23.2%, and in the far noisy condition reaches an average of 2.8%. Index Terms—speech recognition, Indonesian geography dictionary, Hidden Markov Model, Pocketsphinx, Android.


Sign in / Sign up

Export Citation Format

Share Document