A low memory bandwidth Gaussian mixture model (GMM) processor for 20,000-word real-time speech recognition FPGA system

Author(s):  
Kazuo Miura ◽  
Hiroki Noguchi ◽  
Hiroshi Kawaguchi ◽  
Masahiko Yoshimoto
2011 ◽  
Vol 25 (2) ◽  
pp. 404-439 ◽  
Author(s):  
Daniel Povey ◽  
Lukáš Burget ◽  
Mohit Agarwal ◽  
Pinar Akyazi ◽  
Feng Kai ◽  
...  

2014 ◽  
Vol 599-601 ◽  
pp. 814-818 ◽  
Author(s):  
Xue Yuan Chen ◽  
Xia Fu Lv ◽  
Jie Liu

Gaussian Mixture Model is a popular method to detect moving targets for static cameras. Since the traditional Gaussian Mixture Model has a poor adaptability when the illumination is changing in the scene and has passive learning rate, this paper describes a method that can detect illumination variation and update the learning rate adaptively. It proposes an approach which uses the color histogram matching algorithm and adjusts the learning rate automatically after introducing illumination variation factor and model parameters. Furthermore, the proposed method can select the number of describing model component adaptively, so this method reduced the computation complexity and improved the real-time performance. The experiment results indicate that the detection system gets better robustness, adaptability and stability.


2016 ◽  
Vol 211 ◽  
pp. 212-220 ◽  
Author(s):  
Guo Zhou ◽  
Dengming Zhu ◽  
Yi Wei ◽  
Zhaoqi Wang ◽  
Yongquan Zhou

2021 ◽  
Author(s):  
Kehinde Lydia Ajayi ◽  
Victor Azeta ◽  
Isaac Odun-Ayo ◽  
Ambrose Azeta ◽  
Ajayi Peter Taiwo ◽  
...  

Abstract One of the current research areas is speech recognition by aiding in the recognition of speech signals through computer applications. In this research paper, Acoustic Nudging, (AN) Model is used in re-formulating the persistence automatic speech recognition (ASR) errors that involves user’s acoustic irrational behavior which alters speech recognition accuracy. GMM helped in addressing low-resourced attribute of Yorùbá language to achieve better accuracy and system performance. From the simulated results given, it is observed that proposed Acoustic Nudging-based Gaussian Mixture Model (ANGM) improves accuracy and system performance which is evaluated based on Word Recognition Rate (WRR) and Word Error Rate (WER)given by validation accuracy, testing accuracy, and training accuracy. The evaluation results for the mean WRR accuracy achieved for the ANGM model is 95.277% and the mean Word Error Rate (WER) is 4.723%when compared to existing models. This approach thereby reduce error rate by 1.1%, 0.5%, 0.8%, 0.3%, and 1.4% when compared with other models. Therefore this work was able to discover a foundation for advancing current understanding of under-resourced languages and at the same time, development of accurate and precise model for speech recognition.


Sign in / Sign up

Export Citation Format

Share Document