A low memory bandwidth Gaussian mixture model (GMM) processor for 20,000-word real-time speech recognition FPGA system

Gaussian Mixture Model is a popular method to detect moving targets for static cameras. Since the traditional Gaussian Mixture Model has a poor adaptability when the illumination is changing in the scene and has passive learning rate, this paper describes a method that can detect illumination variation and update the learning rate adaptively. It proposes an approach which uses the color histogram matching algorithm and adjusts the learning rate automatically after introducing illumination variation factor and model parameters. Furthermore, the proposed method can select the number of describing model component adaptively, so this method reduced the computation complexity and improved the real-time performance. The experiment results indicate that the detection system gets better robustness, adaptability and stability.

Download Full-text

Hysteresis-based selective Gaussian-mixture model for real-time background update

10.1117/12.704717 ◽

2007 ◽

Author(s):

Firas Achkar ◽

Aishy Amer

Keyword(s):

Real Time ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Background Update

Download Full-text

Real-time online learning of Gaussian mixture model for opacity mapping

Neurocomputing ◽

10.1016/j.neucom.2015.12.135 ◽

2016 ◽

Vol 211 ◽

pp. 212-220 ◽

Cited By ~ 3

Author(s):

Guo Zhou ◽

Dengming Zhu ◽

Yi Wei ◽

Zhaoqi Wang ◽

Yongquan Zhou

Keyword(s):

Online Learning ◽

Real Time ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture

Download Full-text

Indigenuous Vocabulary Reformulation for Continuousyorùbá Speech Recognition In M-Commerce Using Acoustic Nudging-Based Gaussian Mixture Model

10.21203/rs.3.rs-211622/v1 ◽

2021 ◽

Author(s):

Kehinde Lydia Ajayi ◽

Victor Azeta ◽

Isaac Odun-Ayo ◽

Ambrose Azeta ◽

Ajayi Peter Taiwo ◽

...

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Error Rate ◽

System Performance ◽

Recognition Rate ◽

Gaussian Mixture ◽

Computer Applications ◽

Word Error Rate ◽

The Mean

Abstract One of the current research areas is speech recognition by aiding in the recognition of speech signals through computer applications. In this research paper, Acoustic Nudging, (AN) Model is used in re-formulating the persistence automatic speech recognition (ASR) errors that involves user’s acoustic irrational behavior which alters speech recognition accuracy. GMM helped in addressing low-resourced attribute of Yorùbá language to achieve better accuracy and system performance. From the simulated results given, it is observed that proposed Acoustic Nudging-based Gaussian Mixture Model (ANGM) improves accuracy and system performance which is evaluated based on Word Recognition Rate (WRR) and Word Error Rate (WER)given by validation accuracy, testing accuracy, and training accuracy. The evaluation results for the mean WRR accuracy achieved for the ANGM model is 95.277% and the mean Word Error Rate (WER) is 4.723%when compared to existing models. This approach thereby reduce error rate by 1.1%, 0.5%, 0.8%, 0.3%, and 1.4% when compared with other models. Therefore this work was able to discover a foundation for advancing current understanding of under-resourced languages and at the same time, development of accurate and precise model for speech recognition.

Download Full-text

Speaker Normalization using Gaussian Mixture Model for Speaker Independent Speech Recognition

The KIPS Transactions PartB ◽

10.3745/kipstb.2005.12b.4.437 ◽

2005 ◽

Vol 12B (4) ◽

pp. 437-442

Author(s):

Ok-Keun Shin

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Speaker Normalization ◽

Speaker Independent

Download Full-text