Indigenuous Vocabulary Reformulation for Continuousyorùbá Speech Recognition In M-Commerce Using Acoustic Nudging-Based Gaussian Mixture Model

Abstract One of the current research areas is speech recognition by aiding in the recognition of speech signals through computer applications. In this research paper, Acoustic Nudging, (AN) Model is used in re-formulating the persistence automatic speech recognition (ASR) errors that involves user’s acoustic irrational behavior which alters speech recognition accuracy. GMM helped in addressing low-resourced attribute of Yorùbá language to achieve better accuracy and system performance. From the simulated results given, it is observed that proposed Acoustic Nudging-based Gaussian Mixture Model (ANGM) improves accuracy and system performance which is evaluated based on Word Recognition Rate (WRR) and Word Error Rate (WER)given by validation accuracy, testing accuracy, and training accuracy. The evaluation results for the mean WRR accuracy achieved for the ANGM model is 95.277% and the mean Word Error Rate (WER) is 4.723%when compared to existing models. This approach thereby reduce error rate by 1.1%, 0.5%, 0.8%, 0.3%, and 1.4% when compared with other models. Therefore this work was able to discover a foundation for advancing current understanding of under-resourced languages and at the same time, development of accurate and precise model for speech recognition.

Download Full-text

The subspace Gaussian mixture model—A structured model for speech recognition

Computer Speech & Language ◽

10.1016/j.csl.2010.06.003 ◽

2011 ◽

Vol 25 (2) ◽

pp. 404-439 ◽

Cited By ~ 151

Author(s):

Daniel Povey ◽

Lukáš Burget ◽

Mohit Agarwal ◽

Pinar Akyazi ◽

Feng Kai ◽

...

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Structured Model

Download Full-text

Noise spectrum estimation using Gaussian mixture model-based speech presence probability for robust speech recognition

10.21437/interspeech.2014-162 ◽

2014 ◽

Author(s):

M. J. Alam ◽

Patrick Kenny ◽

Pierre Dumouchel ◽

Douglas O'Shaughnessy

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Noise Spectrum ◽

Robust Speech Recognition ◽

Spectrum Estimation ◽

Model Based

Download Full-text

Development of Automatic Speech Recognition for Xitsonga Using Subspace Gaussian Mixture Model

10.1109/icabcd51485.2021.9519355 ◽

2021 ◽

Author(s):

Vukosi Rikhotso ◽

Thipe Modipa ◽

Madimetja Jonas Manamela ◽

Tumisho Bilson Mokgonyane

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Automatic Speech Recognition ◽

Gaussian Mixture

Download Full-text

A Gaussian Mixture Model Based Speech Recognition System Using Matlab

Signal & Image Processing An International Journal ◽

10.5121/sipij.2013.4409 ◽

2013 ◽

Vol 4 (4) ◽

pp. 109-118 ◽

Cited By ~ 5

Author(s):

Manan Vyas

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Recognition System ◽

Speech Recognition System ◽

Model Based

Download Full-text

Speaker Normalization using Gaussian Mixture Model for Speaker Independent Speech Recognition

The KIPS Transactions PartB ◽

10.3745/kipstb.2005.12b.4.437 ◽

2005 ◽

Vol 12B (4) ◽

pp. 437-442

Author(s):

Ok-Keun Shin

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Speaker Normalization ◽

Speaker Independent

Download Full-text

A two-stage speaker adaptation approach for subspace Gaussian mixture model based nonnative speech recognition

10.21437/interspeech.2012-483 ◽

2012 ◽

Author(s):

Bo Li ◽

Khe Chai Sim

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Two Stage ◽

Model Based

Download Full-text

A low memory bandwidth Gaussian mixture model (GMM) processor for 20,000-word real-time speech recognition FPGA system

2008 International Conference on Field-Programmable Technology ◽

10.1109/fpt.2008.4762413 ◽

2008 ◽

Author(s):

Kazuo Miura ◽

Hiroki Noguchi ◽

Hiroshi Kawaguchi ◽

Masahiko Yoshimoto

Keyword(s):

Speech Recognition ◽

Real Time ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Memory Bandwidth

Download Full-text

Speech Emotion Recognition Using Multiple Discriminant Analysis and Gaussian Mixture Model

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.380-384.3530 ◽

2013 ◽

Vol 380-384 ◽

pp. 3530-3533

Author(s):

Yong Qiang Bao ◽

Li Zhao ◽

Cheng Wei Huang

Keyword(s):

Discriminant Analysis ◽

Emotion Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Speech Signal ◽

Recognition Rate ◽

Gaussian Mixture ◽

Pitch Contour ◽

Speech Emotion Recognition ◽

Multiple Discriminant Analysis

In this paper we studied speech emotion recognition from Mandarin speech signal. Five basic emotion classes and the neutral state are considered. In a listening experiment we verified the speech corpus using a judgment matrix. Acoustic parameters including short-term energy, pitch contour, and formants are extracted from emotional speech signal. Gaussian mixture model is then adopted for training the emotion model. Due to the data challenge in GMM training, we use multiple discriminant analysis for feature optimization and compared with basic Fisher discriminant ratio based method. The experimental results show that using multiple discriminant analysis our GMM classifier gives a promising recognition rate for Mandarin speech emotion recognition.

Download Full-text