Enhancing robustness of zero resource children's speech recognition system through bispectrum based front-end acoustic features

Digital Signal Processing ◽

10.1016/j.dsp.2021.103226 ◽

2021 ◽

pp. 103226

Author(s):

S. Shahnawazuddin ◽

Avinash Kumar ◽

Saurabh Kumar ◽

Waquar Ahmad

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Acoustic Features ◽

Front End ◽

Children’S Speech Recognition ◽

Children's Speech

Download Full-text

Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions

Applied Acoustics ◽

10.1016/j.apacoust.2021.107918 ◽

2021 ◽

Vol 177 ◽

pp. 107918

Author(s):

Vivek Bhardwaj ◽

Vinay Kukreja

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Children’S Speech Recognition ◽

Acoustic Conditions ◽

Children's Speech

Download Full-text

Developing children’s speech recognition system for low resource Punjabi language

Applied Acoustics ◽

10.1016/j.apacoust.2021.108002 ◽

2021 ◽

Vol 178 ◽

pp. 108002

Author(s):

Virender Kadyan ◽

Syed Shanawazuddin ◽

Amitoj Singh

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Low Resource ◽

Children’S Speech Recognition ◽

Children's Speech

Download Full-text

The ZTSpeech system for CHiME-5 Challenge: A far-field speech recognition system with front-end and robust back-end

10.21437/chime.2018-13 ◽

2018 ◽

Author(s):

Chenxing Li ◽

Tieqiang Wang

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Far Field ◽

Speech Recognition System ◽

Download Full-text

Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System

Sustainability ◽

10.3390/su14020614 ◽

2022 ◽

Vol 14 (2) ◽

pp. 614

Author(s):

Taniya Hasija ◽

Virender Kadyan ◽

Kalpna Guleria ◽

Abdullah Alharbi ◽

Hashem Alyami ◽

...

Keyword(s):

Speech Recognition ◽

Mutual Information ◽

Data Augmentation ◽

Recognition System ◽

Speech Recognition System ◽

Prosodic Features ◽

Prosodic Feature ◽

Feature Based ◽

Maximum Mutual Information ◽

Children's Speech

Speech recognition has been an active field of research in the last few decades since it facilitates better human–computer interaction. Native language automatic speech recognition (ASR) systems are still underdeveloped. Punjabi ASR systems are in their infancy stage because most research has been conducted only on adult speech systems; however, less work has been performed on Punjabi children’s ASR systems. This research aimed to build a prosodic feature-based automatic children speech recognition system using discriminative modeling techniques. The corpus of Punjabi children’s speech has various runtime challenges, such as acoustic variations with varying speakers’ ages. Efforts were made to implement out-domain data augmentation to overcome such issues using Tacotron-based text to a speech synthesizer. The prosodic features were extracted from Punjabi children’s speech corpus, then particular prosodic features were coupled with Mel Frequency Cepstral Coefficient (MFCC) features before being submitted to an ASR framework. The system modeling process investigated various approaches, which included Maximum Mutual Information (MMI), Boosted Maximum Mutual Information (bMMI), and feature-based Maximum Mutual Information (fMMI). The out-domain data augmentation was performed to enhance the corpus. After that, prosodic features were also extracted from the extended corpus, and experiments were conducted on both individual and integrated prosodic-based acoustic features. It was observed that the fMMI technique exhibited 20% to 25% relative improvement in word error rate compared with MMI and bMMI techniques. Further, it was enhanced using an augmented dataset and hybrid front-end features (MFCC + POV + Fo + Voice quality) with a relative improvement of 13% compared with the earlier baseline system.

Download Full-text

A low-power, fixed-point, front-end feature extraction for a distributed speech recognition system

IEEE International Conference on Acoustics Speech and Signal Processing ◽

10.1109/icassp.2002.1005859 ◽

2002 ◽

Author(s):

Delaney ◽

Jayant ◽

Hans ◽

Simunic ◽

Acquaviva

Keyword(s):

Feature Extraction ◽

Fixed Point ◽

Speech Recognition ◽

Low Power ◽

Recognition System ◽

Speech Recognition System ◽

Distributed Speech Recognition ◽

Download Full-text

Assessment of pitch-adaptive front-end signal processing for children’s speech recognition

Computer Speech & Language ◽

10.1016/j.csl.2017.10.007 ◽

2018 ◽

Vol 48 ◽

pp. 103-121 ◽

Author(s):

Rohit Sinha ◽

S. Shahnawazuddin

Keyword(s):

Signal Processing ◽

Speech Recognition ◽

Front End ◽

Children’S Speech Recognition ◽

Children's Speech

Download Full-text

Front-end of Wake-Up-Word Speech Recognition System Design on FPGA

Journal of Telecommunications System & Management ◽

10.4172/2167-0919.1000108 ◽

2013 ◽

Vol 02 (01) ◽

Author(s):

Mohamed M Eljhani Brian H Hight

Keyword(s):

Speech Recognition ◽

System Design ◽

Recognition System ◽

Speech Recognition System ◽

Download Full-text

Pitch-Normalized Acoustic Features for Robust Children's Speech Recognition

IEEE Signal Processing Letters ◽

10.1109/lsp.2017.2705085 ◽

2017 ◽

Vol 24 (8) ◽

pp. 1128-1132 ◽

Author(s):

Syed Shahnawazuddin ◽

Rohit Sinha ◽

Gayadhar Pradhan

Keyword(s):

Speech Recognition ◽

Acoustic Features ◽

Children’S Speech Recognition ◽

Children's Speech

Download Full-text

Hindi Speech Recognition System with Robust Front End-Back End Features

International Journal of Computer Applications ◽

10.5120/10601-5305 ◽

2013 ◽

Vol 64 (1) ◽

pp. 42-45

Author(s):

Atul Gairola ◽

Swapna Baadkar

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Download Full-text

Adaptive differential microphone arrays used as a front-end for an automatic speech recognition system

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2015.7178459 ◽

2015 ◽

Author(s):

Elmar Messner ◽

Hannes Pessentheiner ◽

Juan A. Morales-Cordovilla ◽

Martin Hagmuller

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Recognition System ◽

Microphone Arrays ◽

Speech Recognition System ◽

Automatic Speech Recognition System ◽

Download Full-text