Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend

Computer Speech & Language ◽

10.1016/j.csl.2017.01.013 ◽

2017 ◽

Vol 46 ◽

pp. 401-418 ◽

Author(s):

Takaaki Hori ◽

Zhuo Chen ◽

Hakan Erdogan ◽

John R. Hershey ◽

Jonathan Le Roux ◽

...

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Robust Feature Extraction

Download Full-text

Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2006.876776 ◽

2007 ◽

Vol 15 (1) ◽

pp. 224-234 ◽

Author(s):

Satya Dharanipragada ◽

Umit H. Yapanel ◽

Bhaskar D. Rao

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Estimation Method ◽

Spectrum Estimation ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Robust Feature Extraction

Download Full-text

Robust feature extraction for speech recognition by enhancing auditory spectrum

10.21437/interspeech.2012-392 ◽

2012 ◽

Author(s):

Md Jahangir Alam ◽

Patrick Kenny ◽

Douglas O'Shaughnessy

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Robust Feature Extraction

Download Full-text

A robust feature extraction based on the MTF concept for speech recognition in reverberant environment

10.21437/interspeech.2006-638 ◽

2006 ◽

Author(s):

Xugang Lu ◽

Masashi Unoki ◽

Masato Akagi

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Robust Feature Extraction ◽

Reverberant Environment

Download Full-text

The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) ◽

10.1109/asru.2015.7404833 ◽

2015 ◽

Author(s):

Takaaki Hori ◽

Zhuo Chen ◽

Hakan Erdogan ◽

John R. Hershey ◽

Jonathan Le Roux ◽

...

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Robust Feature Extraction

Download Full-text

Robust feature extraction methods for speech recognition in noisy environments

2014 First International Conference on Networks & Soft Computing (ICNSC2014) ◽

10.1109/cnsc.2014.6906692 ◽

2014 ◽

Author(s):

Ajinkya Sunil Mukhedkar ◽

John Sahaya Rani Alex

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Extraction Methods ◽

Noisy Environments ◽

Robust Feature Extraction

Download Full-text

Robust Feature Extraction Using Autocorrelation Domain for Noisy Speech Recognition

Signal & Image Processing An International Journal ◽

10.5121/sipij.2017.8103 ◽

2017 ◽

Vol 8 (1) ◽

pp. 23-44

Author(s):

Gholamreza Farahani

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Noisy Speech ◽

Robust Feature Extraction ◽

Noisy Speech Recognition

Download Full-text

Invariant-integration method for robust feature extraction in speaker-independent speech recognition

10.21437/interspeech.2009-753 ◽

2009 ◽

Author(s):

Florian Müller ◽

Alfred Mertins

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Integration Method ◽

Speaker Independent ◽

Robust Feature Extraction ◽

Invariant Integration

Download Full-text

Novel robust feature extraction based on spectrally masked channel energy ratio (SMaChER) for speech recognition

2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). ◽

10.1109/icassp.2003.1202288 ◽

2003 ◽

Author(s):

M.A. Changxue

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Energy Ratio ◽

Robust Feature Extraction

Download Full-text

A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board

Engineering, Technology & Applied Science Research ◽

10.48084/etasr.2533 ◽

2019 ◽

Vol 9 (2) ◽

pp. 4066-4070 ◽

Author(s):

A. Mnassri ◽

M. Bennasr ◽

C. Adnane

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Real Time ◽

Extraction Method ◽

Recognition System ◽

Raspberry Pi ◽

Discrete Wavelet ◽

Speech Recognition System ◽

Feature Extraction Method ◽

Robust Feature Extraction

The development of a real-time automatic speech recognition system (ASR) better adapted to environmental variabilities, such as noisy surroundings, speaker variations and accents has become a high priority. Robustness is required, and it can be performed at the feature extraction stage which avoids the need for other pre-processing steps. In this paper, a new robust feature extraction method for real-time ASR system is presented. A combination of Mel-frequency cepstral coefficients (MFCC) and discrete wavelet transform (DWT) is proposed. This hybrid system can conserve more extracted speech features which tend to be invariant to noise. The main idea is to extract MFCC features by denoising the obtained coefficients in the wavelet domain by using a median filter (MF). The proposed system has been implemented on Raspberry Pi 3 which is a suitable platform for real-time requirements. The experiments showed a high recognition rate (100%) in clean environment and satisfying results (ranging from 80% to 100%) in noisy environments at different signal to noise ratios (SNRs).

Download Full-text

Robust Feature Extraction Based on Teager-Entropy and Half Power Spectrum Estimation for Speech Recognition

Lecture Notes in Computer Science - Multi-disciplinary Trends in Artificial Intelligence ◽

10.1007/978-3-319-26181-2_9 ◽

2015 ◽

pp. 91-101

Author(s):

Jing Dong ◽

Dongsheng Zhou ◽

Qiang Zhang

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Power Spectrum ◽

Spectrum Estimation ◽

Power Spectrum Estimation ◽

Robust Feature Extraction ◽

Download Full-text