Reference Channel Input-Based Speech Enhancement for Noise-Robust Recognition in Intelligent TV Applications

The Journal of the Korean Institute of Information and Communication Engineering ◽

10.6109/jkiice.2013.17.2.280 ◽

2013 ◽

Vol 17 (2) ◽

pp. 280-286

Author(s):

Sangbae Jeong

Keyword(s):

Speech Enhancement ◽

Reference Channel ◽

Channel Input ◽

Robust Recognition ◽

Speech Enhancement Based on Teacher–Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2019.2940662 ◽

2019 ◽

Vol 27 (12) ◽

pp. 2080-2091 ◽

Author(s):

Yan-Hui Tu ◽

Jun Du ◽

Chin-Hui Lee

Keyword(s):

Deep Learning ◽

Speech Recognition ◽

Speech Enhancement ◽

Robust Speech Recognition ◽

Teacher Student ◽

Noise Robust Speech Recognition ◽

A binaural speech processing method using subband-cross correlation analysis for noise robust recognition

1997 IEEE International Conference on Acoustics, Speech, and Signal Processing ◽

10.1109/icassp.1997.596170 ◽

2002 ◽

Author(s):

S. Kajita ◽

K. Takeda ◽

F. Itakura

Keyword(s):

Correlation Analysis ◽

Speech Processing ◽

Cross Correlation ◽

Processing Method ◽

Cross Correlation Analysis ◽

Robust Recognition ◽

Speech enhancement for noise‐robust speech recognition.

The Journal of the Acoustical Society of America ◽

10.1121/1.4783141 ◽

2008 ◽

Vol 124 (4) ◽

pp. 2577-2577

Author(s):

Vikramjit Mitra ◽

Carol Espy‐Wilson

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Robust Speech Recognition ◽

Noise Robust Speech Recognition ◽

Combination of GMM-based speech estimation method and temporal domain SVD-based speech enhancement for noise robust speech recognition

Systems and Computers in Japan ◽

10.1002/scj.20487 ◽

2007 ◽

Vol 38 (3) ◽

pp. 23-38

Author(s):

Masakiyo Fujimoto ◽

Yasuo Ariki

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Estimation Method ◽

Robust Speech Recognition ◽

Temporal Domain ◽

Noise Robust Speech Recognition ◽

Noise-robust recognition of objects by humans and deep neural networks

10.1101/2020.08.03.234625 ◽

2020 ◽

Author(s):

Hojin Jang ◽

Devin McCormack ◽

Frank Tong

Keyword(s):

Neural Networks ◽

Visual Processing ◽

Deep Neural Networks ◽

Signal To Noise Ratio ◽

Human Vision ◽

Training Procedure ◽

Robust Recognition ◽

Recognition Of Objects ◽

Noise Robust ◽

Level Performance

Abstract: Deep neural networks (DNNs) can accurately recognize objects in clear viewing conditions, leading to claims that they have attained or surpassed human-level performance. However, standard DNNs are severely impaired at recognizing objects in visual noise, whereas human vision remains robust. We developed a noise-training procedure, generating noisy images of objects with low signal-to-noise ratio, to investigate whether DNNs can acquire robustness that better matches human vision. After noise training, DNNs outperformed human observers while exhibiting more similar patterns of performance, and provided a better model for predicting human recognition thresholds on an image-by-image basis. Noise training also improved DNN recognition of vehicles in noisy weather. Layer-specific analyses revealed that the contaminating effects of noise were dampened, rather than amplified, across successive stages of the noise-trained network, with greater benefit at higher levels of the network. Our findings indicate that DNNs can learn noise-robust representations that better approximate human visual processing.

Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks

10.21437/interspeech.2016-159 ◽

2016 ◽

Author(s):

Cassia Valentini-Botinhao ◽

Xin Wang ◽

Shinji Takaki ◽

Junichi Yamagishi

Keyword(s):

Neural Networks ◽

Speech Enhancement ◽

Recurrent Neural Networks ◽

Speech Synthesis ◽

Text To Speech ◽

Synthesis System ◽

Text To Speech Synthesis ◽

A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition

EURASIP Journal on Advances in Signal Processing ◽

10.1155/2007/45821 ◽

2006 ◽

Vol 2007 (1) ◽

Author(s):

Kris Hermus ◽

Patrick Wambacq ◽

Hugo Van hamme

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Robust Speech Recognition ◽

Signal Subspace ◽

Noise Robust Speech Recognition ◽

Noise robust exemplar matching for speech enhancement: applications to automatic speech recognition

10.21437/interspeech.2015-241 ◽

2015 ◽

Author(s):

Emre Yılmaz ◽

Deepak Baby ◽

Hugo Van hamme

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Automatic Speech Recognition ◽

Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech

10.21437/ssw.2016-24 ◽

2016 ◽

Author(s):

Cassia Valentini-Botinhao ◽

Xin Wang ◽

Shinji Takaki ◽

Junichi Yamagishi

Keyword(s):

Speech Enhancement ◽

Text To Speech ◽

A noise-type and level-dependent MPO-based speech enhancement architecture with variable frame analysis for noise-robust speech recognition

10.21437/interspeech.2009-703 ◽

2009 ◽

Author(s):

Vikramjit Mitra ◽

Bengt J. Borgstrom ◽

Carol Y. Espy-Wilson ◽

Abeer Alwan

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Frame Analysis ◽

Robust Speech Recognition ◽

Noise Robust Speech Recognition ◽
