Towards a unified optimal spectral amplitude estimator for speech enhancement in various low-SNR environments

Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No.04CH37513) ◽

10.1109/ccece.2004.1344946 ◽

2004 ◽

Author(s):

H. Tolba ◽

Zili Li ◽

D. O'Shaughnessy

Keyword(s):

Speech Enhancement ◽

Spectral Amplitude ◽

Download Full-text

Multiresolution Cochleagram Speech Enhancement Algorithm Using Improved Deep Neural Networks with Skip Connections

10.21203/rs.3.rs-229829/v1 ◽

2021 ◽

Author(s):

chaofeng lan ◽

Chundong Liu ◽

Lei Zhang

Keyword(s):

Speech Enhancement ◽

Loss Function ◽

Minimum Mean Square Error ◽

Spectral Amplitude ◽

Enhancement Effect ◽

Low Snr ◽

Input Feature ◽

Four Levels ◽

Abstract Deep learning based methods have been a recent benchmark method for speech enhancement. However, these approaches are limited in low signal-to-noise ratios (SNR) conditions, for speech loss and low intelligibility. To address this problem, we improve Multi-Resolution Cochleagram (MRCG), and gammachirp filter bank is used to decompose the speech signal in time and frequency, and the low-resolution signal is denoised by the minimum mean-square error short-time spectral amplitude estimator (MMSE-STSA). Improve Multi-Resolution Cochleagram (I-MRCG) is adopted as the input feature of Skip connections-DNN (Skip-DNN). In this paper, the source to distortion ratio (SDR) is used in the training process, and the logarithm is introduced to observe the iterative process more clearly. Experiments were performed on the TIMIT database with four noise types at four levels of SNR. I-MRCG as the input feature of the Skip-DNN model, the average PESQ is 2.6783, and the average STOI is 0.8752. Compared with MRCG, the PESQ and STOI obtained by MRCG are increased 1.4% and 1.5%, respectively. This shows that MRCG is the input feature of the Skip-DNN model, and the speech enhancement effect after training is better than other features. It can not only solve the problem of speech loss in a low SNR environment, but also obtain more robust speech enhancement. The loss function experiment shows that compared to MSE and SDR, the improved SDR as the loss function of the speech enhancement model has the best enhancement effect.

Download Full-text

Robust automatic speech recognition using a perceptually-based optimal spectral amplitude estimator speech enhancement algorithm in various low-SNR environments

10.21437/interspeech.2005-223 ◽

2005 ◽

Author(s):

Hesham Tolba ◽

Zili Li ◽

Douglas O'Shaughnessy

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Automatic Speech Recognition ◽

Spectral Amplitude ◽

Low Snr ◽

Enhancement Algorithm

Download Full-text

On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2018.2877909 ◽

2019 ◽

Vol 27 (2) ◽

pp. 283-295 ◽

Author(s):

Morten Kolbaek ◽

Zheng-Hua Tan ◽

Jesper Jensen

Keyword(s):

Mean Square Error ◽

Speech Enhancement ◽

Spectral Amplitude ◽

Mean Square ◽

The Relationship

Download Full-text

Distributed multichannel speech enhancement based on perceptually‐motivated Bayesian estimators of the spectral amplitude

IET Signal Processing ◽

10.1049/iet-spr.2012.0167 ◽

2013 ◽

Vol 7 (4) ◽

pp. 337-344 ◽

Author(s):

Marek B. Trawicki ◽

Michael T. Johnson

Keyword(s):

Speech Enhancement ◽

Spectral Amplitude ◽

Bayesian Estimators

Download Full-text

Unsupervised speech enhancement in low SNR environments via sparseness and temporal gradient regularization

Applied Acoustics ◽

10.1016/j.apacoust.2018.07.027 ◽

2018 ◽

Vol 141 ◽

pp. 333-347 ◽

Author(s):

Nasir Saleem ◽

Muhammad Irfan Khattak ◽

Muhammad Shafi

Keyword(s):

Speech Enhancement ◽

Temporal Gradient ◽

Download Full-text

Low SNR speech enhancement with DNN based phase estimation

International Journal of Speech Technology ◽

10.1007/s10772-019-09603-y ◽

2019 ◽

Vol 22 (1) ◽

pp. 283-292 ◽

Author(s):

Samba Raju Chiluveru ◽

Manoj Tripathy

Keyword(s):

Speech Enhancement ◽

Phase Estimation ◽

Download Full-text

Low-SNR Speech Enhancement and Separation in Driving Environment

2019 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW) ◽

10.1109/icce-tw46550.2019.8991797 ◽

2019 ◽

Author(s):

Jie Wei ◽

Lingling Li ◽

Xiaofeng Zhong

Keyword(s):

Speech Enhancement ◽

Download Full-text

Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments

Latent Variable Analysis and Signal Separation - Lecture Notes in Computer Science ◽

10.1007/978-3-319-22482-4_9 ◽

2015 ◽

pp. 75-82 ◽

Author(s):

Tian Gao ◽

Jun Du ◽

Yong Xu ◽

Cong Liu ◽

Li-Rong Dai ◽

...

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Download Full-text

Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2010.2047681 ◽

2011 ◽

Vol 19 (2) ◽

pp. 277-289 ◽

Author(s):

Colin Breithaupt ◽

Rainer Martin

Keyword(s):

Speech Enhancement ◽

Transient Conditions ◽

Download Full-text

Multiple modules speech enhancement in mixed noise and low SNR environments

Eleventh International Conference on Signal Processing Systems ◽

10.1117/12.2559657 ◽

2019 ◽

Author(s):

Tian Lan ◽

wenzheng ye ◽

Guoqiang Hui ◽

Sen Li ◽

Qiao Liu

Keyword(s):

Speech Enhancement ◽

Low Snr ◽

Download Full-text