Mask estimation incorporating phase-sensitive information for speech enhancement

Applied Acoustics ◽

10.1016/j.apacoust.2019.07.009 ◽

2019 ◽

Vol 156 ◽

pp. 101-112

Author(s):

Xianyun Wang ◽

Changchun Bao

Keyword(s):

Speech Enhancement ◽

Sensitive Information ◽

Phase Sensitive ◽

Mask Estimation

Speech Enhancement With Phase Sensitive Mask Estimation Using a Novel Hybrid Neural Network

IEEE Open Journal of Signal Processing ◽

10.1109/ojsp.2021.3067147 ◽

2021 ◽

Vol 2 ◽

pp. 136-150

Author(s):

Mojtaba Hasannezhad ◽

Zhiheng Ouyang ◽

Wei-Ping Zhu ◽

Benoit Champagne

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Hybrid Neural Network ◽

Phase Sensitive ◽

Mask Estimation

Phase-sensitive speech enhancement for cochlear implant processing

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2011.5947505 ◽

2011

Author(s):

Pourya S. Jafari ◽

Hou-Yong Kang ◽

Xiaosong Wang ◽

Qian-Jie Fu ◽

Hui Jiang

Keyword(s):

Cochlear Implant ◽

Speech Enhancement ◽

Phase Sensitive

Unsupervised single-channel speech enhancement based on phase aware time-frequency mask estimation

Applied Speech Processing ◽

10.1016/b978-01-2-823898-1.00006-0 ◽

2021 ◽

pp. 75-99

Author(s):

Nasir Saleem ◽

Muhammad Irfan Khattak

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Time Frequency ◽

Mask Estimation

Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR

2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ◽

10.23919/apsipa.2018.8659564 ◽

2018

Author(s):

Yan-Hui Tu ◽

Jun Du ◽

Nan Zhou ◽

Chin-Hui Lee

Keyword(s):

Speech Enhancement ◽

Mask Estimation

DNN-based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9054643 ◽

2020

Author(s):

Nicolas Furnon ◽

Romain Serizel ◽

Irina Illina ◽

Slim Essid

Keyword(s):

Speech Enhancement ◽

Microphone Arrays ◽

Mask Estimation

Auditory mask estimation by RPCA for monaural speech enhancement

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS) ◽

10.1109/icis.2017.7959990 ◽

2017

Author(s):

Wenhua Shi ◽

Xiongwei Zhang ◽

Xia Zou ◽

Wei Han ◽

Gang Min

Keyword(s):

Speech Enhancement ◽

Mask Estimation

Speech Enhancement Algorithm of Binary Mask Estimation Based on a Priori SNR Constraints

2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ◽

10.23919/apsipa.2018.8659475 ◽

2018

Author(s):

Jie Wang ◽

Chengcheng Yang ◽

Linhuang Yan ◽

Manlu Huang ◽

Jinqiu Sang

Keyword(s):

Speech Enhancement ◽

A Priori ◽

Binary Mask ◽

Mask Estimation ◽

Enhancement Algorithm

Variance based time-frequency mask estimation for unsupervised speech enhancement

Multimedia Tools and Applications ◽

10.1007/s11042-019-08032-y ◽

2019 ◽

Vol 78 (22) ◽

pp. 31867-31891

Author(s):

Nasir Saleem ◽

Muhammad Irfan Khattak ◽

Gunawan Witjaksono ◽

Gulzar Ahmad

Keyword(s):

Speech Enhancement ◽

Time Frequency ◽

Mask Estimation

Phase-Sensitive Decision-Directed SNR Estimator for Single-Channel Speech Enhancement

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001417580034 ◽

2017 ◽

Vol 31 (08) ◽

pp. 1758003

Author(s):

Shifeng Ou ◽

Peng Song ◽

Ying Gao

Keyword(s):

Speech Enhancement ◽

Speech Processing ◽

Single Channel ◽

Signal To Noise Ratio ◽

A Priori ◽

Processing System ◽

Phase Information ◽

Amplitude Spectra ◽

Phase Sensitive

The a priori signal-to-noise ratio (SNR) plays an essential role in many speech enhancement systems. Most existing approaches to estimating the a priori SNR exploit only the amplitude spectra and neglect the phase. Because incorporating phase information into a speech processing system can significantly improve speech quality, this paper proposes a phase-sensitive decision-directed (DD) approach for estimating the a priori SNR. By representing the short-time discrete Fourier transform (STFT) signal spectra geometrically in the complex plane, the proposed approach estimates the a priori SNR from both magnitude and phase information, making no assumptions about the phase difference between the clean speech and noise spectra. Objective evaluations using spectrograms, segmental SNR, log-spectral distance (LSD), and short-time objective intelligibility (STOI) measures demonstrate the superiority of the proposed approach over several competitive methods under different noise conditions and input SNR levels.
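
For orientation, the classic magnitude-only decision-directed estimator that the abstract contrasts against can be sketched as follows. This is not the phase-sensitive method proposed in the paper, whose details the abstract does not give. The function name, the smoothing factor alpha = 0.98, the floor xi_min, the first-frame initialization, and the use of a Wiener gain to form the previous clean-speech estimate are all assumptions made for illustration.

```python
import numpy as np

def decision_directed_snr(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Magnitude-only decision-directed a priori SNR estimate (illustrative sketch).

    noisy_power: array of shape (frames, bins), the noisy periodogram |Y|^2.
    noise_power: array of shape (bins,), an estimate of the noise power per bin.
    alpha, xi_min: assumed smoothing factor and SNR floor (hypothetical defaults).
    """
    n_frames, n_bins = noisy_power.shape
    xi = np.empty((n_frames, n_bins), dtype=float)
    prev_clean_power = np.zeros(n_bins)

    for l in range(n_frames):
        # A posteriori SNR and its half-wave rectified (maximum-likelihood) term.
        gamma = noisy_power[l] / noise_power
        ml_term = np.maximum(gamma - 1.0, 0.0)

        if l == 0:
            # Assumed initialization: no previous frame, so use the ML term alone.
            xi_l = ml_term
        else:
            # Weighted sum of the previous frame's clean-speech estimate
            # and the current frame's ML term.
            xi_l = alpha * prev_clean_power / noise_power + (1.0 - alpha) * ml_term

        xi_l = np.maximum(xi_l, xi_min)

        # Wiener gain (an assumed choice) gives the clean-speech power estimate
        # that feeds the next frame.
        gain = xi_l / (1.0 + xi_l)
        prev_clean_power = (gain ** 2) * noisy_power[l]
        xi[l] = xi_l

    return xi
```

Only power spectra enter this estimate; the phase of the noisy spectrum is never used, which is the limitation the abstract sets out to address.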

Eigenvector-Based Speech Mask Estimation for Multi-Channel Speech Enhancement

IEEE/ACM Transactions on Audio, Speech, and Language Processing ◽

10.1109/taslp.2019.2941592 ◽

2019 ◽

Vol 27 (12) ◽

pp. 2162-2172

Author(s):

Lukas Pfeifenberger ◽

Matthias Zöhrer ◽

Franz Pernkopf

Keyword(s):

Speech Enhancement ◽

Mask Estimation
