Mask estimation incorporating phase-sensitive information for speech enhancement

2019 ◽  
Vol 156 ◽  
pp. 101-112 ◽  
Author(s):  
Xianyun Wang ◽  
Changchun Bao
2021 ◽  
Vol 2 ◽  
pp. 136-150
Author(s):  
Mojtaba Hasannezhad ◽  
Zhiheng Ouyang ◽  
Wei-Ping Zhu ◽  
Benoit Champagne

2019 ◽  
Vol 78 (22) ◽  
pp. 31867-31891 ◽  
Author(s):  
Nasir Saleem ◽  
Muhammad Irfan Khattak ◽  
Gunawan Witjaksono ◽  
Gulzar Ahmad

Author(s):  
Shifeng Ou ◽  
Peng Song ◽  
Ying Gao

The a priori signal-to-noise ratio (SNR) plays an essential role in many speech enhancement systems. Most of the existing approaches to estimate the a priori SNR only exploit the amplitude spectra while making the phase neglected. Considering the fact that incorporating phase information into a speech processing system can significantly improve the speech quality, this paper proposes a phase-sensitive decision-directed (DD) approach for the a priori SNR estimate. By representing the short-time discrete Fourier transform (STFT) signal spectra geometrically in a complex plane, the proposed approach estimates the a priori SNR using both the magnitude and phase information while making no assumptions about the phase difference between clean speech and noise spectra. Objective evaluations in terms of the spectrograms, segmental SNR, log-spectral distance (LSD) and short-time objective intelligibility (STOI) measures are presented to demonstrate the superiority of the proposed approach compared to several competitive methods at different noise conditions and input SNR levels.


2019 ◽  
Vol 27 (12) ◽  
pp. 2162-2172
Author(s):  
Lukas Pfeifenberger ◽  
Matthias Zohrer ◽  
Franz Pernkopf

Sign in / Sign up

Export Citation Format

Share Document