Environment-aware ideal binary mask estimation using monaural cues

2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ◽

10.1109/waspaa.2013.6701821 ◽

2013 ◽

Author(s):

Tobias May ◽

Torsten Dau

Keyword(s):

Binary Mask ◽

Ideal Binary Mask ◽

Mask Estimation


Robust automatic speech recognition with decoder oriented ideal binary mask estimation

10.21437/interspeech.2010-583 ◽

2010 ◽

Author(s):

Lae-Hoon Kim ◽

Kyung-Tae Kim ◽

Mark Hasegawa-Johnson

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Binary Mask ◽

Ideal Binary Mask ◽

Mask Estimation


ASR-driven top-down binary mask estimation using spectral priors

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2012.6288964 ◽

2012 ◽

Author(s):

William Hartmann ◽

Eric Fosler-Lussier

Keyword(s):

Binary Mask ◽

Top Down ◽

Mask Estimation


Quality Evaluation of Speech Enhancement Algorithms for Normal and Hearing Loss Listeners

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l2479.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 7-12

Keyword(s):

Mean Square Error ◽

Speech Enhancement ◽

Quality Evaluation ◽

Minimum Mean Square Error ◽

Subjective Quality ◽

Binary Mask ◽

Mean Square ◽

Regional Language ◽

Speech Database ◽

Ideal Binary Mask

A subjective quality test of speech enhanced by different enhancement algorithms is reported for listeners with normal hearing (NH) as well as listeners with hearing impairment (HI). In the literature, subjective quality evaluation of speech enhancement methods mostly targets NH listeners, and fewer attempts have been made to evaluate them subjectively for HI listeners. The algorithms evaluated come from four classes: the spectral subtraction class (SS), the statistical model based class (minimum mean square error), the subspace class (PKLT), and the auditory class (ideal binary mask using STFT, ideal binary mask using a gammatone filterbank, and ideal binary mask using a gammachirp filterbank). The algorithms are evaluated on four types of real-world noise recorded in Indian scenarios, namely cafeteria, traffic, station, and train, at SNRs of -5, 0, 5, and 10 dB. The evaluation follows the ITU-T P.835 standard and rates three parameters: the speech signal alone, the background noise, and overall quality. A noisy speech database developed in the Indian regional language Marathi at these four SNRs is used for the evaluation. The ideal binary mask algorithm shows significant improvement in overall quality and signal distortion ratings for both NH and HI listeners. In some cases, the performance of the minimum mean square error algorithm is comparable with that of the ideal binary mask algorithm.


Dual-microphone based binary mask estimation for robust speaker verification

2012 International Conference on Audio, Language and Image Processing ◽

10.1109/icalip.2012.6376764 ◽

2012 ◽

Author(s):

Yali Zhao ◽

Zhong-Hua Fu ◽

Lei Xie ◽

Jian Zhang ◽

Yanning Zhang

Keyword(s):

Speaker Verification ◽

Binary Mask ◽

Mask Estimation


Interaural coherence induced ideal binary mask for binaural speech separation and dereverberation

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) ◽

10.1109/iscslp.2016.7918416 ◽

2016 ◽

Author(s):

Yi-Ting Chen ◽

Tzu-Hao Chen ◽

Mao-Chang Huang ◽

Tai-Shih Chi

Keyword(s):

Binary Mask ◽

Speech Separation ◽

Ideal Binary Mask


Generalization of supervised learning for binary mask estimation

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) ◽

10.1109/iwaenc.2014.6953357 ◽

2014 ◽

Author(s):

Tobias May ◽

Timo Gerkmann

Keyword(s):

Supervised Learning ◽

Binary Mask ◽

Mask Estimation


The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2012.2213248 ◽

2013 ◽

Vol 21 (1) ◽

pp. 63-72 ◽

Author(s):

N. Madhu ◽

A. Spriet ◽

S. Jansen ◽

R. Koning ◽

J. Wouters

Keyword(s):

Noise Reduction ◽

Speech Intelligibility ◽

Single Channel ◽

Wiener Filter ◽

Channel Noise ◽

Binary Mask ◽

Ideal Binary Mask ◽

Auditory Prostheses


On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis

Speech Separation by Humans and Machines ◽

10.1007/0-387-22794-6_12 ◽

2006 ◽

pp. 181-197 ◽

Author(s):

DeLiang Wang

Keyword(s):

Auditory Scene Analysis ◽

Scene Analysis ◽

Binary Mask ◽

Auditory Scene ◽

Ideal Binary Mask


Integrating Binary Mask Estimation With MRF Priors of Cochleagram for Speech Separation

IEEE Signal Processing Letters ◽

10.1109/lsp.2012.2209643 ◽

2012 ◽

Vol 19 (10) ◽

pp. 627-630 ◽

Author(s):

Shan Liang ◽

Wenju Liu ◽

Wei Jiang

Keyword(s):

Binary Mask ◽

Speech Separation ◽

Mask Estimation


Influence of binary mask estimation errors on robust speaker identification

Speech Communication ◽

10.1016/j.specom.2016.12.002 ◽

2017 ◽

Vol 87 ◽

pp. 40-48 ◽

Author(s):

Tobias May

Keyword(s):

Speaker Identification ◽

Binary Mask ◽

Estimation Errors ◽

Robust Speaker Identification ◽

Mask Estimation
