Speech enhancement based on perceptually motivated guided spectrogram filtering

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202278 ◽

2021 ◽

pp. 1-12

Author(s):

Jie Wang ◽

Linhuang Yan ◽

Qiaohe Yang ◽

Minmin Yuan

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Speech Intelligibility ◽

Single Channel ◽

Noisy Environments ◽

Human Auditory System ◽

Residual Noise ◽

Guided Filtering ◽

Degraded Image ◽

Linear Transform

In this paper, a single-channel speech enhancement algorithm is proposed by using guided spectrogram filtering based on masking properties of human auditory system when considering a speech spectrogram as an image. Guided filtering is capable of sharpening details and estimating unwanted textures or background noise from the noisy speech spectrogram. If we consider the noisy spectrogram as a degraded image, we can estimate the spectrogram of the clean speech signal using guided filtering after subtracting noise components. Combined with masking properties of human auditory system, the proposed algorithm adaptively adjusts and reduces the residual noise of the enhanced speech spectrogram according to the corresponding masking threshold. Because the filtering output is a local linear transform of the guidance spectrogram, the local mask window slides can be efficiently implemented via box filter with O(N) computational complexity. Experimental results show that the proposed algorithm can effectively suppress noise in different noisy environments and thus can greatly improve speech quality and speech intelligibility.

Get full-text (via PubEx)

Single channel speech enhancement based on masking properties of the human auditory system

IEEE Transactions on Speech and Audio Processing ◽

10.1109/89.748118 ◽

1999 ◽

Vol 7 (2) ◽

pp. 126-137 ◽

Cited By ~ 367

Author(s):

N. Virag

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Single Channel ◽

Human Auditory System

Get full-text (via PubEx)

A novel single-channel speech enhancement in noisy environments

4th International Conference on Intelligent Environments (IE 08) ◽

10.1049/cp:20081182 ◽

2008 ◽

Author(s):

Gin-Der Wu

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Noisy Environments

Get full-text (via PubEx)

Speech intelligibility enhancement for Thai-speaking cochlear implant listeners

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v13.i3.pp866-875 ◽

2019 ◽

Vol 13 (3) ◽

pp. 866

Author(s):

Siriporn Dachasilaruk ◽

Niphat Jantharamin ◽

Apichai Rungruang

Keyword(s):

Cochlear Implant ◽

Speech Enhancement ◽

Speech Intelligibility ◽

English Language ◽

Single Channel ◽

Spectral Subtraction ◽

Monosyllabic Words ◽

Listening Environments ◽

Babble Noise ◽

Vocoded Speech

Cochlear implant (CI) listeners encounter difficulties in communicating with other persons in noisy listening environments. However, most CI research has been carried out using the English language. In this study, single-channel speech enhancement (SE) strategies as a pre-processing approach for the CI system were investigated in terms of Thai speech intelligibility improvement. Two SE algorithms, namely multi-band spectral subtraction (MBSS) and Weiner filter (WF) algorithms, were evaluated. Speech signals consisting of monosyllabic and bisyllabic Thai words were degraded by speech-shaped noise and babble noise at SNR levels of 0, 5, and 10 dB. Then the noisy words were enhanced using SE algorithms. The enhanced words were fed into the CI system to synthesize vocoded speech. The vocoded speech was presented to twenty normal-hearing listeners. The results indicated that speech intelligibility was marginally improved by the MBSS algorithm and significantly improved by the WF algorithm in some conditions. The enhanced bisyllabic words showed a noticeably higher intelligibility improvement than the enhanced monosyllabic words in all conditions, particularly in speech-shaped noise. Such outcomes may be beneficial to Thai-speaking CI listeners.

Get full-text (via PubEx)

Improved Spectral Subtraction Speech Enhancement Algorithm

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.760-762.536 ◽

2013 ◽

Vol 760-762 ◽

pp. 536-541 ◽

Cited By ~ 2

Author(s):

Yu Hong Liu ◽

Dong Mei Zhou ◽

Zhan Jun Jiang

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Critical Frequency ◽

Spectral Subtraction ◽

Human Auditory System ◽

Speech Distortion ◽

Simulation Results ◽

Enhancement Algorithm ◽

Output Snr

The paper addresses the problems of speech distortion and residual musical noise introduced by conventional spectral subtraction (SS) method for speech enhancement. In this paper, we propose a modified SS algorithm for speech enhancement based on the masking properties of human auditory system. The algorithm computes the parameters α and β dynamically according to the masking thresholds of the critical frequency segments for each speech frame. Simulation results show that the proposed algorithm is superior to the conventional SS method, not only in the improvement of output SNR, but in the reduction of the speech distortion and residual musical noise.

Get full-text (via PubEx)

Single channel speech enhancement based on harmonic estimation combined with statistical based method to improve speech intelligibility for cochlear implant recipients

The Journal of the Acoustical Society of America ◽

10.1121/1.4989114 ◽

2017 ◽

Vol 141 (5) ◽

pp. 3985-3986

Author(s):

Dongmei Wang ◽

John H. L. Hansen

Keyword(s):

Cochlear Implant ◽

Speech Enhancement ◽

Speech Intelligibility ◽

Single Channel

Get full-text (via PubEx)

A modified spectral subtraction method for speech enhancement based on masking property of human auditory system

2009 International Conference on Wireless Communications & Signal Processing ◽

10.1109/wcsp.2009.5371466 ◽

2009 ◽

Cited By ~ 2

Author(s):

Bing-yin Xia ◽

Yan Liang ◽

Chang-chun Bao

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Spectral Subtraction ◽

Subtraction Method ◽

Human Auditory System ◽

Spectral Subtraction Method

Get full-text (via PubEx)

A single channel speech enhancement technique exploiting human auditory masking properties

Advances in Radio Science ◽

10.5194/ars-8-95-2010 ◽

2010 ◽

Vol 8 ◽

pp. 95-99

Author(s):

F. X. Nsabimana ◽

V. Subbaraman ◽

U. Zölzer

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Noise Power ◽

Auditory Masking ◽

Enhancement Technique ◽

Time Frequency ◽

Residual Noise ◽

Dependent Parameter ◽

A New Technique ◽

Spectral Weighting

Abstract. To enhance extreme corrupted speech signals, an Improved Psychoacoustically Motivated Spectral Weighting Rule (IPMSWR) is proposed, that controls the predefined residual noise level by a time-frequency dependent parameter. Unlike conventional Psychoacoustically Motivated Spectral Weighting Rules (PMSWR), the level of the residual noise is here varied throughout the enhanced speech based on the discrimination between the regions with speech presence and speech absence by means of segmental SNR within critical bands. Controlling in such a way the level of the residual noise in the noise only region avoids the unpleasant residual noise perceived at very low SNRs. To derive the gain coefficients, the computation of the masking curve and the estimation of the corrupting noise power are required. Since the clean speech is generally not available for a single channel speech enhancement technique, the rough clean speech components needed to compute the masking curve are here obtained using advanced spectral subtraction techniques. To estimate the corrupting noise, a new technique is employed, that relies on the noise power estimation using rapid adaptation and recursive smoothing principles. The performances of the proposed approach are objectively and subjectively compared to the conventional approaches to highlight the aforementioned improvement.

Get full-text (via PubEx)

DCT speech enhancement based on masking properties of human auditory system

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS ◽

10.1109/icosp.2010.5655513 ◽

2010 ◽

Cited By ~ 3

Author(s):

Li Yang ◽

Li Shuangtian

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Human Auditory System

Get full-text (via PubEx)

On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms

International Journal of Interactive Multimedia and Artificial Intelligence ◽

10.9781/ijimai.2019.12.001 ◽

2020 ◽

Vol 6 (2) ◽

pp. 12

Author(s):

Elena Verdú ◽

Nasir Saleem ◽

Muhammad Irfan Khattak

Keyword(s):

Speech Enhancement ◽

Speech Intelligibility ◽

Single Channel

Get full-text (via PubEx)

Perceptual speech enhancement exploiting temporal masking properties of human auditory system

Speech Communication ◽

10.1016/j.specom.2009.12.006 ◽

2010 ◽

Vol 52 (5) ◽

pp. 381-393 ◽

Cited By ~ 17

Author(s):

Teddy Surya Gunawan ◽

Eliathamby Ambikairajah ◽

Julien Epps

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Human Auditory System ◽

Temporal Masking

Get full-text (via PubEx)