Single channel speech enhancement based on harmonic estimation combined with statistical based method to improve speech intelligibility for cochlear implant recipients

Cochlear implant (CI) listeners encounter difficulties in communicating with other persons in noisy listening environments. However, most CI research has been carried out using the English language. In this study, single-channel speech enhancement (SE) strategies as a pre-processing approach for the CI system were investigated in terms of Thai speech intelligibility improvement. Two SE algorithms, namely multi-band spectral subtraction (MBSS) and Weiner filter (WF) algorithms, were evaluated. Speech signals consisting of monosyllabic and bisyllabic Thai words were degraded by speech-shaped noise and babble noise at SNR levels of 0, 5, and 10 dB. Then the noisy words were enhanced using SE algorithms. The enhanced words were fed into the CI system to synthesize vocoded speech. The vocoded speech was presented to twenty normal-hearing listeners. The results indicated that speech intelligibility was marginally improved by the MBSS algorithm and significantly improved by the WF algorithm in some conditions. The enhanced bisyllabic words showed a noticeably higher intelligibility improvement than the enhanced monosyllabic words in all conditions, particularly in speech-shaped noise. Such outcomes may be beneficial to Thai-speaking CI listeners.

Download Full-text

Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

Hearing Research ◽

10.1016/j.heares.2016.11.012 ◽

2017 ◽

Vol 344 ◽

pp. 183-194 ◽

Cited By ~ 35

Author(s):

Tobias Goehring ◽

Federico Bolner ◽

Jessica J.M. Monaghan ◽

Bas van Dijk ◽

Andrzej Zarowski ◽

...

Keyword(s):

Neural Networks ◽

Cochlear Implant ◽

Speech Enhancement ◽

Speech Intelligibility

Download Full-text

Speech enhancement based on perceptually motivated guided spectrogram filtering

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202278 ◽

2021 ◽

pp. 1-12

Author(s):

Jie Wang ◽

Linhuang Yan ◽

Qiaohe Yang ◽

Minmin Yuan

Keyword(s):

Auditory System ◽

Speech Enhancement ◽

Speech Intelligibility ◽

Single Channel ◽

Noisy Environments ◽

Human Auditory System ◽

Residual Noise ◽

Guided Filtering ◽

Degraded Image ◽

Linear Transform

In this paper, a single-channel speech enhancement algorithm is proposed by using guided spectrogram filtering based on masking properties of human auditory system when considering a speech spectrogram as an image. Guided filtering is capable of sharpening details and estimating unwanted textures or background noise from the noisy speech spectrogram. If we consider the noisy spectrogram as a degraded image, we can estimate the spectrogram of the clean speech signal using guided filtering after subtracting noise components. Combined with masking properties of human auditory system, the proposed algorithm adaptively adjusts and reduces the residual noise of the enhanced speech spectrogram according to the corresponding masking threshold. Because the filtering output is a local linear transform of the guidance spectrogram, the local mask window slides can be efficiently implemented via box filter with O(N) computational complexity. Experimental results show that the proposed algorithm can effectively suppress noise in different noisy environments and thus can greatly improve speech quality and speech intelligibility.

Download Full-text

On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms

International Journal of Interactive Multimedia and Artificial Intelligence ◽

10.9781/ijimai.2019.12.001 ◽

2020 ◽

Vol 6 (2) ◽

pp. 12

Author(s):

Elena Verdú ◽

Nasir Saleem ◽

Muhammad Irfan Khattak

Keyword(s):

Speech Enhancement ◽

Speech Intelligibility ◽

Single Channel

Download Full-text

Speech Enhancement Based on Harmonic Estimation Combined with MMSE to Improve Speech Intelligibility for Cochlear Implant Recipients

10.21437/interspeech.2017-78 ◽

2017 ◽

Cited By ~ 4

Author(s):

Dongmei Wang ◽

John H.L. Hansen

Keyword(s):

Cochlear Implant ◽

Speech Enhancement ◽

Speech Intelligibility

Download Full-text

On speech intelligibility estimation of phase-aware single-channel speech enhancement

10.21437/interspeech.2015-550 ◽

2015 ◽

Author(s):

Andreas Gaich ◽

Pejman Mowlaee

Keyword(s):

Speech Enhancement ◽

Speech Intelligibility ◽

Single Channel

Download Full-text

Reading and speech intelligibility of a child with auditory impairment and cochlear implant.

Psychology & Neuroscience ◽

10.1037/pne0000139 ◽

2018 ◽

Vol 11 (3) ◽

pp. 306-316 ◽

Cited By ~ 2

Author(s):

Fernando Del Mando Lucchesi ◽

Ana Claudia Moreira Almeida-Verdu ◽

Deisy das Graças de Souza

Keyword(s):

Cochlear Implant ◽

Speech Intelligibility ◽

Auditory Impairment

Download Full-text

Adaptive Single-Channel Speech Enhancement Method for a Push-To-Talk Enabled Wireless Communication Device

IEICE Transactions on Communications ◽

10.1587/transcom.2015ccp0023 ◽

2016 ◽

Vol E99.B (8) ◽

pp. 1745-1753

Author(s):

Hyoung-Gook KIM ◽

Jin Young KIM

Keyword(s):

Wireless Communication ◽

Speech Enhancement ◽

Single Channel ◽

Communication Device ◽

Enhancement Method

Download Full-text

Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement

10.21437/interspeech.2018-1439 ◽

2018 ◽

Author(s):

Li Chai ◽

Jun Du ◽

Chin-Hui Lee

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Single Channel ◽

Laplace Distribution ◽

Error Modeling ◽

Asymmetric Laplace Distribution

Download Full-text

Improved Environment-Aware–Based Noise Reduction System for Cochlear Implant Users Based on Knowledge Transfer Approach (Preprint)

10.2196/preprints.25460 ◽

2020 ◽

Author(s):

Lieber Po-Hung Li ◽

Ji-Yan Han ◽

Wei-Zhong Zheng ◽

Ren-Jie Huang ◽

Ying-Hui Lai

Keyword(s):

Knowledge Transfer ◽

Cochlear Implant ◽

Noise Reduction ◽

Speech Intelligibility ◽

Middle Layer ◽

Successful Implementation ◽

Perceptual Evaluation ◽

Reduction System ◽

Listening Tests ◽

Implementation Costs

BACKGROUND The cochlear implant technology is a well-known approach to help deaf patients hear speech again. It can improve speech intelligibility in quiet conditions; however, it still has room for improvement in noisy conditions. More recently, it has been proven that deep learning–based noise reduction (NR), such as noise classification and deep denoising autoencoder (NC+DDAE), can benefit the intelligibility performance of patients with cochlear implants compared to classical noise reduction algorithms. OBJECTIVE Following the successful implementation of the NC+DDAE model in our previous study, this study aimed to (1) propose an advanced noise reduction system using knowledge transfer technology, called NC+DDAE_T, (2) examine the proposed NC+DDAE_T noise reduction system using objective evaluations and subjective listening tests, and (3) investigate which layer substitution of the knowledge transfer technology in the NC+DDAE_T noise reduction system provides the best outcome. METHODS The knowledge transfer technology was adopted to reduce the number of parameters of the NC+DDAE_T compared with the NC+DDAE. We investigated which layer should be substituted using short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) scores, as well as t-distributed stochastic neighbor embedding to visualize the features in each model layer. Moreover, we enrolled ten cochlear implant users for listening tests to evaluate the benefits of the newly developed NC+DDAE_T. RESULTS The experimental results showed that substituting the middle layer (ie, the second layer in this study) of the noise-independent DDAE (NI-DDAE) model achieved the best performance gain regarding STOI and PESQ scores. Therefore, the parameters of layer three in the NI-DDAE were chosen to be replaced, thereby establishing the NC+DDAE_T. Both objective and listening test results showed that the proposed NC+DDAE_T noise reduction system achieved similar performances compared with the previous NC+DDAE in several noisy test conditions. However, the proposed NC+DDAE_T only needs a quarter of the number of parameters compared to the NC+DDAE. CONCLUSIONS This study demonstrated that knowledge transfer technology can help to reduce the number of parameters in an NC+DDAE while keeping similar performance rates. This suggests that the proposed NC+DDAE_T model may reduce the implementation costs of this noise reduction system and provide more benefits for cochlear implant users.

Download Full-text