Post-filter design for speech enhancement in various noisy environments

This article describes how to use heterogeneous information in speech enhancement. In most of the current speech enhancement systems, clean speeches are recovered only from the signals collected by acoustic microphones, which will be greatly affected by the acoustic noises. However, heterogeneous information from different kinds of sensors, which is usually called the “multi-stream,” are seldom used in speech enhancement because the speech waveforms cannot be recovered from the signals provided by many kinds of sensors. In this article, the authors propose a new model-based multi-stream speech enhancement framework that can make use of the heterogeneous information provided by the signals from different kinds of sensors even when some of them are not directly related to the speech waveform. Then a new speech enhancement scheme using the acoustic and throat microphone recordings is also proposed based on the new speech enhancement framework. Experimental results show that the proposed scheme outperforms several single-stream speech enhancement methods in different noisy environments.

Download Full-text

A two‐stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments

The Journal of the Acoustical Society of America ◽

10.1121/1.2932611 ◽

2008 ◽

Vol 123 (5) ◽

pp. 3012-3012 ◽

Cited By ~ 2

Author(s):

Junfeng Li ◽

Shuichi Sakamoto ◽

Satoshi Hongo ◽

Masato Akagi ◽

Yôiti Suzuki

Keyword(s):

Speech Enhancement ◽

Hearing Aids ◽

Noisy Environments ◽

Two Stage

Download Full-text

Spatial post-filter estimation for speech enhancement in the specific area using a pair of microphone arrays

The Journal of the Acoustical Society of America ◽

10.1121/1.4970797 ◽

2016 ◽

Vol 140 (4) ◽

pp. 3376-3376

Author(s):

Takuto Yoshimizu ◽

Akitoshi Kataoka

Keyword(s):

Speech Enhancement ◽

Specific Area ◽

Microphone Arrays ◽

Post Filter

Download Full-text

Environmental Attention-Guided Branchy Neural Network for Speech Enhancement

Applied Sciences ◽

10.3390/app10031167 ◽

2020 ◽

Vol 10 (3) ◽

pp. 1167 ◽

Cited By ~ 1

Author(s):

Lu Zhang ◽

Mingjiang Wang ◽

Qiquan Zhang ◽

Ming Liu

Keyword(s):

Neural Network ◽

Noise Reduction ◽

Speech Enhancement ◽

Deep Neural Network ◽

Experimental Results ◽

Noisy Environments ◽

Neural Structure ◽

Noise Interference ◽

Noise Type ◽

Speech Reconstruction

The performance of speech enhancement algorithms can be further improved by considering the application scenarios of speech products. In this paper, we propose an attention-based branchy neural network framework by incorporating the prior environmental information for noise reduction. In the whole denoising framework, first, an environment classification network is trained to distinguish the noise type of each noisy speech frame. Guided by this classification network, the denoising network gradually learns respective noise reduction abilities in different branches. Unlike most deep neural network (DNN)-based methods, which learn speech reconstruction capabilities with a common neural structure from all training noises, the proposed branchy model obtains greater performance benefits from the specially trained branches of prior known noise interference types. Experimental results show that the proposed branchy DNN model not only preserved better enhanced speech quality and intelligibility in seen noisy environments, but also obtained good generalization in unseen noisy environments.

Download Full-text