SFTRLS-Based Speech Enhancement Method Using CNN to Determine the Noise Type and the Optimal Forgetting Factor

Author(s): Deyou Tang ◽ Guoqiang Chen

2017 ◽ Vol 31 (19-21) ◽ pp. 1740096
Author(s): Wenhua Shi ◽ Xiongwei Zhang ◽ Xia Zou ◽ Wei Han

In this paper, a speech enhancement method using noise classification and a deep neural network (DNN) is proposed. A Gaussian mixture model (GMM) is employed to determine the noise type from speech-absent frames, and a DNN is used to model the relationship between the noisy observation and the clean speech. Once the noise type is determined, the corresponding DNN model is applied to enhance the noisy speech. The GMM is trained on mel-frequency cepstral coefficients (MFCCs), and its parameters are estimated with an iterative expectation-maximization (EM) algorithm. The noise type is updated by spectrum entropy-based voice activity detection (VAD). Experimental results demonstrate that the proposed method achieves better objective speech quality and smaller speech distortion under both stationary and non-stationary noise conditions.
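
As a rough illustration of the classification stage described above, the sketch below fits one diagonal-covariance GMM per noise type on MFCC frames (the EM iterations run inside scikit-learn's fit) and routes an utterance to the matching enhancement model. This is not the authors' implementation; the feature dictionaries, the DNN model registry, and the helper names are hypothetical placeholders.

```python
# Minimal sketch (assumptions, not the paper's code): per-noise-type GMMs over
# MFCC frames, fitted with EM, used to select which enhancement DNN to apply.
from sklearn.mixture import GaussianMixture

def train_noise_gmms(mfcc_frames_by_noise, n_components=8):
    """Fit one diagonal-covariance GMM per noise type.

    mfcc_frames_by_noise: dict mapping noise-type name -> array (n_frames, n_mfcc)
    of MFCCs extracted from noise-only (speech-absent) frames.
    """
    gmms = {}
    for noise_type, frames in mfcc_frames_by_noise.items():
        gmm = GaussianMixture(n_components=n_components,
                              covariance_type="diag",
                              max_iter=200, random_state=0)
        gmm.fit(frames)          # EM parameter estimation happens here
        gmms[noise_type] = gmm
    return gmms

def classify_noise(gmms, mfcc_frames):
    """Return the noise type whose GMM gives the highest average log-likelihood."""
    scores = {t: g.score(mfcc_frames) for t, g in gmms.items()}
    return max(scores, key=scores.get)

# Usage sketch: classify the speech-absent frames flagged by the VAD,
# then apply the DNN trained for that noise type (dnn_models is hypothetical).
# noise_type = classify_noise(gmms, mfcc_of_noise_only_frames)
# enhanced = dnn_models[noise_type].predict(noisy_features)
```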


2021
Author(s): Kaibei Peng ◽ Xiaoming Sun ◽ Haowei Chen ◽ Zhen He ◽ Jianrong Wang

2014 ◽ Vol 13 (10) ◽ pp. 1730-1736
Author(s): Cao Bin-Fang ◽ Li Jian-Qi ◽ Qu Peixin ◽ Peng Guang-Han

2003 ◽ Vol 114 (4) ◽ pp. 2369-2369
Author(s): Hiroyuki Ono ◽ Takahiro Murakami ◽ Yoshihisa Ishida

2020 ◽ Vol 10 (3) ◽ pp. 1167
Author(s): Lu Zhang ◽ Mingjiang Wang ◽ Qiquan Zhang ◽ Ming Liu

The performance of speech enhancement algorithms can be further improved by considering the application scenarios of speech products. In this paper, we propose an attention-based branchy neural network framework that incorporates prior environmental information for noise reduction. In the overall denoising framework, an environment classification network is first trained to distinguish the noise type of each noisy speech frame. Guided by this classification network, the denoising network gradually learns a separate noise reduction ability in each branch. Unlike most deep neural network (DNN)-based methods, which learn speech reconstruction with a single common neural structure from all training noises, the proposed branchy model obtains greater performance benefits from branches specially trained for previously known noise interference types. Experimental results show that the proposed branchy DNN model not only preserves better enhanced speech quality and intelligibility in seen noisy environments, but also generalizes well to unseen noisy environments.
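
To make the branchy idea described above concrete, the sketch below combines a shared encoder, one denoising branch per known noise type, and an environment classifier whose softmax output acts as an attention-style weighting over the branch outputs. The layer sizes, the mask-based enhancement formulation, and all module names are assumptions for illustration, not the paper's actual architecture.

```python
# Minimal sketch (assumptions, not the paper's model): branchy denoiser with an
# environment classifier that softly gates per-noise-type branches.
import torch
import torch.nn as nn

class BranchyDenoiser(nn.Module):
    def __init__(self, n_features=257, hidden=512, n_noise_types=4):
        super().__init__()
        # Shared trunk over the noisy magnitude spectrum.
        self.encoder = nn.Sequential(nn.Linear(n_features, hidden), nn.ReLU())
        # Environment (noise-type) classifier on the shared representation.
        self.env_classifier = nn.Linear(hidden, n_noise_types)
        # One mask-estimating branch per prior known noise type.
        self.branches = nn.ModuleList([
            nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                          nn.Linear(hidden, n_features), nn.Sigmoid())
            for _ in range(n_noise_types)])

    def forward(self, noisy_spectrum):
        h = self.encoder(noisy_spectrum)                   # (batch, hidden)
        env_logits = self.env_classifier(h)                # (batch, n_types)
        weights = torch.softmax(env_logits, dim=-1)        # attention over branches
        masks = torch.stack([b(h) for b in self.branches], dim=1)  # (batch, n_types, n_features)
        mask = (weights.unsqueeze(-1) * masks).sum(dim=1)  # weighted branch combination
        return noisy_spectrum * mask, env_logits           # enhanced magnitude, aux logits

# Training sketch: env_logits can be supervised with a cross-entropy loss against
# the known noise label, jointly with a reconstruction loss on the enhanced spectrum.
```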

