A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network

An improved fully convolutional network with global variance (GV) equalization post-processing and noise-aware training (PN-FCN) is proposed for speech enhancement. It aims to reduce the complexity of the speech enhancement system while addressing two problems: overly smooth spectrograms of the enhanced speech and poor generalization capability. The PN-FCN is fed noisy speech samples augmented with an estimate of the noise, so that it can use this additional online noise information to better predict the clean speech. In addition, the PN-FCN uses global variance information, which improves the subjective score in a voice conversion task. Finally, the proposed framework adopts an FCN whose number of parameters is one-seventh that of a deep neural network (DNN). Experimental results on the Valentini-Botinhao dataset demonstrate that the proposed framework improves both the denoising effect and the model training speed.
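The two ideas named in the abstract above, noise-aware input and GV equalization, can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the choice to estimate noise by averaging the first `n_init` frames (assumed speech-free), and the rescaling of each feature dimension around its own mean are all assumptions made for illustration.

```python
from statistics import fmean, pvariance


def estimate_noise(frames, n_init=6):
    """Average the first n_init frames, which are ASSUMED to contain no speech."""
    head = frames[:n_init]
    return [fmean(column) for column in zip(*head)]


def noise_aware_input(frames, n_init=6):
    """Append the noise estimate to every noisy frame (hypothetical layout)."""
    noise = estimate_noise(frames, n_init)
    return [list(frame) + noise for frame in frames]


def gv_equalize(enhanced, gv_ref):
    """Rescale each feature dimension so its variance matches gv_ref.

    gv_ref is ASSUMED to hold one reference variance per dimension,
    for example measured on clean training speech.
    """
    columns = list(zip(*enhanced))
    means = [fmean(column) for column in columns]
    scales = []
    for column, ref in zip(columns, gv_ref):
        var = pvariance(column)
        # Leave a constant dimension untouched instead of dividing by zero.
        scales.append((ref / var) ** 0.5 if var > 0 else 1.0)
    return [
        [m + s * (x - m) for x, m, s in zip(frame, means, scales)]
        for frame in enhanced
    ]


# Toy usage with 2-dimensional "spectral" frames.
noisy = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
network_input = noise_aware_input(noisy, n_init=2)
sharpened = gv_equalize([[0.0, 1.0], [2.0, 3.0]], gv_ref=[4.0, 4.0])
```

Widening the variance of an overly smooth network output toward a reference value is what counteracts the over-smoothing mentioned in the abstract. A real system would apply this to log-spectral features rather than toy lists.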

Single-Channel Speech Enhancement Based on Sparse Regressive Deep Neural Network

Software Engineering and Applications ◽

10.12677/sea.2017.61002 ◽

2017 ◽

Vol 06 (01) ◽

pp. 8-19

Author(s):

海霞孙

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Single Channel

Supervised speech enhancement based on deep neural network

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-190047 ◽

2019 ◽

Vol 37 (4) ◽

pp. 5187-5201 ◽

Cited By ~ 2

Author(s):

Nasir Saleem ◽

Muhammad Irfan Khattak ◽

Abdul Baser Qazi

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network

Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments

Latent Variable Analysis and Signal Separation - Lecture Notes in Computer Science ◽

10.1007/978-3-319-22482-4_9 ◽

2015 ◽

pp. 75-82 ◽

Cited By ~ 8

Author(s):

Tian Gao ◽

Jun Du ◽

Yong Xu ◽

Cong Liu ◽

Li-Rong Dai ◽

...

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Low SNR

Deep Neural Network Based Monaural Speech Enhancement with Low-Rank Analysis and Speech Present Probability

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.e101.a.585 ◽

2018 ◽

Vol E101.A (3) ◽

pp. 585-589

Author(s):

Wenhua SHI ◽

Xiongwei ZHANG ◽

Xia ZOU ◽

Meng SUN ◽

Wei HAN ◽

...

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Low Rank ◽

Rank Analysis

Environmental Attention-Guided Branchy Neural Network for Speech Enhancement

Applied Sciences ◽

10.3390/app10031167 ◽

2020 ◽

Vol 10 (3) ◽

pp. 1167 ◽

Cited By ~ 1

Author(s):

Lu Zhang ◽

Mingjiang Wang ◽

Qiquan Zhang ◽

Ming Liu

Keyword(s):

Neural Network ◽

Noise Reduction ◽

Speech Enhancement ◽

Deep Neural Network ◽

Experimental Results ◽

Noisy Environments ◽

Neural Structure ◽

Noise Interference ◽

Noise Type ◽

Speech Reconstruction

The performance of speech enhancement algorithms can be further improved by considering the application scenarios of speech products. In this paper, we propose an attention-based branchy neural network framework that incorporates prior environmental information for noise reduction. Within the denoising framework, an environment classification network is first trained to distinguish the noise type of each noisy speech frame. Guided by this classification network, the denoising network gradually learns a noise reduction ability specific to each branch. Unlike most deep neural network (DNN)-based methods, which learn speech reconstruction with a single neural structure shared across all training noises, the proposed branchy model obtains greater performance benefits from branches trained specifically for known noise interference types. Experimental results show that the proposed branchy DNN model not only achieves better enhanced speech quality and intelligibility in seen noisy environments, but also generalizes well to unseen noisy environments.
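One way to picture the classifier-guided branches described above is a soft mixture: the classifier's noise-type posterior weights the outputs of the branches. The sketch below is only an illustration under that assumption. The abstract does not specify how the guidance is computed, and `branchy_denoise`, the toy classifier and the toy branches are hypothetical stand-ins for trained networks.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def branchy_denoise(frame, classifier, branches):
    """Combine branch outputs using classifier posteriors as attention weights.

    classifier: callable returning one score per noise type (ASSUMED interface).
    branches:   one denoising callable per noise type (ASSUMED interface).
    """
    weights = softmax(classifier(frame))
    outputs = [branch(frame) for branch in branches]
    return [
        sum(w * out[i] for w, out in zip(weights, outputs))
        for i in range(len(frame))
    ]


# Toy stand-ins: each "branch" just attenuates the frame by a fixed gain.
def toy_classifier(frame):
    return [0.0, 0.0]  # equal scores, so both branches get weight 0.5


toy_branches = [
    lambda frame: [0.5 * x for x in frame],  # e.g. a branch for noise type A
    lambda frame: [1.0 * x for x in frame],  # e.g. a branch for noise type B
]

enhanced = branchy_denoise([2.0, 4.0], toy_classifier, toy_branches)
```

With a confident classifier the weights approach a one-hot vector, so a single specialized branch dominates the output. With an uncertain classifier the branches are blended.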

A reduced complexity MFCC-based deep neural network approach for speech enhancement

2017 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) ◽

10.1109/isspit.2017.8388664 ◽

2017 ◽

Author(s):

Ryan Razani ◽

Hanwook Chung ◽

Yazid Attabi ◽

Benoit Champagne

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Network Approach ◽

Neural Network Approach ◽

Reduced Complexity
