Multi-scale decomposition based supervised single channel deep speech enhancement

Applied Soft Computing ◽

10.1016/j.asoc.2020.106666 ◽

2020 ◽

Vol 95 ◽

pp. 106666

Author(s):

Nasir Saleem ◽

Muhammad Irfan Khattak

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Download Full-text

A Convolutional Network with Multi-Scale and Attention Mechanisms for End-to-End Single-Channel Speech Enhancement

IEEE Signal Processing Letters ◽

10.1109/lsp.2021.3093859 ◽

2021 ◽

pp. 1-1

Author(s):

Xiang Xiaoxiao ◽

Zhang Xiaojuan ◽

Chen Haozhe

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Convolutional Network ◽

Multi Scale ◽

Download Full-text

A Multi-Scale Feature Recalibration Network for End-to-End Single Channel Speech Enhancement

IEEE Journal of Selected Topics in Signal Processing ◽

10.1109/jstsp.2020.3045846 ◽

2020 ◽

pp. 1-1

Author(s):

Yang Xian ◽

Yang Sun ◽

Wenwu Wang ◽

Syed Mohsen Naqvi

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Scale Feature ◽

Multi Scale ◽

Download Full-text

Multi-Scale Residual Convolutional Encoder Decoder with Bidirectional Long Short-Term Memory for Single Channel Speech Enhancement

2020 28th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco47968.2020.9287618 ◽

2021 ◽

Author(s):

Yang Xian ◽

Yang Sun ◽

Wenwu Wang ◽

Syed Mohsen Naqvi

Keyword(s):

Speech Enhancement ◽

Short Term Memory ◽

Single Channel ◽

Term Memory ◽

Multi Scale ◽

Long Short Term Memory ◽

Convolutional Encoder

Download Full-text

Adaptive Single-Channel Speech Enhancement Method for a Push-To-Talk Enabled Wireless Communication Device

IEICE Transactions on Communications ◽

10.1587/transcom.2015ccp0023 ◽

2016 ◽

Vol E99.B (8) ◽

pp. 1745-1753

Author(s):

Hyoung-Gook KIM ◽

Jin Young KIM

Keyword(s):

Wireless Communication ◽

Speech Enhancement ◽

Single Channel ◽

Communication Device ◽

Enhancement Method

Download Full-text

Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement

10.21437/interspeech.2018-1439 ◽

2018 ◽

Author(s):

Li Chai ◽

Jun Du ◽

Chin-Hui Lee

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Single Channel ◽

Laplace Distribution ◽

Error Modeling ◽

Asymmetric Laplace Distribution

Download Full-text

Multi-Scale TCN: Exploring Better Temporal DNN Model for Causal Speech Enhancement

10.21437/interspeech.2020-1104 ◽

2020 ◽

Author(s):

Lu Zhang ◽

Mingjiang Wang

Keyword(s):

Speech Enhancement ◽

Download Full-text

Perceptual weighting deep neural networks for single-channel speech enhancement

2016 12th World Congress on Intelligent Control and Automation (WCICA) ◽

10.1109/wcica.2016.7578300 ◽

2016 ◽

Author(s):

Wei Han ◽

Xiongwei Zhang ◽

Gang Min ◽

Xingyu Zhou ◽

Wei Zhang

Keyword(s):

Neural Networks ◽

Speech Enhancement ◽

Deep Neural Networks ◽

Single Channel ◽

Perceptual Weighting

Download Full-text

A New Weighted Loss for Single Channel Speech Enhancement under Low Signal-to-Noise Ratio Environment

2020 15th IEEE International Conference on Signal Processing (ICSP) ◽

10.1109/icsp48669.2020.9320989 ◽

2020 ◽

Author(s):

Jian Xiao ◽

Hongqing Liu ◽

Yi Zhou ◽

Zhen Luo

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Download Full-text

Robust Constrained MFMVDR Filters for Single-Channel Speech Enhancement based on Spherical Uncertainty Set

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2020.3042013 ◽

2020 ◽

pp. 1-1

Author(s):

Doerte Fischer ◽

Simon Doclo

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Uncertainty Set

Download Full-text

A Multi-Resolution Approach to GAN-Based Speech Enhancement

Applied Sciences ◽

10.3390/app11020721 ◽

2021 ◽

Vol 11 (2) ◽

pp. 721

Author(s):

Hyung Yong Kim ◽

Ji Won Yoon ◽

Sung Jun Cheon ◽

Woo Hyun Kang ◽

Nam Soo Kim

Keyword(s):

Speech Enhancement ◽

Optimal Solution ◽

Experimental Results ◽

Generative Adversarial Networks ◽

The Real ◽

Multi Scale ◽

Adversarial Networks ◽

Speech Characteristics ◽

Conventional Methods ◽

Convex Property

Recently, generative adversarial networks (GANs) have been successfully applied to speech enhancement. However, there still remain two issues that need to be addressed: (1) GAN-based training is typically unstable due to its non-convex property, and (2) most of the conventional methods do not fully take advantage of the speech characteristics, which could result in a sub-optimal solution. In order to deal with these problems, we propose a progressive generator that can handle the speech in a multi-resolution fashion. Additionally, we propose a multi-scale discriminator that discriminates the real and generated speech at various sampling rates to stabilize GAN training. The proposed structure was compared with the conventional GAN-based speech enhancement algorithms using the VoiceBank-DEMAND dataset. Experimental results showed that the proposed approach can make the training faster and more stable, which improves the performance on various metrics for speech enhancement.

Download Full-text