Time-domain speech enhancement using generative adversarial networks

Speech Communication ◽

10.1016/j.specom.2019.09.001 ◽

2019 ◽

Vol 114 ◽

pp. 10-21 ◽

Author(s):

Santiago Pascual ◽

Joan Serrà ◽

Antonio Bonafonte

Keyword(s):

Speech Enhancement ◽

Time Domain ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

A Multi-Resolution Approach to GAN-Based Speech Enhancement

Applied Sciences ◽

10.3390/app11020721 ◽

2021 ◽

Vol 11 (2) ◽

pp. 721

Author(s):

Hyung Yong Kim ◽

Ji Won Yoon ◽

Sung Jun Cheon ◽

Woo Hyun Kang ◽

Nam Soo Kim

Keyword(s):

Speech Enhancement ◽

Optimal Solution ◽

Experimental Results ◽

Generative Adversarial Networks ◽

The Real ◽

Multi Scale ◽

Adversarial Networks ◽

Speech Characteristics ◽

Conventional Methods ◽

Convex Property

Recently, generative adversarial networks (GANs) have been successfully applied to speech enhancement. However, there still remain two issues that need to be addressed: (1) GAN-based training is typically unstable due to its non-convex property, and (2) most of the conventional methods do not fully take advantage of the speech characteristics, which could result in a sub-optimal solution. In order to deal with these problems, we propose a progressive generator that can handle the speech in a multi-resolution fashion. Additionally, we propose a multi-scale discriminator that discriminates the real and generated speech at various sampling rates to stabilize GAN training. The proposed structure was compared with the conventional GAN-based speech enhancement algorithms using the VoiceBank-DEMAND dataset. Experimental results showed that the proposed approach can make the training faster and more stable, which improves the performance on various metrics for speech enhancement.

Download Full-text

Multi-scale Generative Adversarial Networks for Speech Enhancement

2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) ◽

10.1109/globalsip45357.2019.8969193 ◽

2019 ◽

Author(s):

Yihang Li ◽

Ting Jiang ◽

Shan Qin

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Multi Scale ◽

Adversarial Networks

Download Full-text

Speech enhancement based on spectrogram conditional generative adversarial networks

Eleventh International Conference on Graphics and Image Processing (ICGIP 2019) ◽

10.1117/12.2557256 ◽

2020 ◽

Author(s):

Ru Han ◽

Jianming Liu ◽

Mingwen Wang

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

Speech Enhancement Using Forked Generative Adversarial Networks with Spectral Subtraction

10.21437/interspeech.2019-2954 ◽

2019 ◽

Author(s):

Ju Lin ◽

Sufeng Niu ◽

Zice Wei ◽

Xiang Lan ◽

Adriaan J. van Wijngaarden ◽

...

Keyword(s):

Speech Enhancement ◽

Spectral Subtraction ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

A two-stage complex network using cycle-consistent generative adversarial networks for speech enhancement

Speech Communication ◽

10.1016/j.specom.2021.09.001 ◽

2021 ◽

Author(s):

Guochen Yu ◽

Yutian Wang ◽

Hui Wang ◽

Qin Zhang ◽

Chengshi Zheng

Keyword(s):

Complex Network ◽

Speech Enhancement ◽

Generative Adversarial Networks ◽

Two Stage ◽

Adversarial Networks

Download Full-text

End-to-end Speech Enhancement Using Self-Attention Generative Adversarial Networks

10.1109/iscipt53667.2021.00154 ◽

2021 ◽

Author(s):

Zhonghui Cao ◽

Zhihua Huang

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Download Full-text

HiFi-GAN-2: Studio-Quality Speech Enhancement via Generative Adversarial Networks Conditioned on Acoustic Features

10.1109/waspaa52581.2021.9632770 ◽

2021 ◽

Author(s):

Jiaqi Su ◽

Zeyu Jin ◽

Adam Finkelstein

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Acoustic Features ◽

Adversarial Networks

Download Full-text

Time-Domain Signal Synthesis with Style-Based Generative Adversarial Networks Applied to Guided Waves

10.1007/978-3-030-87986-0_7 ◽

2021 ◽

pp. 78-88

Author(s):

Mateusz Heesch ◽

Krzysztof Mendrok ◽

Ziemowit Dworakowski

Keyword(s):

Time Domain ◽

Guided Waves ◽

Generative Adversarial Networks ◽

Time Domain Signal ◽

Adversarial Networks ◽

Signal Synthesis

Download Full-text

PAGAN: A Phase-Adapted Generative Adversarial Networks for Speech Enhancement

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9054256 ◽

2020 ◽

Author(s):

Peishuo Li ◽

Zihang Jiang ◽

Shouyi Yin ◽

Dandan Song ◽

Peng Ouyang ◽

...

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

Speech enhancement through improvised conditional generative adversarial networks

Microprocessors and Microsystems ◽

10.1016/j.micpro.2020.103281 ◽

2020 ◽

Vol 79 ◽

pp. 103281

Author(s):

Saravana Ram Ram ◽

Vinoth Kumar M ◽

Balambigai Subramanian ◽

Nebojsa Bacanin ◽

Miodrag Zivkovic ◽

...

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text