A two-stage complex network using cycle-consistent generative adversarial networks for speech enhancement

Author(s):  
Guochen Yu ◽  
Yutian Wang ◽  
Hui Wang ◽  
Qin Zhang ◽  
Chengshi Zheng
2021 ◽  
Vol 11 (2) ◽  
pp. 721
Author(s):  
Hyung Yong Kim ◽  
Ji Won Yoon ◽  
Sung Jun Cheon ◽  
Woo Hyun Kang ◽  
Nam Soo Kim

Recently, generative adversarial networks (GANs) have been successfully applied to speech enhancement. However, there still remain two issues that need to be addressed: (1) GAN-based training is typically unstable due to its non-convex property, and (2) most of the conventional methods do not fully take advantage of the speech characteristics, which could result in a sub-optimal solution. In order to deal with these problems, we propose a progressive generator that can handle the speech in a multi-resolution fashion. Additionally, we propose a multi-scale discriminator that discriminates the real and generated speech at various sampling rates to stabilize GAN training. The proposed structure was compared with the conventional GAN-based speech enhancement algorithms using the VoiceBank-DEMAND dataset. Experimental results showed that the proposed approach can make the training faster and more stable, which improves the performance on various metrics for speech enhancement.


2020 ◽  
Vol 17 (3) ◽  
pp. 401-405 ◽  
Author(s):  
Chenyang Zhang ◽  
Xuebing Yang ◽  
Yongqiang Tang ◽  
Wensheng Zhang

Author(s):  
Ju Lin ◽  
Sufeng Niu ◽  
Zice Wei ◽  
Xiang Lan ◽  
Adriaan J. van Wijngaarden ◽  
...  

2019 ◽  
Vol 114 ◽  
pp. 10-21 ◽  
Author(s):  
Santiago Pascual ◽  
Joan Serrà ◽  
Antonio Bonafonte

Sign in / Sign up

Export Citation Format

Share Document