Cascaded Conditional Generative Adversarial Networks With Multi-Scale Attention Fusion for Automated Bi-Ventricle Segmentation in Cardiac MRI

Recently, generative adversarial networks (GANs) have been successfully applied to speech enhancement. However, there still remain two issues that need to be addressed: (1) GAN-based training is typically unstable due to its non-convex property, and (2) most of the conventional methods do not fully take advantage of the speech characteristics, which could result in a sub-optimal solution. In order to deal with these problems, we propose a progressive generator that can handle the speech in a multi-resolution fashion. Additionally, we propose a multi-scale discriminator that discriminates the real and generated speech at various sampling rates to stabilize GAN training. The proposed structure was compared with the conventional GAN-based speech enhancement algorithms using the VoiceBank-DEMAND dataset. Experimental results showed that the proposed approach can make the training faster and more stable, which improves the performance on various metrics for speech enhancement.

Download Full-text

Enhancing Underwater Image Using Multi-scale Generative Adversarial Networks

Parallel Architectures, Algorithms and Programming - Communications in Computer and Information Science ◽

10.1007/978-981-16-0010-4_23 ◽

2021 ◽

pp. 259-269

Author(s):

Yujie Zhang ◽

Peixiang Chen ◽

Jiangyi Huang ◽

Yuzhong Chen

Keyword(s):

Generative Adversarial Networks ◽

Multi Scale ◽

Adversarial Networks ◽

Underwater Image

Download Full-text

Laplacian Generative Adversarial Networks for Multi-Scale Super-Resolution

2020 IEEE 5th Information Technology and Mechatronics Engineering Conference (ITOEC) ◽

10.1109/itoec49072.2020.9141731 ◽

2020 ◽

Author(s):

Hongrui Xia ◽

Yingyun Yang ◽

Xiao Hu

Keyword(s):

Super Resolution ◽

Generative Adversarial Networks ◽

Multi Scale ◽

Adversarial Networks

Download Full-text

Multi-scale Generative Adversarial Networks for Speech Enhancement

2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) ◽

10.1109/globalsip45357.2019.8969193 ◽

2019 ◽

Author(s):

Yihang Li ◽

Ting Jiang ◽

Shan Qin

Keyword(s):

Speech Enhancement ◽

Generative Adversarial Networks ◽

Multi Scale ◽

Adversarial Networks

Download Full-text

MW-ACGAN: Generating Multiscale High-Resolution SAR Images for Ship Detection

Sensors ◽

10.3390/s20226673 ◽

2020 ◽

Vol 20 (22) ◽

pp. 6673

Author(s):

Lichuan Zou ◽

Hong Zhang ◽

Chao Wang ◽

Fan Wu ◽

Feng Gu

Keyword(s):

High Resolution ◽

Small Sample ◽

Generative Adversarial Networks ◽

Detection Accuracy ◽

Data Set ◽

Ship Detection ◽

Multi Scale ◽

Adversarial Networks ◽

Sar Data ◽

Composite Data

In high-resolution Synthetic Aperture Radar (SAR) ship detection, the number of SAR samples seriously affects the performance of the algorithms based on deep learning. In this paper, aiming at the application requirements of high-resolution ship detection in small samples, a high-resolution SAR ship detection method combining an improved sample generation network, Multiscale Wasserstein Auxiliary Classifier Generative Adversarial Networks (MW-ACGAN) and the Yolo v3 network is proposed. Firstly, the multi-scale Wasserstein distance and gradient penalty loss are used to improve the original Auxiliary Classifier Generative Adversarial Networks (ACGAN), so that the improved network can stably generate high-resolution SAR ship images. Secondly, the multi-scale loss term is added to the network, so the multi-scale image output layers are added, and multi-scale SAR ship images can be generated. Then, the original ship data set and the generated data are combined into a composite data set to train the Yolo v3 target detection network, so as to solve the problem of low detection accuracy under small sample data set. The experimental results of Gaofen-3 (GF-3) 3 m SAR data show that the MW-ACGAN network can generate multi-scale and multi-class ship slices, and the confidence level of ResNet18 is higher than that of ACGAN network, with an average score of 0.91. The detection results of Yolo v3 network model show that the detection accuracy trained by the composite data set is as high as 94%, which is far better than that trained only by the original SAR data set. These results show that our method can make the best use of the original data set, improve the accuracy of ship detection.

Download Full-text

MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr42600.2020.00782 ◽

2020 ◽

Cited By ~ 5

Author(s):

Animesh Karnewar ◽

Oliver Wang

Keyword(s):

Generative Adversarial Networks ◽

Multi Scale ◽

Adversarial Networks

Download Full-text

Multi-scale Generative Adversarial Networks for Crowd Counting

2018 24th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr.2018.8545683 ◽

2018 ◽

Cited By ~ 2

Author(s):

Jianxing Yang ◽

Yuan Zhou ◽

Sun-Yuan Kung

Keyword(s):

Generative Adversarial Networks ◽

Crowd Counting ◽

Multi Scale ◽

Adversarial Networks

Download Full-text

An improved pix2pix model based on Gabor filter for robust color image rendering

Mathematical Biosciences and Engineering ◽

10.3934/mbe.2022004 ◽

2021 ◽

Vol 19 (1) ◽

pp. 86-101

Author(s):

Hong-an Li ◽

◽

Min Zhang ◽

Zhenhua Yu ◽

Zhanli Li ◽

...

Keyword(s):

Color Image ◽

Gabor Filter ◽

Texture Feature ◽

Generative Adversarial Networks ◽

Feature Maps ◽

Penalty Term ◽

Multi Scale ◽

Adversarial Networks ◽

Robust Image ◽

Color Rendering

<abstract><p>In recent years, with the development of deep learning, image color rendering method has become a research hotspot once again. To overcome the detail problems of color overstepping and boundary blurring in the robust image color rendering method, as well as the problems of unstable training based on generative adversarial networks, we propose an color rendering method using Gabor filter based improved pix2pix for robust image. Firstly, the multi-direction/multi-scale selection characteristic of Gabor filter is used to preprocess the image to be rendered, which can retain the detailed features of the image while preprocessing to avoid the loss of features. Moreover, among the Gabor texture feature maps with 6 scales and 4 directions, the texture map with the scale of 7 and the direction of 0° has the comparable rendering performance. Finally, by improving the loss function of pix2pix model and adding the penalty term, not only the training can be stabilized, but also the ideal color image can be obtained. To reflect image color rendering quality of different models more objectively, PSNR and SSIM indexes are adopted to evaluate the rendered images. The experimental results of the proposed method show that the robust image rendered by this method has better visual performance and reduces the influence of light and noise on the image to a certain extent.</p></abstract>

Download Full-text