Blind Image Separation Method Based on Cascade Generative Adversarial Networks

To solve the challenge of single-channel blind image separation (BIS) caused by unknown prior knowledge during the separation process, we propose a BIS method based on cascaded generative adversarial networks (GANs). To ensure that the proposed method can perform well in different scenarios and to address the problem of an insufficient number of training samples, a synthetic network is added to the separation network. This method is composed of two GANs: a U-shaped GAN (UGAN), which is used to learn image synthesis, and a pixel-to-attention GAN (PAGAN), which is used to learn image separation. The two networks jointly complete the task of image separation. UGAN uses the unpaired mixed image and the unmixed image to learn the mixing style, thereby generating an image with the “true” mixing characteristics which addresses the problem of an insufficient number of training samples for the PAGAN. A self-attention mechanism is added to the PAGAN to quickly extract important features from the image data. The experimental results show that the proposed method achieves good results on both synthetic image datasets and real remote sensing image datasets. Moreover, it can be used for image separation in different scenarios which lack prior knowledge and training samples.

Download Full-text

SemGAN: Text to Image Synthesis from Text Semantics using Attentional Generative Adversarial Networks

2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE) ◽

10.1109/iccceee49695.2021.9429602 ◽

2021 ◽

Author(s):

Ammar Nasr ◽

Ruba Mutasim ◽

Hiba Imam

Keyword(s):

Image Synthesis ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

SAM-GAN: Self-Attention supporting Multi-stage Generative Adversarial Networks for text-to-image synthesis

Neural Networks ◽

10.1016/j.neunet.2021.01.023 ◽

2021 ◽

Vol 138 ◽

pp. 57-67

Author(s):

Dunlu Peng ◽

Wuchen Yang ◽

Cong Liu ◽

Shuairui Lü

Keyword(s):

Image Synthesis ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Multi Stage

Download Full-text

Drawgan: Text to Image Synthesis with Drawing Generative Adversarial Networks

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414166 ◽

2021 ◽

Author(s):

Zhiqiang Zhang ◽

Jinjia Zhou ◽

Wenxin Yu ◽

Ning Jiang

Keyword(s):

Image Synthesis ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

Intrusion detection of railway clearance from infrared images using generative adversarial networks

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-192141 ◽

2020 ◽

pp. 1-13

Author(s):

Yundong Li ◽

Yi Liu ◽

Han Dong ◽

Wei Hu ◽

Chen Lin

Keyword(s):

Intrusion Detection ◽

Synthetic Data ◽

Generative Adversarial Networks ◽

Generation Model ◽

Single Shot ◽

Data Generation ◽

Infrared Images ◽

Adversarial Networks ◽

Training Samples ◽

Rgb Images

The intrusion detection of railway clearance is crucial for avoiding railway accidents caused by the invasion of abnormal objects, such as pedestrians, falling rocks, and animals. However, detecting intrusions using deep learning methods from infrared images captured at night remains a challenging task because of the lack of sufficient training samples. To address this issue, a transfer strategy that migrates daytime RGB images to the nighttime style of infrared images is proposed in this study. The proposed method consists of two stages. In the first stage, a data generation model is trained on the basis of generative adversarial networks using RGB images and a small number of infrared images, and then, synthetic samples are generated using a well-trained model. In the second stage, a single shot multibox detector (SSD) model is trained using synthetic data and utilized to detect abnormal objects from infrared images at nighttime. To validate the effectiveness of the proposed method, two groups of experiments, namely, railway and non-railway scenes, are conducted. Experimental results demonstrate the effectiveness of the proposed method, and an improvement of 17.8% is achieved for object detection at nighttime.

Download Full-text

mustGAN: multi-stream Generative Adversarial Networks for MR Image Synthesis

Medical Image Analysis ◽

10.1016/j.media.2020.101944 ◽

2021 ◽

pp. 101944

Author(s):

Mahmut Yurt ◽

Salman U.H. Dar ◽

Aykut Erdem ◽

Erkut Erdem ◽

Kader K Oguz ◽

...

Keyword(s):

Image Synthesis ◽

Generative Adversarial Networks ◽

Mr Image ◽

Adversarial Networks

Download Full-text

Microscopic Fluorescence In Situ Hybridization (FISH) Image Synthesis with Generative Adversarial Networks

2021 29th Signal Processing and Communications Applications Conference (SIU) ◽

10.1109/siu53274.2021.9477999 ◽

2021 ◽

Author(s):

Gizem Dursun ◽

Ufuk Ozkaya

Keyword(s):

In Situ Hybridization ◽

Fluorescence In Situ Hybridization ◽

Image Synthesis ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

Harnessing GANs for Zero-Shot Learning of New Classes in Visual Speech Recognition

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i03.5649 ◽

2020 ◽

Vol 34 (03) ◽

pp. 2645-2652 ◽

Cited By ~ 2

Author(s):

Yaman Kumar ◽

Dhruva Sahrawat ◽

Shubham Maheshwari ◽

Debanjan Mahata ◽

Amanda Stent ◽

...

Keyword(s):

Speech Recognition ◽

Classification Problem ◽

Visual Speech ◽

Training Data ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Novel Approach ◽

Visual Speech Recognition ◽

Training Samples ◽

English Training

Visual Speech Recognition (VSR) is the process of recognizing or interpreting speech by watching the lip movements of the speaker. Recent machine learning based approaches model VSR as a classification problem; however, the scarcity of training data leads to error-prone systems with very low accuracies in predicting unseen classes. To solve this problem, we present a novel approach to zero-shot learning by generating new classes using Generative Adversarial Networks (GANs), and show how the addition of unseen class samples increases the accuracy of a VSR system by a significant margin of 27% and allows it to handle speaker-independent out-of-vocabulary phrases. We also show that our models are language agnostic and therefore capable of seamlessly generating, using English training data, videos for a new language (Hindi). To the best of our knowledge, this is the first work to show empirical evidence of the use of GANs for generating training samples of unseen classes in the domain of VSR, hence facilitating zero-shot learning. We make the added videos for new classes publicly available along with our code1.

Download Full-text

Designing complex architectured materials with generative adversarial networks

Science Advances ◽

10.1126/sciadv.aaz4169 ◽

2020 ◽

Vol 6 (17) ◽

pp. eaaz4169 ◽

Cited By ~ 3

Author(s):

Yunwei Mao ◽

Qi He ◽

Xuanhe Zhao

Keyword(s):

Prior Knowledge ◽

Mass Production ◽

Systematic Approach ◽

Elastic Stiffness ◽

Upper Bounds ◽

Generative Adversarial Networks ◽

Simulation Data ◽

Adversarial Networks ◽

Isotropic Elasticity ◽

Architectured Materials

Architectured materials on length scales from nanometers to meters are desirable for diverse applications. Recent advances in additive manufacturing have made mass production of complex architectured materials technologically and economically feasible. Existing architecture design approaches such as bioinspiration, Edisonian, and optimization, however, generally rely on experienced designers’ prior knowledge, limiting broad applications of architectured materials. Particularly challenging is designing architectured materials with extreme properties, such as the Hashin-Shtrikman upper bounds on isotropic elasticity in an experience-free manner without prior knowledge. Here, we present an experience-free and systematic approach for the design of complex architectured materials with generative adversarial networks. The networks are trained using simulation data from millions of randomly generated architectures categorized based on different crystallographic symmetries. We demonstrate modeling and experimental results of more than 400 two-dimensional architectures that approach the Hashin-Shtrikman upper bounds on isotropic elastic stiffness with porosities from 0.05 to 0.75.

Download Full-text

Virtual Interpolation Images of Tumor Development and Growth on Breast Ultrasound Image Synthesis With Deep Convolutional Generative Adversarial Networks

Journal of Ultrasound in Medicine ◽

10.1002/jum.15376 ◽

2020 ◽

Vol 40 (1) ◽

pp. 61-69 ◽

Cited By ~ 3

Author(s):

Tomoyuki Fujioka ◽

Kazunori Kubota ◽

Mio Mori ◽

Leona Katsuta ◽

Yuka Kikuchi ◽

...

Keyword(s):

Tumor Development ◽

Ultrasound Image ◽

Image Synthesis ◽

Breast Ultrasound ◽

Generative Adversarial Networks ◽

Adversarial Networks

Download Full-text

Breast Ultrasound Image Synthesis using Deep Convolutional Generative Adversarial Networks

Diagnostics ◽

10.3390/diagnostics9040176 ◽

2019 ◽

Vol 9 (4) ◽

pp. 176 ◽

Cited By ~ 8

Author(s):

Tomoyuki Fujioka ◽

Mio Mori ◽

Kazunori Kubota ◽

Yuka Kikuchi ◽

Leona Katsuta ◽

...

Keyword(s):

Ultrasound Image ◽

Image Synthesis ◽

Breast Ultrasound ◽

Generative Adversarial Networks ◽

Ultrasound Images ◽

Clinical Value ◽

Adversarial Networks ◽

Significant Difference ◽

Definition Of ◽

The Masses

Deep convolutional generative adversarial networks (DCGANs) are newly developed tools for generating synthesized images. To determine the clinical utility of synthesized images, we generated breast ultrasound images and assessed their quality and clinical value. After retrospectively collecting 528 images of 144 benign masses and 529 images of 216 malignant masses in the breasts, synthesized images were generated using a DCGAN with 50, 100, 200, 500, and 1000 epochs. The synthesized (n = 20) and original (n = 40) images were evaluated by two radiologists, who scored them for overall quality, definition of anatomic structures, and visualization of the masses on a five-point scale. They also scored the possibility of images being original. Although there was no significant difference between the images synthesized with 1000 and 500 epochs, the latter were evaluated as being of higher quality than all other images. Moreover, 2.5%, 0%, 12.5%, 37.5%, and 22.5% of the images synthesized with 50, 100, 200, 500, and 1000 epochs, respectively, and 14% of the original images were indistinguishable from one another. Interobserver agreement was very good (|r| = 0.708–0.825, p < 0.001). Therefore, DCGAN can generate high-quality and realistic synthesized breast ultrasound images that are indistinguishable from the original images.

Download Full-text