Evaluation of Generative Adversarial Network for Human Face Image Synthesis

Robotic drawing has become increasingly popular as an entertainment and interactive tool. In this paper we present RoboCoDraw, a real-time collaborative robot-based drawing system that draws stylized human face sketches interactively in front of human users, by using the Generative Adversarial Network (GAN)-based style transfer and a Random-Key Genetic Algorithm (RKGA)-based path optimization. The proposed RoboCoDraw system takes a real human face image as input, converts it to a stylized avatar, then draws it with a robotic arm. A core component in this system is the AvatarGAN proposed by us, which generates a cartoon avatar face image from a real human face. AvatarGAN is trained with unpaired face and avatar images only and can generate avatar images of much better likeness with human face images in comparison with the vanilla CycleGAN. After the avatar image is generated, it is fed to a line extraction algorithm and converted to sketches. An RKGA-based path optimization algorithm is applied to find a time-efficient robotic drawing path to be executed by the robotic arm. We demonstrate the capability of RoboCoDraw on various face images using a lightweight, safe collaborative robot UR5.

Download Full-text

PathosisGAN: Sick Face Image Synthesis with Generative Adversarial Network

10.1145/3469213.3470691 ◽

2021 ◽

Author(s):

Jinyu Hu ◽

Yuchen Ren ◽

Yuan Yuan ◽

Yin Li ◽

Lei Chen

Keyword(s):

Image Synthesis ◽

Face Image ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

A survey on generative adversarial network-based text-to-image synthesis

Neurocomputing ◽

10.1016/j.neucom.2021.04.069 ◽

2021 ◽

Author(s):

Rui Zhou ◽

Cong Jiang ◽

Qingyang Xu

Keyword(s):

Image Synthesis ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

CDL-GAN: Contrastive Distance Learning Generative Adversarial Network for Image Generation

Applied Sciences ◽

10.3390/app11041380 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1380

Author(s):

Yingbo Zhou ◽

Pengcheng Zhao ◽

Weiqin Tong ◽

Yongxin Zhu

Keyword(s):

Distance Learning ◽

Feature Learning ◽

Image Synthesis ◽

Image Feature ◽

Generative Adversarial Networks ◽

Image Generation ◽

Generative Adversarial Network ◽

Feature Representations ◽

Adversarial Network ◽

Public Datasets

While Generative Adversarial Networks (GANs) have shown promising performance in image generation, they suffer from numerous issues such as mode collapse and training instability. To stabilize GAN training and improve image synthesis quality with diversity, we propose a simple yet effective approach as Contrastive Distance Learning GAN (CDL-GAN) in this paper. Specifically, we add Consistent Contrastive Distance (CoCD) and Characteristic Contrastive Distance (ChCD) into a principled framework to improve GAN performance. The CoCD explicitly maximizes the ratio of the distance between generated images and the increment between noise vectors to strengthen image feature learning for the generator. The ChCD measures the sampling distance of the encoded images in Euler space to boost feature representations for the discriminator. We model the framework by employing Siamese Network as a module into GANs without any modification on the backbone. Both qualitative and quantitative experiments conducted on three public datasets demonstrate the effectiveness of our method.

Download Full-text

Realistic High-resolution Body Computed Tomography Image Synthesis Using Progressive Growing Generative Adversarial Network: A Visual Turing Test (Preprint)

JMIR Medical Informatics ◽

10.2196/23328 ◽

2020 ◽

Author(s):

Ho Young Park ◽

Hyun-Jin Bae ◽

Gil-Sun Hong ◽

Minjee Kim ◽

JiHye Yun ◽

...

Keyword(s):

Computed Tomography ◽

High Resolution ◽

Image Synthesis ◽

Turing Test ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Tomography Image ◽

Computed Tomography Image

Download Full-text

IntroVNMT: An Introspective Model for Variational Neural Machine Translation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6411 ◽

2020 ◽

Vol 34 (05) ◽

pp. 8830-8837

Author(s):

Xin Sheng ◽

Linli Xu ◽

Junliang Guo ◽

Jingchang Liu ◽

Ruoyu Zhao ◽

...

Keyword(s):

Machine Translation ◽

Latent Variables ◽

Image Synthesis ◽

Target Language ◽

Generative Adversarial Network ◽

Neural Machine Translation ◽

Adversarial Network ◽

Proposed Model ◽

Model Training ◽

High Level

We propose a novel introspective model for variational neural machine translation (IntroVNMT) in this paper, inspired by the recent successful application of introspective variational autoencoder (IntroVAE) in high quality image synthesis. Different from the vanilla variational NMT model, IntroVNMT is capable of improving itself introspectively by evaluating the quality of the generated target sentences according to the high-level latent variables of the real and generated target sentences. As a consequence of introspective training, the proposed model is able to discriminate between the generated and real sentences of the target language via the latent variables generated by the encoder of the model. In this way, IntroVNMT is able to generate more realistic target sentences in practice. In the meantime, IntroVNMT inherits the advantages of the variational autoencoders (VAEs), and the model training process is more stable than the generative adversarial network (GAN) based models. Experimental results on different translation tasks demonstrate that the proposed model can achieve significant improvements over the vanilla variational NMT model.

Download Full-text

Text to Image Synthesis With Bidirectional Generative Adversarial Network

2020 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme46284.2020.9102904 ◽

2020 ◽

Author(s):

Zixu Wang ◽

Zhe Quan ◽

Zhi-Jie Wang ◽

Xinjian Hu ◽

Yangyang Chen

Keyword(s):

Image Synthesis ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Semantically consistent text to fashion image synthesis with an enhanced attentional generative adversarial network

Pattern Recognition Letters ◽

10.1016/j.patrec.2020.02.030 ◽

2020 ◽

Vol 135 ◽

pp. 22-29 ◽

Cited By ~ 1

Author(s):

Kenan E. Ak ◽

Joo Hwee Lim ◽

Jo Yew Tham ◽

Ashraf A. Kassim

Keyword(s):

Image Synthesis ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Text to Realistic Image Generation with Attentional Concatenation Generative Adversarial Networks

Discrete Dynamics in Nature and Society ◽

10.1155/2020/6452536 ◽

2020 ◽

Vol 2020 ◽

pp. 1-10

Author(s):

Linyan Li ◽

Yu Sun ◽

Fuyuan Hu ◽

Tao Zhou ◽

Xuefeng Xi ◽

...

Keyword(s):

High Resolution ◽

Image Synthesis ◽

Semantic Space ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Fine Grained ◽

Adversarial Network ◽

Adversarial Networks ◽

Cascade Structure ◽

High Resolution Images

In this paper, we propose an Attentional Concatenation Generative Adversarial Network (ACGAN) aiming at generating 1024 × 1024 high-resolution images. First, we propose a multilevel cascade structure, for text-to-image synthesis. During training progress, we gradually add new layers and, at the same time, use the results and word vectors from the previous layer as inputs to the next layer to generate high-resolution images with photo-realistic details. Second, the deep attentional multimodal similarity model is introduced into the network, and we match word vectors with images in a common semantic space to compute a fine-grained matching loss for training the generator. In this way, we can pay attention to the fine-grained information of the word level in the semantics. Finally, the measure of diversity is added to the discriminator, which enables the generator to obtain more diverse gradient directions and improve the diversity of generated samples. The experimental results show that the inception scores of the proposed model on the CUB and Oxford-102 datasets have reached 4.48 and 4.16, improved by 2.75% and 6.42% compared to Attentional Generative Adversarial Networks (AttenGAN). The ACGAN model has a better effect on text-generated images, and the resulting image is closer to the real image.

Download Full-text

ResFPA-GAN: Text-to-Image Synthesis with Generative Adversarial Network Based on Residual Block Feature Pyramid Attention

2019 IEEE International Conference on Advanced Robotics and its Social Impacts (ARSO) ◽

10.1109/arso46408.2019.8948717 ◽

2019 ◽

Author(s):

Jingcong Sun ◽

Yimin Zhou ◽

Bin Zhang

Keyword(s):

Image Synthesis ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Feature Pyramid ◽

Residual Block

Download Full-text