Realistic Face Image Generation Based on Generative Adversarial Network

Author(s): Ting Zhang, Wen-Hong Tian, Ting-Ying Zheng, Zu-Ning Li, Xue-Mei Du, et al.
2021, Vol 11 (4), pp. 1380
Author(s): Yingbo Zhou, Pengcheng Zhao, Weiqin Tong, Yongxin Zhu

While Generative Adversarial Networks (GANs) have shown promising performance in image generation, they suffer from issues such as mode collapse and training instability. To stabilize GAN training and improve image synthesis quality and diversity, we propose a simple yet effective approach called Contrastive Distance Learning GAN (CDL-GAN). Specifically, we add Consistent Contrastive Distance (CoCD) and Characteristic Contrastive Distance (ChCD) terms to a principled framework to improve GAN performance. CoCD explicitly maximizes the ratio of the distance between generated images to the increment between their noise vectors, strengthening image feature learning for the generator. ChCD measures the sampling distance of the encoded images in Euler space to boost feature representations for the discriminator. We implement the framework by employing a Siamese network as a module within the GAN, without any modification to the backbone. Qualitative and quantitative experiments on three public datasets demonstrate the effectiveness of our method.
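For a concrete picture of the CoCD idea, the following is a minimal PyTorch-style sketch, not the authors' implementation; the generator G, the latent dimensionality, and the weighting lambda_cocd are assumptions, and the loss is written from the abstract's description (maximizing the ratio of image distance to latent distance).

```python
import torch

def cocd_loss(G, z1, z2, eps=1e-8):
    """Consistent Contrastive Distance (sketch, assumed form): encourage the
    distance between two generated images to grow with the distance between
    their latent codes, i.e. maximize ||G(z1) - G(z2)|| / ||z1 - z2||.
    Returned as a quantity to *minimize* (negative ratio)."""
    img_dist = (G(z1) - G(z2)).flatten(1).norm(dim=1)    # per-sample image distance
    noise_dist = (z1 - z2).flatten(1).norm(dim=1) + eps  # per-sample latent distance
    return -(img_dist / noise_dist).mean()

# hypothetical usage inside the generator update:
# z1, z2 = torch.randn(16, 128), torch.randn(16, 128)
# loss_g = adversarial_loss + lambda_cocd * cocd_loss(G, z1, z2)
```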


2021
Author(s): Jialu Huang, Ying Huang, Yan-ting Lin, Zi-yang Liu, Yang Lin, et al.

2020, Vol 34 (06), pp. 10402-10409
Author(s): Tianying Wang, Wei Qi Toh, Hao Zhang, Xiuchao Sui, Shaohua Li, et al.

Robotic drawing has become increasingly popular as an entertainment and interactive tool. In this paper, we present RoboCoDraw, a real-time collaborative robot-based drawing system that draws stylized human face sketches interactively in front of human users, using Generative Adversarial Network (GAN)-based style transfer and Random-Key Genetic Algorithm (RKGA)-based path optimization. The proposed RoboCoDraw system takes a real human face image as input, converts it to a stylized avatar, and then draws it with a robotic arm. A core component of the system is our proposed AvatarGAN, which generates a cartoon avatar face image from a real human face. AvatarGAN is trained with unpaired face and avatar images only, and it generates avatars with much better likeness to the input human faces than the vanilla CycleGAN. After the avatar image is generated, it is fed to a line-extraction algorithm and converted to sketches. An RKGA-based path optimization algorithm is then applied to find a time-efficient drawing path to be executed by the robotic arm. We demonstrate the capability of RoboCoDraw on various face images using a lightweight, safe collaborative UR5 robot.
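To illustrate how a random-key genetic algorithm can order drawing strokes, here is a minimal sketch under assumed inputs (arrays of stroke start and end points); the population size, operators, and the helper names rkga_order and travel_cost are illustrative and not the paper's implementation.

```python
import numpy as np

def travel_cost(order, starts, ends):
    """Total pen-up travel: distance from each stroke's end to the next stroke's start."""
    return sum(np.linalg.norm(starts[b] - ends[a]) for a, b in zip(order[:-1], order[1:]))

def rkga_order(starts, ends, pop=60, gens=200, elite=10, mut=0.1, seed=0):
    """Random-Key GA sketch: chromosomes are real-valued keys, one per stroke;
    argsort of the keys decodes a drawing order. Elitist selection, biased
    uniform crossover against elite parents, and random-reset mutation."""
    rng = np.random.default_rng(seed)
    n = len(starts)
    keys = rng.random((pop, n))
    for _ in range(gens):
        fit = np.array([travel_cost(np.argsort(k), starts, ends) for k in keys])
        keys = keys[np.argsort(fit)]              # lower travel cost is better
        children = []
        for _ in range(pop - elite):
            p1 = keys[rng.integers(elite)]        # elite parent
            p2 = keys[rng.integers(pop)]          # random parent
            child = np.where(rng.random(n) < 0.7, p1, p2)
            reset = rng.random(n) < mut           # mutation: re-draw some keys
            child[reset] = rng.random(reset.sum())
            children.append(child)
        keys = np.vstack([keys[:elite], children])
    best = min(keys, key=lambda k: travel_cost(np.argsort(k), starts, ends))
    return np.argsort(best)
```

Decoding by argsort keeps every chromosome feasible, which is the main appeal of random keys for ordering problems like stroke sequencing.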


Sensors, 2020, Vol 20 (7), pp. 1810
Author(s): Dat Tien Nguyen, Tuyen Danh Pham, Ganbayar Batchuluun, Kyoung Jun Noh, Kang Ryoung Park

Although face-based biometric recognition systems have been widely used in many applications, this type of recognition remains vulnerable to presentation attacks, which use fake samples to deceive the recognition system. To overcome this problem, presentation attack detection (PAD) methods for face recognition systems (face-PAD), which aim to classify real and presentation-attack face images before the recognition task, have been developed. However, the performance of PAD systems is limited and biased by the lack of presentation-attack images for training. In this paper, we propose a method for artificially generating presentation-attack face images by learning the characteristics of real and presentation-attack images from a few captured samples. As a result, our method reduces the time needed to collect presentation-attack samples for training PAD systems and can potentially enhance PAD performance. Our study is the first attempt to generate presentation-attack face images for a face-PAD system based on CycleGAN, a deep-learning-based framework for image generation. In addition, we propose a new measurement method to evaluate the quality of the generated presentation-attack images using a face-PAD system. Through experiments on two public datasets (CASIA and Replay-mobile), we show that the generated face images capture the characteristics of presentation-attack images, making them usable in place of captured presentation-attack samples for PAD system training.
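As a rough sketch of the CycleGAN-style objective such a generator could be trained with (assuming two generators G_r2a, G_a2r and two discriminators D_a, D_r; the loss form and weighting are standard CycleGAN choices, not necessarily the authors' exact setup):

```python
import torch
import torch.nn.functional as F

def cyclegan_generator_loss(G_r2a, G_a2r, D_a, D_r, real, attack, lam=10.0):
    """CycleGAN-style generator objective (sketch) for translating real face
    images into presentation-attack-style images and back, trained from
    unpaired sets of real and attack images."""
    fake_attack = G_r2a(real)          # real -> synthetic attack
    fake_real = G_a2r(attack)          # attack -> synthetic real
    pred_fa, pred_fr = D_a(fake_attack), D_r(fake_real)
    # least-squares adversarial terms: each generator tries to fool its discriminator
    adv = F.mse_loss(pred_fa, torch.ones_like(pred_fa)) + \
          F.mse_loss(pred_fr, torch.ones_like(pred_fr))
    # cycle consistency: translating forth and back should reconstruct the input
    cyc = F.l1_loss(G_a2r(fake_attack), real) + F.l1_loss(G_r2a(fake_real), attack)
    return adv + lam * cyc
```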


2020
Author(s): Yiu-ming Cheung, Mengke Li

Complete face recovery (CFR) aims to recover the complete face image from a partial face image of a target person whose photo may not be included in the gallery set. CFR has several attractive potential applications but is challenging, and to the best of our knowledge the problem has yet to be explored in the literature. This paper therefore proposes an identity-preserved CFR approach (IP-CFR). First, a denoising-autoencoder-based network is applied to acquire discriminative features. Then, we propose an identity-preserved loss function to retain personal identity information. The acquired features are fed into a new variant of the generative adversarial network (GAN) to restore the complete face image, and a two-pathway discriminator is leveraged to enhance the quality of the recovered image. Experimental results on benchmark datasets show the promising performance of the proposed approach.
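A minimal sketch of what an identity-preserving term can look like, assuming a frozen, pretrained face-recognition encoder; the paper's exact loss formulation may differ.

```python
import torch
import torch.nn.functional as F

def identity_preserved_loss(face_encoder, recovered, target):
    """Identity-preservation term (sketch, assumed form): penalize the gap
    between identity embeddings of the recovered face and the ground-truth
    face. `face_encoder` is an assumed pretrained face-recognition network
    kept frozen during training."""
    with torch.no_grad():
        target_id = F.normalize(face_encoder(target), dim=1)
    recovered_id = F.normalize(face_encoder(recovered), dim=1)
    # 1 - cosine similarity per sample, averaged over the batch
    return (1.0 - (recovered_id * target_id).sum(dim=1)).mean()
```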


2021, Vol 2021, pp. 1-10
Author(s): Yi Gu, Qiankun Zheng

Background. Medical image generation converts existing medical images into one or more required medical images, reducing the time needed for diagnosis and the radiation exposure from taking multiple scans. Research on medical image generation therefore has important clinical significance. Many methods exist in this field. For example, in image generation based on fuzzy C-means (FCM) clustering, the clustering scheme leaves the assignment of some tissue regions uncertain, so details of the generated image are unclear and the resulting image quality is low. With the development of the generative adversarial network (GAN) model, many improved methods based on deep GAN models have been proposed. Pix2Pix is a GAN model based on U-Net; its core idea is to fit a deep neural network on paired images of the two modalities, thereby generating high-quality images. Its disadvantage is that the data requirements are strict: the two types of medical images must be paired one by one. DualGAN is a model based on transfer learning that cuts a 3D volume into multiple 2D slices, translates each slice, and merges the generated results. Its disadvantage is that bar-shaped "shadows" appear in the generated three-dimensional image.

Method/Material. To solve the above problems and ensure generation quality, this paper proposes a Dual3D&PatchGAN model based on transfer learning. Because Dual3D&PatchGAN builds on transfer learning, no one-to-one paired datasets are needed; only two sets of medical images are required, which is of practical importance for applications. The model eliminates the bar-shaped "shadows" produced by DualGAN's generated images and can also perform two-way conversion between the two image types.

Results. Multiple evaluation metrics in the experiments indicate that Dual3D&PatchGAN is better suited to medical image generation than the other models and yields better generation results.
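For reference, a patch-based discriminator of the kind suggested by the "PatchGAN" name might look like the following sketch; the channel widths, depth, and normalization are assumptions, not the paper's configuration.

```python
import torch.nn as nn

class PatchDiscriminator(nn.Module):
    """PatchGAN-style discriminator (sketch): instead of a single real/fake
    score per image, it outputs a grid of scores, each judging one local
    image patch, which tends to sharpen local texture in generated images."""
    def __init__(self, in_ch=1, base=64):
        super().__init__()
        def block(cin, cout, stride):
            return [nn.Conv2d(cin, cout, 4, stride, 1),
                    nn.InstanceNorm2d(cout),
                    nn.LeakyReLU(0.2, inplace=True)]
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, base, 4, 2, 1), nn.LeakyReLU(0.2, inplace=True),
            *block(base, base * 2, 2),
            *block(base * 2, base * 4, 2),
            *block(base * 4, base * 8, 1),
            nn.Conv2d(base * 8, 1, 4, 1, 1),   # one logit per receptive-field patch
        )

    def forward(self, x):
        return self.net(x)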


Sensors, 2019, Vol 19 (21), pp. 4818
Author(s): Hyun-Koo Kim, Kook-Yeol Yoo, Ju H. Park, Ho-Youl Jung

In this paper, we propose a method of generating a color image from light detection and ranging (LiDAR) 3D reflection intensity. The proposed method consists of two steps: projection of the LiDAR 3D reflection intensity into a 2D intensity image, and color image generation from the projected intensity using a fully convolutional network (FCN). Because the color image must be generated from a very sparse projected intensity image, the FCN is designed with an asymmetric network structure, i.e., the decoder is deeper than the encoder. The well-known KITTI dataset, covering various scenarios, is used for FCN training and performance evaluation. The performance of the asymmetric network structure is empirically analyzed for various encoder and decoder depth combinations. Simulations show that the proposed method generates images of fairly good visual quality while maintaining almost the same color as the ground-truth image. Moreover, the proposed FCN substantially outperforms conventional interpolation methods and the GAN-based Pix2Pix. One interesting result is that the proposed FCN produces shadow-free, daylight color images; this is because LiDAR data are produced by active light reflection and are therefore not affected by sunlight and shadow.
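A minimal sketch of an asymmetric encoder-decoder in the spirit described here, with a deliberately deeper decoder; the layer counts and channel widths are illustrative assumptions, not the paper's architecture.

```python
import torch.nn as nn

class AsymmetricFCN(nn.Module):
    """Asymmetric encoder-decoder sketch for sparse-intensity-to-color
    generation: a shallow encoder and a deeper decoder with extra
    refinement convolutions, mirroring the idea that the decoder is
    deeper than the encoder."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(                 # two downsampling stages
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.decoder = nn.Sequential(                 # deeper: extra refinement convs
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, 3, 3, padding=1),           # 3-channel color output
        )

    def forward(self, x):        # x: (N, 1, H, W) projected LiDAR intensity
        return self.decoder(self.encoder(x))
```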

