Person image generation through graph-based and appearance-decomposed generative adversarial network

Due to the sophisticated entanglements for non-rigid deformation, generating person images from source pose to target pose is a challenging work. In this paper, we present a novel framework to generate person images with shape consistency and appearance consistency. The proposed framework leverages the graph network to infer the global relationship of source pose and target pose in a graph for better pose transfer. Moreover, we decompose the source image into different attributes (e.g., hair, clothes, pants and shoes) and combine them with the pose coding through operation method to generate a more realistic person image. We adopt an alternate updating strategy to promote mutual guidance between pose modules and appearance modules for better person image quality. Qualitative and quantitative experiments were carried out on the DeepFashion dateset. The efficacy of the presented framework are verified.

Download Full-text

An Underwater Image Enhancement Algorithm Based on Generative Adversarial Network and Natural Image Quality Evaluation Index

Journal of Marine Science and Engineering ◽

10.3390/jmse9070691 ◽

2021 ◽

Vol 9 (7) ◽

pp. 691

Author(s):

Kai Hu ◽

Yanwen Zhang ◽

Chenghang Weng ◽

Pengsheng Wang ◽

Zhiliang Deng ◽

...

Keyword(s):

Image Quality ◽

Image Enhancement ◽

Quality Evaluation ◽

High Efficiency ◽

Natural Image ◽

High Quality ◽

Generative Adversarial Network ◽

Image Quality Evaluation ◽

Adversarial Network ◽

Underwater Image

When underwater vehicles work, underwater images are often absorbed by light and scattered and diffused by floating objects, which leads to the degradation of underwater images. The generative adversarial network (GAN) is widely used in underwater image enhancement tasks because it can complete image-style conversions with high efficiency and high quality. Although the GAN converts low-quality underwater images into high-quality underwater images (truth images), the dataset of truth images also affects high-quality underwater images. However, an underwater truth image lacks underwater image enhancement, which leads to a poor effect of the generated image. Thus, this paper proposes to add the natural image quality evaluation (NIQE) index to the GAN to provide generated images with higher contrast and make them more in line with the perception of the human eye, and at the same time, grant generated images a better effect than the truth images set by the existing dataset. In this paper, several groups of experiments are compared, and through the subjective evaluation and objective evaluation indicators, it is verified that the enhanced image of this algorithm is better than the truth image set by the existing dataset.

Download Full-text

CDL-GAN: Contrastive Distance Learning Generative Adversarial Network for Image Generation

Applied Sciences ◽

10.3390/app11041380 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1380

Author(s):

Yingbo Zhou ◽

Pengcheng Zhao ◽

Weiqin Tong ◽

Yongxin Zhu

Keyword(s):

Distance Learning ◽

Feature Learning ◽

Image Synthesis ◽

Image Feature ◽

Generative Adversarial Networks ◽

Image Generation ◽

Generative Adversarial Network ◽

Feature Representations ◽

Adversarial Network ◽

Public Datasets

While Generative Adversarial Networks (GANs) have shown promising performance in image generation, they suffer from numerous issues such as mode collapse and training instability. To stabilize GAN training and improve image synthesis quality with diversity, we propose a simple yet effective approach as Contrastive Distance Learning GAN (CDL-GAN) in this paper. Specifically, we add Consistent Contrastive Distance (CoCD) and Characteristic Contrastive Distance (ChCD) into a principled framework to improve GAN performance. The CoCD explicitly maximizes the ratio of the distance between generated images and the increment between noise vectors to strengthen image feature learning for the generator. The ChCD measures the sampling distance of the encoded images in Euler space to boost feature representations for the discriminator. We model the framework by employing Siamese Network as a module into GANs without any modification on the backbone. Both qualitative and quantitative experiments conducted on three public datasets demonstrate the effectiveness of our method.

Download Full-text

Contrast agent dose reduction in computed tomography with deep learning using a conditional generative adversarial network

European Radiology ◽

10.1007/s00330-021-07714-2 ◽

2021 ◽

Author(s):

Johannes Haubold ◽

René Hosch ◽

Lale Umutlu ◽

Axel Wetter ◽

Patrizia Haubold ◽

...

Keyword(s):

Image Quality ◽

Contrast Media ◽

Animal Experiments ◽

Turing Test ◽

Dual Energy ◽

Generative Adversarial Networks ◽

Arterial Phase ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks

Abstract Objectives To reduce the dose of intravenous iodine-based contrast media (ICM) in CT through virtual contrast-enhanced images using generative adversarial networks. Methods Dual-energy CTs in the arterial phase of 85 patients were randomly split into an 80/20 train/test collective. Four different generative adversarial networks (GANs) based on image pairs, which comprised one image with virtually reduced ICM and the original full ICM CT slice, were trained, testing two input formats (2D and 2.5D) and two reduced ICM dose levels (−50% and −80%). The amount of intravenous ICM was reduced by creating virtual non-contrast series using dual-energy and adding the corresponding percentage of the iodine map. The evaluation was based on different scores (L1 loss, SSIM, PSNR, FID), which evaluate the image quality and similarity. Additionally, a visual Turing test (VTT) with three radiologists was used to assess the similarity and pathological consistency. Results The −80% models reach an SSIM of > 98%, PSNR of > 48, L1 of between 7.5 and 8, and an FID of between 1.6 and 1.7. In comparison, the −50% models reach a SSIM of > 99%, PSNR of > 51, L1 of between 6.0 and 6.1, and an FID between 0.8 and 0.95. For the crucial question of pathological consistency, only the 50% ICM reduction networks achieved 100% consistency, which is required for clinical use. Conclusions The required amount of ICM for CT can be reduced by 50% while maintaining image quality and diagnostic accuracy using GANs. Further phantom studies and animal experiments are required to confirm these initial results. Key Points • The amount of contrast media required for CT can be reduced by 50% using generative adversarial networks. • Not only the image quality but especially the pathological consistency must be evaluated to assess safety. • A too pronounced contrast media reduction could influence the pathological consistency in our collective at 80%.

Download Full-text

Does Generative Adversarial Network (GAN) help in SRAF image generation?

10.1109/iwaps54037.2021.9671262 ◽

2021 ◽

Author(s):

Jialu Huang ◽

Ying Huang ◽

Yan-ting Lin ◽

Zi-yang Liu ◽

Yang Lin ◽

...

Keyword(s):

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Learning a Generative Model for Fusing Infrared and Visible Images via Conditional Generative Adversarial Network with Dual Discriminators

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/549 ◽

2019 ◽

Cited By ~ 12

Author(s):

Han Xu ◽

Pengwei Liang ◽

Wei Yu ◽

Junjun Jiang ◽

Jiayi Ma

Keyword(s):

Probability Distribution ◽

State Of The Art ◽

Infrared Image ◽

Infrared Images ◽

Generative Adversarial Network ◽

Visible Image ◽

Qualitative And Quantitative ◽

Adversarial Network ◽

Fused Image ◽

Visible Images

In this paper, we propose a new end-to-end model, called dual-discriminator conditional generative adversarial network (DDcGAN), for fusing infrared and visible images of different resolutions. Unlike the pixel-level methods and existing deep learning-based methods, the fusion task is accomplished through the adversarial process between a generator and two discriminators, in addition to the specially designed content loss. The generator is trained to generate real-like fused images to fool discriminators. The two discriminators are trained to calculate the JS divergence between the probability distribution of downsampled fused images and infrared images, and the JS divergence between the probability distribution of gradients of fused images and gradients of visible images, respectively. Thus, the fused images can compensate for the features that are not constrained by the single content loss. Consequently, the prominence of thermal targets in the infrared image and the texture details in the visible image can be preserved or even enhanced in the fused image simultaneously. Moreover, by constraining and distinguishing between the downsampled fused image and the low-resolution infrared image, DDcGAN can be preferably applied to the fusion of different resolution images. Qualitative and quantitative experiments on publicly available datasets demonstrate the superiority of our method over the state-of-the-art.

Download Full-text

Improvement of Image Quality of Cone-beam CT Images by Three-dimensional Generative Adversarial Network

10.1109/embc46164.2021.9629952 ◽

2021 ◽

Author(s):

Takumi Hase ◽

Megumi Nakao ◽

Keiho Imanishi ◽

Mitsuhiro Nakamura ◽

Tetsuya Matsuda

Keyword(s):

Image Quality ◽

Cone Beam Ct ◽

Three Dimensional ◽

Ct Images ◽

Cone Beam ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Low-Light Image Enhancement Based on Joint Generative Adversarial Network and Image Quality Assessment

2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) ◽

10.1109/cisp-bmei.2018.8633150 ◽

2018 ◽

Author(s):

Wei Hua ◽

Youshen Xia

Keyword(s):

Image Quality ◽

Quality Assessment ◽

Image Enhancement ◽

Image Quality Assessment ◽

Low Light ◽

Generative Adversarial Network ◽

Light Image ◽

Adversarial Network

Download Full-text

Presentation Attack Face Image Generation Based on a Deep Generative Adversarial Network

Sensors ◽

10.3390/s20071810 ◽

2020 ◽

Vol 20 (7) ◽

pp. 1810

Author(s):

Dat Tien Nguyen ◽

Tuyen Danh Pham ◽

Ganbayar Batchuluun ◽

Kyoung Jun Noh ◽

Kang Ryoung Park

Keyword(s):

Recognition Task ◽

Recognition System ◽

Attack Detection ◽

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Face Images ◽

Problem Presentation ◽

Recognition Systems ◽

Public Datasets

Although face-based biometric recognition systems have been widely used in many applications, this type of recognition method is still vulnerable to presentation attacks, which use fake samples to deceive the recognition system. To overcome this problem, presentation attack detection (PAD) methods for face recognition systems (face-PAD), which aim to classify real and presentation attack face images before performing a recognition task, have been developed. However, the performance of PAD systems is limited and biased due to the lack of presentation attack images for training PAD systems. In this paper, we propose a method for artificially generating presentation attack face images by learning the characteristics of real and presentation attack images using a few captured images. As a result, our proposed method helps save time in collecting presentation attack samples for training PAD systems and possibly enhance the performance of PAD systems. Our study is the first attempt to generate PA face images for PAD system based on CycleGAN network, a deep-learning-based framework for image generation. In addition, we propose a new measurement method to evaluate the quality of generated PA images based on a face-PAD system. Through experiments with two public datasets (CASIA and Replay-mobile), we show that the generated face images can capture the characteristics of presentation attack images, making them usable as captured presentation attack samples for PAD system training.

Download Full-text