CR-GAN: Learning Complete Representations for Multi-view Generation

Generating multi-view images from a single-view input is an important yet challenging problem. It has broad applications in vision, graphics, and robotics. Our study indicates that the widely-used generative adversarial network (GAN) may learn ?incomplete? representations due to the single-pathway framework: an encoder-decoder network followed by a discriminator network.We propose CR-GAN to address this problem. In addition to the single reconstruction path, we introduce a generation sideway to maintain the completeness of the learned embedding space. The two learning paths collaborate and compete in a parameter-sharing manner, yielding largely improved generality to ?unseen? dataset. More importantly, the two-pathway framework makes it possible to combine both labeled and unlabeled data for self-supervised learning, which further enriches the embedding space for realistic generations. We evaluate our approach on a wide range of datasets. The results prove that CR-GAN significantly outperforms state-of-the-art methods, especially when generating from ?unseen? inputs in wild conditions.

Download Full-text

Disentangled generative adversarial network for low-dose CT

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-021-00749-z ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Wenchao Du ◽

Hu Chen ◽

Hongyu Yang ◽

Yi Zhang

Keyword(s):

Network Architecture ◽

Low Dose ◽

Noise Suppression ◽

State Of The Art ◽

Visual Quality ◽

Ct Images ◽

Generative Adversarial Network ◽

Low Dose Ct ◽

Adversarial Network ◽

Suppression Method

AbstractGenerative adversarial network (GAN) has been applied for low-dose CT images to predict normal-dose CT images. However, the undesired artifacts and details bring uncertainty to the clinical diagnosis. In order to improve the visual quality while suppressing the noise, in this paper, we mainly studied the two key components of deep learning based low-dose CT (LDCT) restoration models—network architecture and adversarial loss, and proposed a disentangled noise suppression method based on GAN (DNSGAN) for LDCT. Specifically, a generator network, which contains the noise suppression and structure recovery modules, is proposed. Furthermore, a multi-scaled relativistic adversarial loss is introduced to preserve the finer structures of generated images. Experiments on simulated and real LDCT datasets show that the proposed method can effectively remove noise while recovering finer details and provide better visual perception than other state-of-the-art methods.

Download Full-text

Generative Adversarial Networks-Based Semi-Supervised Automatic Modulation Recognition for Cognitive Radio Networks

Sensors ◽

10.3390/s18113913 ◽

2018 ◽

Vol 18 (11) ◽

pp. 3913 ◽

Cited By ~ 6

Author(s):

Mingxuan Li ◽

Ou Li ◽

Guangyi Liu ◽

Ce Zhang

Keyword(s):

Deep Learning ◽

Cognitive Radio ◽

Supervised Learning ◽

Rapid Development ◽

Generative Adversarial Networks ◽

Modulation Recognition ◽

Learning Methods ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Automatic Modulation Recognition

With the recently explosive growth of deep learning, automatic modulation recognition has undergone rapid development. Most of the newly proposed methods are dependent on large numbers of labeled samples. We are committed to using fewer labeled samples to perform automatic modulation recognition in the cognitive radio domain. Here, a semi-supervised learning method based on adversarial training is proposed which is called signal classifier generative adversarial network. Most of the prior methods based on this technology involve computer vision applications. However, we improve the existing network structure of a generative adversarial network by adding the encoder network and a signal spatial transform module, allowing our framework to address radio signal processing tasks more efficiently. These two technical improvements effectively avoid nonconvergence and mode collapse problems caused by the complexity of the radio signals. The results of simulations show that compared with well-known deep learning methods, our method improves the classification accuracy on a synthetic radio frequency dataset by 0.1% to 12%. In addition, we verify the advantages of our method in a semi-supervised scenario and obtain a significant increase in accuracy compared with traditional semi-supervised learning methods.

Download Full-text

Generative Adversarial Network-Based Semi-supervised Learning for Pathological Speech Classification

Statistical Language and Speech Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-59430-5_14 ◽

2020 ◽

pp. 169-181

Author(s):

Nam H. Trinh ◽

Darragh O’Brien

Keyword(s):

Supervised Learning ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Speech Classification

Download Full-text

Learning a Generative Model for Fusing Infrared and Visible Images via Conditional Generative Adversarial Network with Dual Discriminators

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/549 ◽

2019 ◽

Cited By ~ 12

Author(s):

Han Xu ◽

Pengwei Liang ◽

Wei Yu ◽

Junjun Jiang ◽

Jiayi Ma

Keyword(s):

Probability Distribution ◽

State Of The Art ◽

Infrared Image ◽

Infrared Images ◽

Generative Adversarial Network ◽

Visible Image ◽

Qualitative And Quantitative ◽

Adversarial Network ◽

Fused Image ◽

Visible Images

In this paper, we propose a new end-to-end model, called dual-discriminator conditional generative adversarial network (DDcGAN), for fusing infrared and visible images of different resolutions. Unlike the pixel-level methods and existing deep learning-based methods, the fusion task is accomplished through the adversarial process between a generator and two discriminators, in addition to the specially designed content loss. The generator is trained to generate real-like fused images to fool discriminators. The two discriminators are trained to calculate the JS divergence between the probability distribution of downsampled fused images and infrared images, and the JS divergence between the probability distribution of gradients of fused images and gradients of visible images, respectively. Thus, the fused images can compensate for the features that are not constrained by the single content loss. Consequently, the prominence of thermal targets in the infrared image and the texture details in the visible image can be preserved or even enhanced in the fused image simultaneously. Moreover, by constraining and distinguishing between the downsampled fused image and the low-resolution infrared image, DDcGAN can be preferably applied to the fusion of different resolution images. Qualitative and quantitative experiments on publicly available datasets demonstrate the superiority of our method over the state-of-the-art.

Download Full-text

Self-Supervised Learning of Depth and Pose Using Cycle Generative Adversarial Network

2020 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip40778.2020.9191124 ◽

2020 ◽

Author(s):

Yunhe Tong ◽

Anjie Wang ◽

Songchao Tan ◽

Shanshe Wang ◽

Siwei Ma ◽

...

Keyword(s):

Supervised Learning ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Combination of Active Learning and Semi-Supervised Learning under a Self-Training Scheme

Entropy ◽

10.3390/e21100988 ◽

2019 ◽

Vol 21 (10) ◽

pp. 988 ◽

Cited By ~ 4

Author(s):

Fazakis ◽

Kanas ◽

Aridas ◽

Karlos ◽

Kotsiantis

Keyword(s):

Active Learning ◽

Supervised Learning ◽

Unlabeled Data ◽

Classification Algorithms ◽

Training Phase ◽

Learning Methods ◽

Training Scheme ◽

Wide Range ◽

Benchmark Datasets ◽

Scientific Fields

One of the major aspects affecting the performance of the classification algorithms is the amount of labeled data which is available during the training phase. It is widely accepted that the labeling procedure of vast amounts of data is both expensive and time-consuming since it requires the employment of human expertise. For a wide variety of scientific fields, unlabeled examples are easy to collect but hard to handle in a useful manner, thus improving the contained information for a subject dataset. In this context, a variety of learning methods have been studied in the literature aiming to efficiently utilize the vast amounts of unlabeled data during the learning process. The most common approaches tackle problems of this kind by individually applying active learning or semi-supervised learning methods. In this work, a combination of active learning and semi-supervised learning methods is proposed, under a common self-training scheme, in order to efficiently utilize the available unlabeled data. The effective and robust metrics of the entropy and the distribution of probabilities of the unlabeled set, to select the most sufficient unlabeled examples for the augmentation of the initial labeled set, are used. The superiority of the proposed scheme is validated by comparing it against the base approaches of supervised, semi-supervised, and active learning in the wide range of fifty-five benchmark datasets.

Download Full-text

Insulator Segmentation for Power Line Inspection Based on Modified Conditional Generative Adversarial Network

Journal of Sensors ◽

10.1155/2019/4245329 ◽

2019 ◽

Vol 2019 ◽

pp. 1-8

Author(s):

Zishu Gao ◽

Guodong Yang ◽

En Li ◽

Tianyu Shen ◽

Zhe Wang ◽

...

Keyword(s):

Transmission Lines ◽

Power Supply ◽

Power Transmission ◽

State Of The Art ◽

Power Line ◽

Power Transmission Lines ◽

Generative Adversarial Network ◽

Convolutional Network ◽

Adversarial Network ◽

Feature Information

There are a large number of insulators on the transmission line, and insulator damage will have a major impact on power supply security. Image-based segmentation of the insulators in the power transmission lines is a premise and also a critical task for power line inspection. In this paper, a modified conditional generative adversarial network for insulator pixel-level segmentation is proposed. The generator is reconstructed by encoder-decoder layers with asymmetric convolution kernel which can simplify the network complexity and extract more kinds of feature information. The discriminator is composed of a fully convolutional network based on patchGAN and learns the loss to train the generator. It is verified in experiments that the proposed method has better performances on mIoU and computational efficiency than Pix2pix, SegNet, and other state-of-the-art networks.

Download Full-text

Generative adversarial network-based semi-supervised learning for real-time risk warning of process industries

Expert Systems with Applications ◽

10.1016/j.eswa.2020.113244 ◽

2020 ◽

Vol 150 ◽

pp. 113244 ◽

Cited By ~ 3

Author(s):

Rui He ◽

Xinhong Li ◽

Guoming Chen ◽

Guoxing Chen ◽

Yiwei Liu

Keyword(s):

Supervised Learning ◽

Real Time ◽

Generative Adversarial Network ◽

Process Industries ◽

Adversarial Network ◽

Risk Warning

Download Full-text

Joint Entity and Event Extraction with Generative Adversarial Imitation Learning

Data Intelligence ◽

10.1162/dint_a_00014 ◽

2019 ◽

Vol 1 (2) ◽

pp. 99-120 ◽

Cited By ~ 6

Author(s):

Tongtao Zhang ◽

Heng Ji ◽

Avirup Sil

Keyword(s):

State Of The Art ◽

Ground Truth ◽

Event Extraction ◽

Imitation Learning ◽

Learning Method ◽

Inverse Reinforcement Learning ◽

Generative Adversarial Network ◽

Adversarial Network ◽

The Difference ◽

New Framework

We propose a new framework for entity and event extraction based on generative adversarial imitation learning—an inverse reinforcement learning method using a generative adversarial network (GAN). We assume that instances and labels yield to various extents of difficulty and the gains and penalties (rewards) are expected to be diverse. We utilize discriminators to estimate proper rewards according to the difference between the labels committed by the ground-truth (expert) and the extractor (agent). Our experiments demonstrate that the proposed framework outperforms state-of-the-art methods.

Download Full-text

An image-computable model of human visual shape similarity

PLoS Computational Biology ◽

10.1371/journal.pcbi.1008981 ◽

2021 ◽

Vol 17 (6) ◽

pp. e1008981

Author(s):

Yaniv Morgenstern ◽

Frieder Hartmann ◽

Filipp Schmidt ◽

Henning Tiedemann ◽

Eugen Prokott ◽

...

Keyword(s):

Shape Similarity ◽

Computable Model ◽

Object Representations ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Wide Range ◽

Invaluable Tool ◽

Different Shapes ◽

Similarity Judgments ◽

Visual Shape

Shape is a defining feature of objects, and human observers can effortlessly compare shapes to determine how similar they are. Yet, to date, no image-computable model can predict how visually similar or different shapes appear. Such a model would be an invaluable tool for neuroscientists and could provide insights into computations underlying human shape perception. To address this need, we developed a model (‘ShapeComp’), based on over 100 shape features (e.g., area, compactness, Fourier descriptors). When trained to capture the variance in a database of >25,000 animal silhouettes, ShapeComp accurately predicts human shape similarity judgments between pairs of shapes without fitting any parameters to human data. To test the model, we created carefully selected arrays of complex novel shapes using a Generative Adversarial Network trained on the animal silhouettes, which we presented to observers in a wide range of tasks. Our findings show that incorporating multiple ShapeComp dimensions facilitates the prediction of human shape similarity across a small number of shapes, and also captures much of the variance in the multiple arrangements of many shapes. ShapeComp outperforms both conventional pixel-based metrics and state-of-the-art convolutional neural networks, and can also be used to generate perceptually uniform stimulus sets, making it a powerful tool for investigating shape and object representations in the human brain.

Download Full-text