Domain Adaptation for Imitation Learning Using Generative Adversarial Network

Imitation learning is an effective approach for an autonomous agent to learn control policies when an explicit reward function is unavailable, using demonstrations provided from an expert. However, standard imitation learning methods assume that the agents and the demonstrations provided by the expert are in the same domain configuration. Such an assumption has made the learned policies difficult to apply in another distinct domain. The problem is formalized as domain adaptive imitation learning, which is the process of learning how to perform a task optimally in a learner domain, given demonstrations of the task in a distinct expert domain. We address the problem by proposing a model based on Generative Adversarial Network. The model aims to learn both domain-shared and domain-specific features and utilizes it to find an optimal policy across domains. The experimental results show the effectiveness of our model in a number of tasks ranging from low to complex high-dimensional.

Download Full-text

Generative Adversarial Network for Imitation Learning from Single Demonstration

Baghdad Science Journal ◽

10.21123/bsj.2021.18.4(suppl.).1350 ◽

2021 ◽

Vol 18 (4(Suppl.)) ◽

pp. 1350

Author(s):

Tho Nguyen Duc ◽

Chanh Minh Tran ◽

Phan Xuan Tan ◽

Eiji Kamioka

Keyword(s):

Imitation Learning ◽

Autonomous Agent ◽

Complex Behavior ◽

Learning Methods ◽

Generative Adversarial Network ◽

Complex Tasks ◽

Adversarial Network ◽

Proposed Model ◽

Optimal Policies ◽

Single Demonstration

Imitation learning is an effective method for training an autonomous agent to accomplish a task by imitating expert behaviors in their demonstrations. However, traditional imitation learning methods require a large number of expert demonstrations in order to learn a complex behavior. Such a disadvantage has limited the potential of imitation learning in complex tasks where the expert demonstrations are not sufficient. In order to address the problem, we propose a Generative Adversarial Network-based model which is designed to learn optimal policies using only a single demonstration. The proposed model is evaluated on two simulated tasks in comparison with other methods. The results show that our proposed model is capable of completing considered tasks despite the limitation in the number of expert demonstrations, which clearly indicate the potential of our model.

Download Full-text

1D conditional generative adversarial network for spectrum-to-spectrum translation of simulated chemical reflectance signatures

Journal of Spectral Imaging ◽

10.1255/jsi.2021.a2 ◽

2021 ◽

Author(s):

Cara Murphy ◽

John Kerekes

Keyword(s):

Classification Accuracy ◽

Domain Adaptation ◽

Real Data ◽

Training Set ◽

Generative Adversarial Network ◽

Average Classification Accuracy ◽

Adversarial Network ◽

Chemical Residues ◽

Reflectance Data

The classification of trace chemical residues through active spectroscopic sensing is challenging due to the lack of physics-based models that can accurately predict spectra. To overcome this challenge, we leveraged the field of domain adaptation to translate data from the simulated to the measured domain for training a classifier. We developed the first 1D conditional generative adversarial network (GAN) to perform spectrum-to-spectrum translation of reflectance signatures. We applied the 1D conditional GAN to a library of simulated spectra and quantified the improvement in classification accuracy on real data using the translated spectra for training the classifier. Using the GAN-translated library, the average classification accuracy increased from 0.622 to 0.723 on real chemical reflectance data, including data from chemicals not included in the GAN training set.

Download Full-text

Conditional Generative Adversarial Network for Structured Domain Adaptation

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition ◽

10.1109/cvpr.2018.00145 ◽

2018 ◽

Cited By ~ 62

Author(s):

Weixiang Hong ◽

Zhenzhen Wang ◽

Ming Yang ◽

Junsong Yuan

Keyword(s):

Domain Adaptation ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Deep inverse reinforcement learning for structural evolution of small molecules

Briefings in Bioinformatics ◽

10.1093/bib/bbaa364 ◽

2020 ◽

Author(s):

Brighter Agyemang ◽

Wei-Ping Wu ◽

Daniel Addo ◽

Michael Y Kpiebaareh ◽

Ebenezer Nanor ◽

...

Keyword(s):

Reinforcement Learning ◽

High Throughput Screening ◽

Structural Evolution ◽

Search Space ◽

New Drugs ◽

Inverse Reinforcement Learning ◽

Generative Adversarial Network ◽

Entropy Maximization ◽

Reward Function ◽

Adversarial Network

Abstract The size and quality of chemical libraries to the drug discovery pipeline are crucial for developing new drugs or repurposing existing drugs. Existing techniques such as combinatorial organic synthesis and high-throughput screening usually make the process extraordinarily tough and complicated since the search space of synthetically feasible drugs is exorbitantly huge. While reinforcement learning has been mostly exploited in the literature for generating novel compounds, the requirement of designing a reward function that succinctly represents the learning objective could prove daunting in certain complex domains. Generative adversarial network-based methods also mostly discard the discriminator after training and could be hard to train. In this study, we propose a framework for training a compound generator and learn a transferable reward function based on the entropy maximization inverse reinforcement learning (IRL) paradigm. We show from our experiments that the IRL route offers a rational alternative for generating chemical compounds in domains where reward function engineering may be less appealing or impossible while data exhibiting the desired objective is readily available.

Download Full-text

Joint Entity and Event Extraction with Generative Adversarial Imitation Learning

Data Intelligence ◽

10.1162/dint_a_00014 ◽

2019 ◽

Vol 1 (2) ◽

pp. 99-120 ◽

Cited By ~ 6

Author(s):

Tongtao Zhang ◽

Heng Ji ◽

Avirup Sil

Keyword(s):

State Of The Art ◽

Ground Truth ◽

Event Extraction ◽

Imitation Learning ◽

Learning Method ◽

Inverse Reinforcement Learning ◽

Generative Adversarial Network ◽

Adversarial Network ◽

The Difference ◽

New Framework

We propose a new framework for entity and event extraction based on generative adversarial imitation learning—an inverse reinforcement learning method using a generative adversarial network (GAN). We assume that instances and labels yield to various extents of difficulty and the gains and penalties (rewards) are expected to be diverse. We utilize discriminators to estimate proper rewards according to the difference between the labels committed by the ground-truth (expert) and the extractor (agent). Our experiments demonstrate that the proposed framework outperforms state-of-the-art methods.

Download Full-text

Generative Adversarial Network Performance in Low-Dimensional Settings

Journal of Research of the National Institute of Standards and Technology ◽

10.6028/jres.126.008 ◽

2021 ◽

Vol 126 ◽

Author(s):

Felix Jimenez ◽

Amanda Koepke ◽

Mary Gregg ◽

Michael Frey

Keyword(s):

Neural Network ◽

Network Performance ◽

Training Data ◽

High Dimensional ◽

Generative Adversarial Network ◽

Low Dimensions ◽

Target Distribution ◽

Adversarial Network ◽

Low Dimensional ◽

And Training

A generative adversarial network (GAN) is an artificial neural network with a distinctive training architecture, designed to createexamples that faithfully reproduce a target distribution. GANs have recently had particular success in applications involvinghigh-dimensional distributions in areas such as image processing. Little work has been reported for low dimensions, where properties of GANs may be better identified and understood. We studied GAN performance in simulated low-dimensional settings, allowing us totransparently assess effects of target distribution complexity and training data sample size on GAN performance in a simpleexperiment. This experiment revealed two important forms of GAN error, tail underfilling and bridge bias, where the latter is analogousto the tunneling observed in high-dimensional GANs.

Download Full-text

Multi-Source Domain Adaptation for Visual Sentiment Classification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i03.5651 ◽

2020 ◽

Vol 34 (03) ◽

pp. 2661-2668

Author(s):

Chuang Lin ◽

Sicheng Zhao ◽

Lei Meng ◽

Tat-Seng Chua

Keyword(s):

Domain Adaptation ◽

Sentiment Classification ◽

Similar Distribution ◽

Single Source ◽

Target Domain ◽

Generative Adversarial Network ◽

Source Domain ◽

Adversarial Network ◽

Latent Space ◽

Benchmark Datasets

Existing domain adaptation methods on visual sentiment classification typically are investigated under the single-source scenario, where the knowledge learned from a source domain of sufficient labeled data is transferred to the target domain of loosely labeled or unlabeled data. However, in practice, data from a single source domain usually have a limited volume and can hardly cover the characteristics of the target domain. In this paper, we propose a novel multi-source domain adaptation (MDA) method, termed Multi-source Sentiment Generative Adversarial Network (MSGAN), for visual sentiment classification. To handle data from multiple source domains, it learns to find a unified sentiment latent space where data from both the source and target domains share a similar distribution. This is achieved via cycle consistent adversarial learning in an end-to-end manner. Extensive experiments conducted on four benchmark datasets demonstrate that MSGAN significantly outperforms the state-of-the-art MDA approaches for visual sentiment classification.

Download Full-text

Beyond Mutual Information: Generative Adversarial Network for Domain Adaptation using Information Bottleneck Constraint

IEEE Transactions on Medical Imaging ◽

10.1109/tmi.2021.3117996 ◽

2021 ◽

pp. 1-1

Author(s):

Jiawei Chen ◽

Ziqi Zhang ◽

Xinpeng Xie ◽

Yuexiang Li ◽

Tao Xu ◽

...

Keyword(s):

Mutual Information ◽

Domain Adaptation ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Information Bottleneck

Download Full-text

Unsupervised Domain Adaptation with Duplex Generative Adversarial Network

Domain Adaptation in Computer Vision with Deep Learning ◽

10.1007/978-3-030-45529-3_6 ◽

2020 ◽

pp. 97-116

Author(s):

Lanqing Hu ◽

Meina Kan ◽

Shiguang Shan ◽

Xilin Chen

Keyword(s):

Domain Adaptation ◽

Generative Adversarial Network ◽

Unsupervised Domain Adaptation ◽

Adversarial Network

Download Full-text

Improving Deep Interactive Evolution with a Style-Based Generator for Artistic Expression and Creative Exploration

Entropy ◽

10.3390/e23010011 ◽

2020 ◽

Vol 23 (1) ◽

pp. 11

Author(s):

Carlos Tejeda-Ocampo ◽

Armando López-Cuevas ◽

Hugo Terashima-Marin

Keyword(s):

Evolutionary Computation ◽

Artistic Expression ◽

Interactive Evolutionary Computation ◽

Generative Adversarial Network ◽

Domain Specific ◽

Adversarial Network ◽

Latent Space

Deep interactive evolution (DeepIE) combines the capacity of interactive evolutionary computation (IEC) to capture a user’s preference with the domain-specific robustness of a trained generative adversarial network (GAN) generator, allowing the user to control the GAN output through evolutionary exploration of the latent space. However, the traditional GAN latent space presents feature entanglement, which limits the practicability of possible applications of DeepIE. In this paper, we implement DeepIE within a style-based generator from a StyleGAN model trained on the WikiArt dataset and propose StyleIE, a variation of DeepIE that takes advantage of the secondary disentangled latent space in the style-based generator. We performed two AB/BA crossover user tests that compared the performance of DeepIE against StyleIE for art generation. Self-rated evaluations of the performance were collected through a questionnaire. Findings from the tests suggest that StyleIE and DeepIE perform equally in tasks with open-ended goals with relaxed constraints, but StyleIE performs better in close-ended and more constrained tasks.

Download Full-text