Symmetrical Adversarial Training Network: A Novel Model for Text Generation

Author(s): Yongzhen Gao, ChongJun Wang
2019
Author(s): Pei Ke, Fei Huang, Minlie Huang, Xiaoyan Zhu

Author(s): Jianing Li, Yanyan Lan, Jiafeng Guo, Jun Xu, Xueqi Cheng

Neural language models based on recurrent neural networks (RNNLM) have significantly improved the performance of text generation, yet the quality of the generated text, as measured by Turing Test pass rate, is still far from satisfactory. Some researchers propose adversarial training or reinforcement learning to improve quality; however, such methods usually introduce great challenges in training and parameter tuning. Through our analysis, we find that the problem with RNNLM comes from the use of maximum likelihood estimation (MLE) as the objective function, which requires the generated distribution to precisely recover the true distribution. This requirement favors high generation diversity, which restricts generation quality. It is not suitable when the overall quality is low, since high generation diversity usually indicates many errors rather than diverse good samples. In this paper, we propose to achieve differentiated distribution recovery (DDR for short). The key idea is to make the optimal generation probability proportional to the β-th power of the true probability, with β > 1. In this way, generation quality can be greatly improved by sacrificing the diversity contributed by noise and rare patterns. Experiments on synthetic data and two public text datasets show that our DDR method achieves a more flexible quality-diversity trade-off and a higher Turing Test pass rate than baseline methods including RNNLM, SeqGAN, and LeakGAN.
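To make the stated power-scaling idea concrete, the following is a minimal sketch (our own toy example, not the authors' code) showing that raising a token distribution to the power β > 1 and renormalizing concentrates probability mass on frequent patterns while lowering entropy, i.e., trading diversity for quality.

```python
# Minimal sketch: how raising a distribution to the power beta > 1
# trades diversity (entropy) for quality (mass on frequent patterns).
import numpy as np

def ddr_target(p, beta):
    """Distribution proportional to p**beta, renormalized."""
    q = p ** beta
    return q / q.sum()

def entropy(p):
    p = p[p > 0]
    return -(p * np.log(p)).sum()

# Toy "true" distribution: a few good patterns plus a long tail of noise.
p = np.array([0.30, 0.25, 0.15] + [0.30 / 30] * 30)

for beta in (1.0, 2.0, 4.0):
    q = ddr_target(p, beta)
    print(f"beta={beta}: mass on top-3 patterns={q[:3].sum():.3f}, "
          f"entropy={entropy(q):.3f}")
# As beta grows, mass concentrates on the frequent (high-quality) patterns
# and entropy drops -- the quality-diversity trade-off DDR exploits.
```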


2021, Vol 9, pp. 586-604
Author(s): Abbas Ghaddar, Philippe Langlais, Ahmad Rashid, Mehdi Rezagholizadeh

In this work, we examine the ability of NER models to use contextual information when predicting the type of an ambiguous entity. We introduce NRB, a new testbed carefully designed to diagnose the Name Regularity Bias of NER models. Our results indicate that all state-of-the-art models we tested exhibit such a bias, with fine-tuned BERT models significantly outperforming feature-based (LSTM-CRF) ones on NRB despite having comparable (sometimes lower) performance on standard benchmarks. To mitigate this bias, we propose a novel model-agnostic training method that adds learnable adversarial noise to some entity mentions, thereby forcing models to focus more strongly on the contextual signal; this leads to significant gains on NRB. Combining it with two other training strategies, data augmentation and parameter freezing, leads to further gains.
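As a rough illustration of where such a perturbation enters a tagger, here is a minimal, hypothetical sketch (not the authors' released code; the class names and masks are ours, and the paper trains the noise adversarially rather than as an ordinary parameter): a learnable noise vector is added to the embeddings of tokens inside entity mentions, so the model must lean on the surrounding context.

```python
# Hypothetical sketch: perturb only entity-mention embeddings so the
# tagger is pushed to rely on contextual cues. In the actual method the
# noise is optimized adversarially; here it is a plain learnable parameter.
import torch
import torch.nn as nn

class NoisyEmbedding(nn.Module):
    def __init__(self, vocab_size, dim, noise_scale=0.1):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        # One learnable noise vector, trained jointly with the model.
        self.noise = nn.Parameter(torch.randn(dim) * noise_scale)

    def forward(self, token_ids, mention_mask):
        # mention_mask: 1.0 at positions inside an entity mention, else 0.0.
        x = self.embed(token_ids)
        return x + mention_mask.unsqueeze(-1) * self.noise

# Usage: mention tokens are perturbed, context tokens are left untouched.
emb = NoisyEmbedding(vocab_size=1000, dim=64)
ids = torch.randint(0, 1000, (2, 8))          # batch of 2 sentences, 8 tokens
mask = torch.zeros(2, 8); mask[:, 3:5] = 1.0  # tokens 3-4 form a mention
out = emb(ids, mask)                          # shape: (2, 8, 64)
```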


2020, Vol 34 (05), pp. 9466-9473
Author(s): Haiyan Yin, Dingcheng Li, Xu Li, Ping Li

Training generative models that can produce high-quality text with sufficient diversity is an important open problem for the Natural Language Generation (NLG) community. Recently, generative adversarial models have been applied extensively to text generation tasks, where adversarially trained generators alleviate the exposure bias experienced by conventional maximum likelihood approaches and achieve promising generation quality. However, due to the notorious mode-collapse defect of adversarial training, adversarially trained generators face a quality-diversity trade-off: they tend to sacrifice generation diversity severely to increase generation quality. In this paper, we propose a novel approach that aims to improve adversarial text generation by efficiently decelerating the mode collapse of adversarial training. To this end, we introduce a cooperative training paradigm in which a language model is trained cooperatively with the generator, and we use the language model to efficiently shape the data distribution of the generator against mode collapse. Moreover, instead of applying the cooperative update to the generator directly, we formulate a meta-learning mechanism in which the cooperative update serves as a high-level meta task, with the intuition of ensuring that the generator's parameters after the adversarial update stay resistant to mode collapse. In our experiments, we demonstrate that the proposed approach efficiently slows the pace of mode collapse for adversarial text generators. Overall, our method outperforms the baseline approaches by significant margins in terms of both generation quality and diversity in the tested domains.
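The following is a schematic, first-order sketch of the overall update pattern described above (toy linear modules and losses of our own; not the authors' implementation, which operates on discrete text and uses a full meta-gradient): an adversarial step on the generator, a cooperative step on the language model, and a final step that pulls the post-adversarial generator back toward the language model's distribution.

```python
# Schematic stand-in for the cooperative/adversarial/meta update pattern.
import torch
import torch.nn as nn

dim = 16
G = nn.Linear(dim, dim)      # stand-in for the text generator
D = nn.Linear(dim, 1)        # stand-in for the discriminator
M = nn.Linear(dim, dim)      # stand-in for the cooperative language model

opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
opt_m = torch.optim.Adam(M.parameters(), lr=1e-3)
mse = nn.MSELoss()

for step in range(100):
    real = torch.randn(32, dim)
    noise = torch.randn(32, dim)

    # 1) Adversarial updates (discriminator, then generator).
    fake = G(noise)
    d_loss = (mse(D(real), torch.ones(32, 1))
              + mse(D(fake.detach()), torch.zeros(32, 1)))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    g_adv_loss = mse(D(G(noise)), torch.ones(32, 1))
    opt_g.zero_grad(); g_adv_loss.backward(); opt_g.step()

    # 2) Cooperative update: the language model M tracks the data, and G is
    #    pulled toward M's distribution to counteract mode collapse.
    m_loss = mse(M(real), real)
    opt_m.zero_grad(); m_loss.backward(); opt_m.step()

    # First-order stand-in for the meta step: after the adversarial update,
    # nudge G so its outputs stay close to what M would produce, keeping the
    # post-adversarial parameters resistant to collapse.
    coop_loss = mse(G(noise), M(noise).detach())
    opt_g.zero_grad(); coop_loss.backward(); opt_g.step()
```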


2020, Vol 34 (05), pp. 9450-9457
Author(s): Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun

As an essential step towards computer creativity, automatic poetry generation has gained increasing attention in recent years. Though recent neural models make prominent progress on some criteria of poetry quality, generated poems still suffer from poor diversity. Studies of the related literature show that different factors, such as life experience and historical background, influence the composition styles of poets, which contributes considerably to the high diversity of human-authored poetry. Inspired by this, we propose MixPoet, a novel model that absorbs multiple factors to create various styles and promote diversity. Based on a semi-supervised variational autoencoder, our model disentangles the latent space into several subspaces, each conditioned on one influence factor through adversarial training. In this way, the model learns a controllable latent variable that captures and mixes generalized factor-related properties. Different factor mixtures lead to diverse styles and hence further differentiate generated poems from each other. Experimental results on Chinese poetry demonstrate that MixPoet improves both diversity and quality compared with three state-of-the-art models.
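As a rough, hypothetical sketch of the factor-disentangled latent space (class names, dimensions, and factor counts are ours; the real MixPoet additionally includes a poem encoder/decoder and adversarial losses), the latent code below is split into per-factor slices, and an auxiliary classifier on each slice ties that subspace to one influence factor.

```python
# Hypothetical sketch: split the latent code into per-factor subspaces and
# attach one classifier per influence factor to its own slice.
import torch
import torch.nn as nn

class FactorLatent(nn.Module):
    def __init__(self, hidden=128, sub_dim=32, factor_classes=(4, 3)):
        super().__init__()
        n = len(factor_classes)
        self.to_latent = nn.Linear(hidden, n * sub_dim)
        # One classifier per influence factor (e.g. life experience, era),
        # each reading only its own slice of the latent code.
        self.classifiers = nn.ModuleList(
            nn.Linear(sub_dim, c) for c in factor_classes
        )
        self.sub_dim = sub_dim

    def forward(self, h):
        z = self.to_latent(h)                   # (batch, n * sub_dim)
        subs = z.split(self.sub_dim, dim=-1)    # one slice per factor
        logits = [clf(s) for clf, s in zip(self.classifiers, subs)]
        return z, logits

# Usage: labeled poems supervise the per-factor classifiers; mixing different
# factor values at generation time yields different styles.
enc = FactorLatent()
h = torch.randn(8, 128)
z, factor_logits = enc(h)   # z: (8, 64); logits: [(8, 4), (8, 3)]
```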


2005, Vol 173 (4S), pp. 172-172
Author(s): Masatoshi Eto, Masahiko Harano, Katsunori Tatsugami, Hirofumi Koga, Seiji Naito

2012
Author(s): Richard A. Chechile, Lara N. Sloboda, Erin L. Warren, Daniel H. Barch, Jessica R. Chamberland

CounterText, 2015, Vol 1 (3), pp. 348-365
Author(s): Mario Aquilina

What if the post-literary also meant that which operates in a literary space (almost) devoid of language as we know it: for instance, a space in which language simply frames the literary or poetic rather than ‘containing’ it? What if the countertextual also meant the (en)countering of literary text with non-textual elements, such as mathematical concepts, or with texts that we would not normally think of as literary, such as computer code? This article addresses these issues in relation to Nick Montfort's #!, a 2014 print collection of poems that presents readers with the output of computer programs as well as the programs themselves, which are designed to operate on principles of text generation regulated by specific constraints. More specifically, it focuses on two works in the collection, ‘Round’ and ‘All the Names of God’, which are read in relation to the notions of the ‘computational sublime’ and the ‘event’.


Diabetes, 2018, Vol 67 (Supplement 1), pp. 104-OR
Author(s): Adriana Vieira de Abreu, Rahul Agrawal, Parker Howe, Simon J. Fisher
