scholarly journals Attention-Informed Mixed-Language Training for Zero-Shot Cross-Lingual Task-Oriented Dialogue Systems

2020 ◽  
Vol 34 (05) ◽  
pp. 8433-8440
Author(s):  
Zihan Liu ◽  
Genta Indra Winata ◽  
Zhaojiang Lin ◽  
Peng Xu ◽  
Pascale Fung

Recently, data-driven task-oriented dialogue systems have achieved promising performance in English. However, developing dialogue systems that support low-resource languages remains a long-standing challenge due to the absence of high-quality data. In order to circumvent the expensive and time-consuming data collection, we introduce Attention-Informed Mixed-Language Training (MLT), a novel zero-shot adaptation method for cross-lingual task-oriented dialogue systems. It leverages very few task-related parallel word pairs to generate code-switching sentences for learning the inter-lingual semantics across languages. Instead of manually selecting the word pairs, we propose to extract source words based on the scores computed by the attention layer of a trained English task-related model and then generate word pairs using existing bilingual dictionaries. Furthermore, intensive experiments with different cross-lingual embeddings demonstrate the effectiveness of our approach. Finally, with very few word pairs, our model achieves significant zero-shot adaptation performance improvements in both cross-lingual dialogue state tracking and natural language understanding (i.e., intent detection and slot filling) tasks compared to the current state-of-the-art approaches, which utilize a much larger amount of bilingual data.

Author(s):  
Siyu Lai ◽  
Hui Huang ◽  
Dong Jing ◽  
Yufeng Chen ◽  
Jinan Xu ◽  
...  

Author(s):  
Lu Xiang ◽  
Junnan Zhu ◽  
Yang Zhao ◽  
Yu Zhou ◽  
Chengqing Zong

Cross-lingual dialogue systems are increasingly important in e-commerce and customer service due to the rapid progress of globalization. In real-world system deployment, machine translation (MT) services are often used before and after the dialogue system to bridge different languages. However, noises and errors introduced in the MT process will result in the dialogue system's low robustness, making the system's performance far from satisfactory. In this article, we propose a novel MT-oriented noise enhanced framework that exploits multi-granularity MT noises and injects such noises into the dialogue system to improve the dialogue system's robustness. Specifically, we first design a method to automatically construct multi-granularity MT-oriented noises and multi-granularity adversarial examples, which contain abundant noise knowledge oriented to MT. Then, we propose two strategies to incorporate the noise knowledge: (i) Utterance-level adversarial learning and (ii) Knowledge-level guided method. The former adopts adversarial learning to learn a perturbation-invariant encoder, guiding the dialogue system to learn noise-independent hidden representations. The latter explicitly incorporates the multi-granularity noises, which contain the noise tokens and their possible correct forms, into the training and inference process, thus improving the dialogue system's robustness. Experimental results on three dialogue models, two dialogue datasets, and two language pairs have shown that the proposed framework significantly improves the performance of the cross-lingual dialogue system.


Author(s):  
Mengshi Yu ◽  
Jian Liu ◽  
Yufeng Chen ◽  
Jinan Xu ◽  
Yujie Zhang

With task-oriented dialogue systems being widely applied in everyday life, slot filling, the essential component of task-oriented dialogue systems, is required to be quickly adapted to new domains that contain domain-specific slots with few or no training data. Previous methods for slot filling usually adopt sequence labeling framework, which, however, often has limited ability when dealing with the domain-specific slots. In this paper, we take a new perspective on cross-domain slot filling by framing it as a machine reading comprehension (MRC) problem. Our approach firstly transforms slot names into well-designed queries, which contain rich informative prior knowledge and are very helpful for the detection of domain-specific slots. In addition, we utilize the large-scale MRC dataset for pre-training, which further alleviates the data scarcity problem. Experimental results on SNIPS and ATIS datasets show that our approach consistently outperforms the existing state-of-the-art methods by a large margin.


Author(s):  
PHILIPPE MORIN ◽  
JEAN-PAUL HATON ◽  
JEAN-MARIE PIERREL ◽  
GUENTHER RUSKE ◽  
WALTER WEIGEL

In the framework of man-machine communication, oral dialogue has a particular place since human speech presents several advantages when used either alone or in multimedia interfaces. The last decade has witnessed a proliferation of research into speech recognition and understanding, but few systems have been defined with a view to managing and understanding an actual man-machine dialogue. The PARTNER system that we describe in this paper proposes a solution in the case of task oriented dialogue with the use of artificial languages. A description of the essential characteristics of dialogue systems is followed by a presentation of the architecture and the principles of the PARTNER system. Finally, we present the most recent results obtained in the oral management of electronic mail in French and German.


Author(s):  
Mariya K. Timofeeva

The aim of this article consists in reviewing the basic areas of studying language scales in pragmatics; several prospects of their investigation are discussed. Presently, language scales are the object of intensive research in semantics and pragmatics, from linguistic, logical, psycholinguistic, and neuro-linguistic perspectives. We are interested mainly in pragmatics (although the area of semantics is also considered) and concentrate on linguistic rather than logical, psycholinguistic, or neuro-linguistic aspects. The article continues the series of publications intending to review and systematize pragmatic investigation in basic topical areas. An interest in studying linguistic scales in pragmatics has increased primarily due to the works of H. P. Grice, L. Horn, G. Gazdar, and S. Levinson. An important class of general pragmatic principles of communication was introduced by H. P. Grice and then was elaborated on greater detail in neo-gricean pragmatics. This class of principles specifies quantity characteristics of communication, and can be defined in terms of scales. Language scales give rise to a special class of implicatures called “scalar implicatures”. In many cases, it is necessary for a speaker to choose some position on a scale. Scalar implicature appears as a result of this choice. Each position potentially generates a certain set of implications. This pragmatic phenomenon is intensively studied in linguistics, logic, and experimental investigations. The literature in the area is ample; the article draws only a general picture of the area. The article proposes: 1) to elicit a system of potential language scales for a concrete language; 2) to consider individual / situational scales; 3) to consider dynamics of scales in speech (in accordance with basic ideas of dynamic semantics). The proposed areas of practical application are the following: stylistic analysis and studying an author’s style, modelling of reasoning and communication (particularly in dialogue systems), constructing formal ontologies of different subject areas.


Author(s):  
Florian Strub ◽  
Harm de Vries ◽  
Jérémie Mary ◽  
Bilal Piot ◽  
Aaron Courville ◽  
...  

End-to-end design of dialogue systems has recently become a popular research topic thanks to powerful tools such as encoder-decoder architectures for sequence-to-sequence learning. Yet, most current approaches cast human-machine dialogue management as a supervised learning problem, aiming at predicting the next utterance of a participant given the full history of the dialogue. This vision may fail to correctly render the planning problem inherent to dialogue as well as its contextual and grounded nature. In this paper, we introduce a Deep Reinforcement Learning method to optimize visually grounded task-oriented dialogues, based on the policy gradient algorithm. This approach is tested on the question generation task from the dataset GuessWhat?! containing 120k dialogues and provides encouraging results at solving both the problem of generating natural dialogues and the task of discovering a specific object in a complex picture.


Sign in / Sign up

Export Citation Format

Share Document