Coreference Augmentation for Multi-Domain Task-Oriented Dialogue State Tracking

Author(s): Ting Han, Chongxuan Huang, Wei Peng

2020, Vol 8, pp. 556-571
Author(s): Jacob Andreas, John Bufe, David Burkett, Charles Chen, Josh Clausman, ...

We describe an approach to task-oriented dialogue in which dialogue state is represented as a dataflow graph. A dialogue agent maps each user utterance to a program that extends this graph. Programs include metacomputation operators for reference and revision that reuse dataflow fragments from previous turns. Our graph-based state enables the expression and manipulation of complex user intents, and explicit metacomputation makes these intents easier for learned models to predict. We introduce a new dataset, SMCalFlow, featuring complex dialogues about events, weather, places, and people. Experiments show that dataflow graphs and metacomputation substantially improve representability and predictability in these natural dialogues. Additional experiments on the MultiWOZ dataset show that our dataflow representation enables an otherwise off-the-shelf sequence-to-sequence model to match the best existing task-specific state tracking model. The SMCalFlow dataset, code for replicating experiments, and a public leaderboard are available at https://www.microsoft.com/en-us/research/project/dataflow-based-dialogue-semantic-machines .
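To make the reference and revision operators above concrete, here is a minimal, hypothetical sketch of dialogue state as a growing dataflow graph; the class and operator names are illustrative assumptions, not the SMCalFlow implementation.

```python
# Dialogue state as a dataflow graph: nodes are program fragments, and the
# "refer"/"revise" metacomputation operators reuse fragments from earlier turns.
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class Node:
    op: str                               # e.g. "create_event", "refer"
    args: Dict[str, Any] = field(default_factory=dict)


class DataflowState:
    def __init__(self) -> None:
        self.nodes: List[Node] = []       # grows monotonically across turns

    def add(self, node: Node) -> Node:
        self.nodes.append(node)
        return node

    def refer(self, op: str) -> Optional[Node]:
        """Reference: reuse the most recent node produced by a given operator."""
        for node in reversed(self.nodes):
            if node.op == op:
                return node
        return None

    def revise(self, node: Node, **new_args: Any) -> Node:
        """Revision: copy an earlier node with some arguments overridden."""
        return self.add(Node(op=node.op, args={**node.args, **new_args}))


# Turn 1: "Schedule lunch with Alice on Friday."
state = DataflowState()
state.add(Node("create_event", {"title": "lunch", "attendee": "Alice", "day": "Friday"}))

# Turn 2: "Actually, make it Saturday." -> refer to the event, then revise one argument.
previous = state.refer("create_event")
revised = state.revise(previous, day="Saturday")
print(revised.args)  # {'title': 'lunch', 'attendee': 'Alice', 'day': 'Saturday'}
```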


2020, Vol 8, pp. 281-295
Author(s): Qi Zhu, Kaili Huang, Zheng Zhang, Xiaoyan Zhu, Minlie Huang

To advance multi-domain (cross-domain) dialogue modeling as well as alleviate the shortage of Chinese task-oriented datasets, we propose CrossWOZ, the first large-scale Chinese Cross-Domain Wizard-of-Oz task-oriented dataset. It contains 6K dialogue sessions and 102K utterances for 5 domains, including hotel, restaurant, attraction, metro, and taxi. Moreover, the corpus contains rich annotation of dialogue states and dialogue acts on both the user and system sides. About 60% of the dialogues have cross-domain user goals that favor inter-domain dependency and encourage natural transitions across domains in conversation. We also provide a user simulator and several benchmark models for pipelined task-oriented dialogue systems, which will help researchers compare and evaluate their models on this corpus. The large size and rich annotation of CrossWOZ make it suitable for investigating a variety of tasks in cross-domain dialogue modeling, such as dialogue state tracking, policy learning, and user simulation.
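As a rough illustration of the annotation style described above (dialogue states and dialogue acts on both sides, with a cross-domain goal), the following is a hypothetical example turn; the field names are assumptions and do not reproduce the actual CrossWOZ schema.

```python
# Hypothetical example of an annotated turn (field names are illustrative only,
# not the real CrossWOZ schema): dialogue acts and states on both sides, plus a
# cross-domain dependency (the hotel search refers back to the chosen attraction).
example_turn = {
    "user_utterance": "Find me a hotel near the attraction we just discussed.",
    "user_dialogue_acts": [("Inform", "hotel", "near", "Attraction A")],
    "user_state": {
        "attraction": {"name": "Attraction A"},
        "hotel": {"near": "Attraction A"},
    },
    "system_utterance": "Sure, there are three hotels within walking distance.",
    "system_dialogue_acts": [("Inform", "hotel", "choice", "3")],
    "system_state": {
        "hotel": {"near": "Attraction A"},
    },
}
```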


2018, Vol 2018, pp. 1-11
Author(s): A-Yeong Kim, Hyun-Je Song, Seong-Bae Park

Dialog state tracking in a spoken dialog system is the task of following the flow of a dialog and accurately identifying what a user wants from each utterance. Since the success of a dialog depends on the system's ability to capture the user's requirements, accurate state tracking is important for spoken dialog systems. This paper proposes a two-step neural dialog state tracker composed of an informativeness classifier and a neural tracker. The informativeness classifier, implemented as a convolutional neural network (CNN), first filters out noninformative utterances in a dialog. The neural tracker then estimates dialog states from the remaining informative utterances, adopting an attention mechanism and a hierarchical softmax for better performance and faster training. To prove the effectiveness of the proposed model, we conduct experiments on dialog state tracking in human-human task-oriented dialogs using the standard DSTC4 dataset. The results show that the proposed model outperforms neural trackers that lack the informativeness classifier, the attention mechanism, or the hierarchical softmax.
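A minimal PyTorch-style sketch of the two-step pipeline described above: a CNN classifier filters out noninformative utterances, and only the remaining ones reach the tracker. Module names, dimensions, and the plain softmax output (in place of the paper's hierarchical softmax) are illustrative assumptions rather than the authors' implementation.

```python
import torch
import torch.nn as nn


class InformativenessClassifier(nn.Module):
    """Step 1: CNN that decides whether an utterance is informative."""
    def __init__(self, vocab_size: int, emb_dim: int = 100, n_filters: int = 64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.conv = nn.Conv1d(emb_dim, n_filters, kernel_size=3, padding=1)
        self.out = nn.Linear(n_filters, 2)               # informative vs. noninformative

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        x = self.embed(token_ids).transpose(1, 2)        # (B, emb_dim, T)
        x = torch.relu(self.conv(x)).max(dim=2).values   # max-over-time pooling
        return self.out(x)                               # (B, 2) logits


class NeuralTracker(nn.Module):
    """Step 2: attention-based tracker over the informative utterances."""
    def __init__(self, vocab_size: int, n_slot_values: int, emb_dim: int = 100,
                 hidden: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.rnn = nn.GRU(emb_dim, hidden, batch_first=True)
        self.attn = nn.Linear(hidden, 1)                 # simple additive attention
        self.out = nn.Linear(hidden, n_slot_values)      # plain softmax head (assumption)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        h, _ = self.rnn(self.embed(token_ids))           # (B, T, hidden)
        weights = torch.softmax(self.attn(h), dim=1)     # attend over time steps
        context = (weights * h).sum(dim=1)               # (B, hidden)
        return self.out(context)                         # slot-value logits


def track_turn(utterance_ids, classifier, tracker):
    """Filter first, then track. Expects a single utterance (batch of 1)."""
    informative = classifier(utterance_ids).argmax(dim=-1).item() == 1
    return tracker(utterance_ids) if informative else None
```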


2020, Vol 34 (05), pp. 8107-8114
Author(s): Adarsh Kumar, Peter Ku, Anuj Goyal, Angeliki Metallinou, Dilek Hakkani-Tur

Task-oriented dialog agents provide a natural language interface for users to complete their goals. Dialog State Tracking (DST), often a core component of these systems, tracks the system's understanding of the user's goal throughout the conversation. To enable accurate multi-domain DST, the model needs to encode dependencies between past utterances and slot semantics and to understand the dialog context, including long-range cross-domain references. We introduce a novel architecture for this task that encodes the conversation history and slot semantics more robustly by using attention mechanisms at multiple granularities. In particular, we use cross-attention to model relationships between the context and slots at different semantic levels, and self-attention to resolve cross-domain coreferences. In addition, the proposed architecture does not rely on knowing the domain ontologies beforehand and can also be used in a zero-shot setting for new domains or unseen slot values. Our model improves the joint goal accuracy by 5% (absolute) in the full-data setting and by up to 2% (absolute) in the zero-shot setting over the previous state of the art on the MultiWOZ 2.1 dataset.
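The following sketch illustrates the two attention roles the abstract describes: self-attention over the dialogue history to link cross-domain coreferences, and cross-attention from slot descriptions to the history to read out each slot. Shapes and module names are assumptions, not the authors' code.

```python
import torch
import torch.nn as nn


class SlotContextAttention(nn.Module):
    def __init__(self, d_model: int = 128, n_heads: int = 4):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, history: torch.Tensor, slots: torch.Tensor) -> torch.Tensor:
        # history: (B, T, d) token encodings of the full dialogue context
        # slots:   (B, S, d) encodings of slot descriptions (no fixed ontology)
        history, _ = self.self_attn(history, history, history)   # link coreferences in context
        slot_repr, _ = self.cross_attn(slots, history, history)  # slots attend to the context
        return slot_repr                                         # (B, S, d) per-slot summaries


model = SlotContextAttention()
history = torch.randn(2, 50, 128)   # batch of 2 dialogues, 50 context tokens each
slots = torch.randn(2, 10, 128)     # 10 slot descriptions
print(model(history, slots).shape)  # torch.Size([2, 10, 128])
```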


Author(s): Khaldoon H. Alhussayni, Alexander Zamyatin, S. Eman Alshamery

Dialog state tracking (DST) plays a critical role in the life cycle of a task-oriented dialogue system. DST represents the user's goals at each step of the dialogue as a conceptual structure comprising slot-value pairs and dialogue acts, which directly improves the performance and effectiveness of dialogue systems. DST faces several challenges: linguistic diversity, a dynamic conversational context, and the distribution of the dialogue state over candidate values, both for slot values and for dialogue acts defined in the ontology. In many turns users refer indirectly to earlier utterances, which makes it hard to identify and use the relevant dialogue history; recent, widely used methods remain ineffective at this. In this paper, we propose a dialogue-history self-attention framework for DST that recognizes relevant historical context by encoding the previous user utterance alongside the current user utterance and previous system actions, in which specific slot-value pairs vary, and that uses this together with a weighted system utterance to outperform existing models in recognizing the related context and the relevance of a system utterance. The proposed model was evaluated on the WoZ dataset. The implementation was tested first with the prior user utterance as input to the dialogue encoder, and second with an additional score that combines all candidate slot-value pairs with the context of the previous and current user utterances. The proposed model obtained results 0.8 per cent better than all state-of-the-art methods in joint goal accuracy, although not for the turn request measure.
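A hypothetical sketch of the scoring step described above: the previous user utterance, previous system acts, and current utterance are encoded jointly with self-attention, and each candidate slot-value pair is scored against the resulting context vector. The encoder inputs and scoring function are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn


class HistoryAwareScorer(nn.Module):
    def __init__(self, d_model: int = 128, n_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, context: torch.Tensor, candidates: torch.Tensor) -> torch.Tensor:
        # context:    (B, T, d) tokens of [prev user utt; prev system acts; current utt]
        # candidates: (B, C, d) encodings of candidate slot-value pairs
        attended, _ = self.attn(context, context, context)   # relate current turn to history
        summary = attended.mean(dim=1, keepdim=True)         # (B, 1, d) context vector
        scores = (candidates * summary).sum(dim=-1)          # dot-product score per candidate
        return scores                                        # (B, C)
```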


2020, Vol 34 (05), pp. 8689-8696
Author(s): Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta, Pranav Khaitan

Virtual assistants such as Google Assistant, Alexa and Siri provide a conversational interface to a large number of services and APIs spanning multiple domains. Such systems need to support an ever-increasing number of services with possibly overlapping functionality. Furthermore, some of these services have little to no training data available. Existing public datasets for task-oriented dialogue do not sufficiently capture these challenges since they cover few domains and assume a single static ontology per domain. In this work, we introduce the Schema-Guided Dialogue (SGD) dataset, containing over 16k multi-domain conversations spanning 16 domains. Our dataset exceeds the existing task-oriented dialogue corpora in scale, while also highlighting the challenges associated with building large-scale virtual assistants. It provides a challenging testbed for a number of tasks including language understanding, slot filling, dialogue state tracking and response generation. Along the same lines, we present a schema-guided paradigm for task-oriented dialogue, in which predictions are made over a dynamic set of intents and slots, provided as input, using their natural language descriptions. This allows a single dialogue system to easily support a large number of services and facilitates simple integration of new services without requiring additional training data. Building upon the proposed paradigm, we release a model for dialogue state tracking capable of zero-shot generalization to new APIs, while remaining competitive in the regular setting.
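A minimal sketch of the schema-guided idea: intents and slots, together with their natural-language descriptions, are given to the model as input, so supporting a new service requires only new descriptions rather than new parameters. The toy encoder and class names below are placeholder assumptions, not the released model.

```python
from dataclasses import dataclass
from typing import List

import torch
import torch.nn as nn


@dataclass
class SlotSchema:
    name: str
    description: str                      # natural-language description from the schema


class ToyTextEncoder(nn.Module):
    """Toy stand-in for a shared text encoder: hashed bag-of-embeddings."""
    def __init__(self, dim: int = 64, buckets: int = 1000):
        super().__init__()
        self.embed = nn.Embedding(buckets, dim)

    def forward(self, text: str) -> torch.Tensor:
        ids = torch.tensor([hash(w) % self.embed.num_embeddings
                            for w in text.lower().split()])
        return self.embed(ids).mean(dim=0)


class SchemaGuidedTracker(nn.Module):
    def __init__(self, encoder: nn.Module):
        super().__init__()
        self.encoder = encoder            # shared between utterances and schema text

    def forward(self, utterance: str, slots: List[SlotSchema]) -> torch.Tensor:
        utt_vec = self.encoder(utterance)                                       # (d,)
        slot_vecs = torch.stack([self.encoder(s.description) for s in slots])   # (S, d)
        # Unseen services need only new descriptions, hence zero-shot generalization.
        return slot_vecs @ utt_vec                                              # (S,) relevance scores


tracker = SchemaGuidedTracker(ToyTextEncoder())
slots = [SlotSchema("restaurant-area", "the part of town the restaurant is in"),
         SlotSchema("restaurant-food", "the cuisine served by the restaurant")]
print(tracker("I want Italian food in the centre", slots))
```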

