End-to-End Task-Oriented Dialog System Through Template Slot Value Generation

Author(s):  
Teakgyu Hong ◽  
Oh-Woog Kwon ◽  
Young-Kil Kim
IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Hwaran Lee ◽  
Seokhwan Jo ◽  
HyungJun Kim ◽  
Sangkeun Jung ◽  
Tae-Yoon Kim

2020 ◽  
Vol 34 (05) ◽  
pp. 8293-8302
Author(s):  
Yu Li ◽  
Kun Qian ◽  
Weiyan Shi ◽  
Zhou Yu

End-to-end task-oriented dialog models have achieved promising performance on collaborative tasks where users willingly coordinate with the system to complete a given task. While in non-collaborative settings, for example, negotiation and persuasion, users and systems do not share a common goal. As a result, compared to collaborate tasks, people use social content to build rapport and trust in these non-collaborative settings in order to advance their goals. To handle social content, we introduce a hierarchical intent annotation scheme, which can be generalized to different non-collaborative dialog tasks. Building upon TransferTransfo (Wolf et al. 2019), we propose an end-to-end neural network model to generate diverse coherent responses. Our model utilizes intent and semantic slots as the intermediate sentence representation to guide the generation process. In addition, we design a filter to select appropriate responses based on whether these intermediate representations fit the designed task and conversation constraints. Our non-collaborative dialog model guides users to complete the task while simultaneously keeps them engaged. We test our approach on our newly proposed AntiScam dataset and an existing PersuasionForGood dataset. Both automatic and human evaluations suggest that our model outperforms multiple baselines in these two non-collaborative tasks.


2020 ◽  
Vol 34 (05) ◽  
pp. 8327-8335
Author(s):  
Weixin Liang ◽  
Youzhi Tian ◽  
Chengcai Chen ◽  
Zhou Yu

A major bottleneck in training end-to-end task-oriented dialog system is the lack of data. To utilize limited training data more efficiently, we propose Modular Supervision Network (MOSS), an encoder-decoder training framework that could incorporate supervision from various intermediate dialog system modules including natural language understanding, dialog state tracking, dialog policy learning and natural language generation. With only 60% of the training data, MOSS-all (i.e., MOSS with supervision from all four dialog modules) outperforms state-of-the-art models on CamRest676. Moreover, introducing modular supervision has even bigger benefits when the dialog task has a more complex dialog state and action space. With only 40% of the training data, MOSS-all outperforms the state-of-the-art model on a complex laptop network trouble shooting dataset, LaptopNetwork, that we introduced. LaptopNetwork consists of conversations between real customers and customer service agents in Chinese. Moreover, MOSS framework can accommodate dialogs that have supervision from different dialog modules at both framework level and model level. Therefore, MOSS is extremely flexible to update in real-world deployment.


Author(s):  
Bowen Zhang ◽  
Xiaofei Xu ◽  
Xutao Li ◽  
Yunming Ye ◽  
Xiaojun Chen ◽  
...  
Keyword(s):  

Author(s):  
Silin Gao ◽  
Ryuichi Takanobu ◽  
Wei Peng ◽  
Qun Liu ◽  
Minlie Huang

Author(s):  
Florian Strub ◽  
Harm de Vries ◽  
Jérémie Mary ◽  
Bilal Piot ◽  
Aaron Courville ◽  
...  

End-to-end design of dialogue systems has recently become a popular research topic thanks to powerful tools such as encoder-decoder architectures for sequence-to-sequence learning. Yet, most current approaches cast human-machine dialogue management as a supervised learning problem, aiming at predicting the next utterance of a participant given the full history of the dialogue. This vision may fail to correctly render the planning problem inherent to dialogue as well as its contextual and grounded nature. In this paper, we introduce a Deep Reinforcement Learning method to optimize visually grounded task-oriented dialogues, based on the policy gradient algorithm. This approach is tested on the question generation task from the dataset GuessWhat?! containing 120k dialogues and provides encouraging results at solving both the problem of generating natural dialogues and the task of discovering a specific object in a complex picture.


2019 ◽  
Vol 23 (3) ◽  
pp. 1989-2002 ◽  
Author(s):  
Haotian Xu ◽  
Haiyun Peng ◽  
Haoran Xie ◽  
Erik Cambria ◽  
Liuyang Zhou ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document