Cooperative Multi-Agent Reinforcement Learning with Conversation Knowledge for Dialogue Management

2020 ◽  
Vol 10 (8) ◽  
pp. 2740
Author(s):  
Shuyu Lei ◽  
Xiaojie Wang ◽  
Caixia Yuan

Dialogue management plays a vital role in task-oriented dialogue systems and has become an active area of research in recent years. Despite the promising results brought by deep reinforcement learning, most studies additionally require a hand-crafted user simulator. To avoid the time-consuming development of a simulator policy, we propose a multi-agent dialogue model in which an end-to-end dialogue manager and a user simulator are optimized simultaneously. Different from prior work, we optimize the two agents from scratch and apply reward shaping based on adjacency-pair constraints from conversation analysis to speed up learning and to avoid deviation from normal human-human conversation. In addition, we generalize the one-to-one learning strategy to a one-to-many learning strategy, in which a dialogue manager is concurrently optimized with several user simulators, to improve the performance of the trained dialogue manager. The experimental results show that one-to-one agents trained with adjacency-pair constraints converge faster and avoid deviation. In cross-model evaluation involving human users, the dialogue manager trained with the one-to-many strategy achieves the best performance.
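The adjacency-pair reward shaping mentioned in this abstract lends itself to a short illustration. The sketch below is a minimal, hypothetical rendering of the idea, not the authors' code: a small table of adjacency pairs (e.g. request → inform) adds a shaping bonus when consecutive dialogue acts of the user simulator and the dialogue manager form a valid pair, and a penalty otherwise. All names (ADJACENCY_PAIRS, shaped_reward, the act labels) are assumptions made for illustration.

```python
# Minimal sketch (not the authors' code): reward shaping from adjacency-pair
# constraints during joint training of a dialogue manager and a user simulator.

# Hypothetical table of adjacency pairs: first-pair part -> allowed second-pair parts.
ADJACENCY_PAIRS = {
    "request": {"inform", "deny"},
    "greeting": {"greeting"},
    "question": {"answer"},
    "offer": {"accept", "reject"},
}

def shaping_bonus(prev_act: str, curr_act: str,
                  bonus: float = 0.2, penalty: float = -0.2) -> float:
    """Return a small shaped reward that encourages replies forming a valid adjacency pair."""
    if prev_act not in ADJACENCY_PAIRS:
        return 0.0                      # no adjacency-pair constraint applies here
    return bonus if curr_act in ADJACENCY_PAIRS[prev_act] else penalty

def shaped_reward(task_reward: float, prev_act: str, curr_act: str) -> float:
    """Combine the task-level reward with the adjacency-pair shaping term."""
    return task_reward + shaping_bonus(prev_act, curr_act)

# Example: the simulator issues a "request"; the manager answers with "inform".
print(shaped_reward(0.0, "request", "inform"))    # 0.2  -> shaping speeds up learning
print(shaped_reward(0.0, "request", "greeting"))  # -0.2 -> deviation is discouraged
```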

Author(s):  
Omar Sami Oubbati ◽  
Mohammed Atiquzzaman ◽  
Abderrahmane Lakas ◽  
Abdullah Baz ◽  
Hosam Alhakami ◽  
...  

2017 ◽  
Vol 26 (01) ◽  
pp. 1760009 ◽  
Author(s):  
Guillaume Dubuisson Duplessis ◽  
Alexandre Pauchet ◽  
Nathalie Chaignaud ◽  
Jean-Philippe Kotowicz

Our work aims at designing a dialogue manager dedicated to agents that interact with humans. In this article, we investigate how dialogue patterns at the dialogue-act level, extracted from human-human interactions, can be fruitfully used by a software agent to interact with a human. We show how these patterns can be leveraged via a dialogue game structure to benefit the dialogue management process of an agent. We describe how empirically specified dialogue games can be employed at both the interpretative and generative levels of dialogue management. We present Dogma, an open-source module that an agent can use to manage its conventional communicative behaviour. We show that our library of dialogue games can be used within Dogma to generate fragments of dialogue that are strongly coherent from a human perspective.
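To make the notion of a dialogue game concrete, here is a small, hypothetical sketch in the spirit of this abstract; it is not Dogma's actual API or data model. A game records the expected sequence of dialogue acts between initiator and partner, and the agent can use it both to check whether an incoming act is conventionally expected (interpretation) and to pick its next act (generation).

```python
# Illustrative sketch only; Dogma's real data structures and interfaces may differ.
from dataclasses import dataclass

@dataclass
class DialogueGame:
    """A dialogue game as an expected sequence of (speaker role, dialogue act) moves."""
    name: str
    moves: list  # e.g. [("initiator", "ask"), ("partner", "answer"), ("initiator", "thank")]

    def expected_act(self, turn_index: int, speaker: str):
        """Interpretative use: which act is conventionally expected at this turn?"""
        if turn_index < len(self.moves) and self.moves[turn_index][0] == speaker:
            return self.moves[turn_index][1]
        return None

    def next_act(self, turn_index: int, agent_role: str):
        """Generative use: the act the agent should produce to stay within the game."""
        return self.expected_act(turn_index, agent_role)

# A question-answer game, as might be extracted from human-human dialogues (hypothetical).
qa_game = DialogueGame("question-answer",
                       [("initiator", "ask"), ("partner", "answer"), ("initiator", "thank")])

assert qa_game.expected_act(1, "partner") == "answer"   # interpretation
assert qa_game.next_act(2, "initiator") == "thank"      # generation
```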


Measurement ◽  
2021 ◽  
pp. 109955
Author(s):  
Lei Xi ◽  
Mengmeng Sun ◽  
Huan Zhou ◽  
Yanchun Xu ◽  
Junnan Wu ◽  
...  

2019 ◽  
Author(s):  
Alexandros Papangelis ◽  
Yi-Chia Wang ◽  
Piero Molino ◽  
Gokhan Tur

Author(s):  
Tulika Saha ◽  
Dhawal Gupta ◽  
Sriparna Saha ◽  
Pushpak Bhattacharyya

Building virtual agents capable of carrying out complex user queries involving multiple intents of a domain is quite a challenge, because it demands that the agent manage several subtasks simultaneously. This article presents a universal deep reinforcement learning framework that can synthesize dialogue managers for task-oriented dialogue systems encompassing various intents pertaining to a domain. The conversation between agent and user is broken down into hierarchies to segregate subtasks pertinent to different intents. The concept of hierarchical reinforcement learning, particularly options, is used to learn policies at different levels of the hierarchy that operate at distinct time steps to fulfil the user query successfully. The dialogue manager comprises a top-level intent meta-policy that selects among subtasks (options) and a low-level controller policy that picks primitive actions to communicate with the user and complete the subtask assigned by the top-level policy, across the varying intents of a domain. The proposed dialogue management module is trained such that it can be reused for any language for which it has been developed, with little to no supervision. The developed system has been demonstrated for the “Air Travel” and “Restaurant” domains in English and Hindi. Empirical results demonstrate the robustness and efficacy of the learned dialogue policy, which outperforms several baselines and a state-of-the-art system.
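The two-level control described in this abstract can be summarised with a schematic sketch. The names below (meta_policy, controller_policy, the option and act labels) are assumptions for illustration, not the authors' implementation: the meta-policy selects an option corresponding to an intent on a coarse time scale, and the controller issues primitive dialogue acts each turn until the option terminates.

```python
# Schematic sketch of an options-style hierarchical dialogue manager;
# all names and policies below are illustrative placeholders, not the article's system.
import random

OPTIONS = ["book_flight", "book_restaurant"]          # subtasks / intents
PRIMITIVE_ACTS = ["request_slot", "confirm", "inform", "close"]

def meta_policy(state):
    """Top-level intent meta-policy: choose which subtask (option) to pursue."""
    return random.choice(OPTIONS)                     # stand-in for a learned policy

def controller_policy(state, option):
    """Low-level controller: choose a primitive dialogue act for the current option."""
    return random.choice(PRIMITIVE_ACTS)              # stand-in for a learned policy

def run_dialogue(max_turns=10):
    state = {"filled_slots": 0}
    option = meta_policy(state)                       # meta-policy acts on a coarse time scale
    for _ in range(max_turns):
        act = controller_policy(state, option)        # controller acts every turn
        if act == "request_slot":
            state["filled_slots"] += 1
        if act == "close" or state["filled_slots"] >= 3:
            state["filled_slots"] = 0                 # option terminates ...
            option = meta_policy(state)               # ... and the meta-policy re-selects
    return state

run_dialogue()
```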


Author(s):  
Yue Hu ◽  
Juntao Li ◽  
Xi Li ◽  
Gang Pan ◽  
Mingliang Xu

As an important and challenging problem in artificial intelligence (AI) game playing, StarCraft micromanagement involves a dynamically adversarial game-playing process with complex multi-agent control within a large action space. In this paper, we propose a novel knowledge-guided, agent-tactic-aware learning scheme, namely opponent-guided tactic learning (OGTL), to cope with this micromanagement problem. The proposed scheme adopts a two-stage cascaded learning strategy that is capable of not only transferring human tactic knowledge from the hand-crafted opponent agents to our AI agents but also improving their adversarial ability. With the power of reinforcement learning, such a knowledge-guided, agent-tactic-aware scheme guides the AI agents to achieve high winning-rate performance while accelerating the policy exploration process in a tactic-interpretable fashion. Experimental results demonstrate the effectiveness of the proposed scheme against state-of-the-art approaches in several benchmark combat scenarios.
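The two-stage cascade can be pictured with a short toy sketch; the classes, update rules, and tactic labels below are assumptions made for illustration, not the OGTL code. The policy is first fit to tactic demonstrations collected from the scripted opponent agents, then refined with reinforcement learning against those opponents.

```python
# Schematic sketch of a two-stage cascaded learning scheme in the spirit of OGTL;
# all classes, rewards, and tactic names below are illustrative placeholders.
import random
from collections import defaultdict

class TabularPolicy:
    """Toy policy keeping per-state action preferences."""
    def __init__(self, actions):
        self.pref = defaultdict(lambda: {a: 0.0 for a in actions})

    def act(self, state):
        prefs = self.pref[state]
        return max(prefs, key=prefs.get)

    def update_supervised(self, state, tactic_action, lr=1.0):
        self.pref[state][tactic_action] += lr         # imitate the opponent's tactic

    def update_rl(self, state, action, reward, lr=0.1):
        self.pref[state][action] += lr * reward       # crude reward-weighted update

def stage1_tactic_transfer(policy, demonstrations, epochs=5):
    """Stage 1: transfer tactic knowledge from hand-crafted opponent demonstrations."""
    for _ in range(epochs):
        for state, tactic_action in demonstrations:
            policy.update_supervised(state, tactic_action)
    return policy

def stage2_adversarial_rl(policy, episodes=100):
    """Stage 2: refine the pre-trained policy with RL to improve adversarial ability."""
    for _ in range(episodes):
        state = "engage"
        action = policy.act(state)
        reward = 1.0 if action == "focus_fire" else random.uniform(-1.0, 0.5)
        policy.update_rl(state, action, reward)
    return policy

demos = [("engage", "focus_fire"), ("retreat", "kite")]      # hypothetical tactic traces
policy = TabularPolicy(["focus_fire", "kite", "hold"])
policy = stage2_adversarial_rl(stage1_tactic_transfer(policy, demos))
```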


Author(s):  
Hao Jiang ◽  
Dianxi Shi ◽  
Chao Xue ◽  
Yajie Wang ◽  
Gongju Wang ◽  
...  
