Multi-Agent Deep Reinforcement Learning for Online 3D Human Poses Estimation

2021, Vol. 13 (19), pp. 3995
Author(s): Zhen Fan, Xiu Li, Yipeng Li

Most multi-view human pose estimation techniques assume the cameras are fixed. In dynamic scenes, however, the cameras should be able to move and seek the best views collaboratively, avoiding occlusions while extracting 3D information about the target. In this paper, we address the problem of online view selection for a fixed number of cameras to actively estimate multi-person 3D poses. The proposed method exploits a distributed multi-agent deep reinforcement learning framework, in which each camera is modeled as an agent, to optimize the actions of all the cameras. An inter-agent communication protocol transfers the cameras' relative positions between agents for better collaboration. Experiments on the Panoptic dataset show that our method outperforms other view selection methods by a large margin given an identical number of cameras. To the best of our knowledge, our method is the first to address online active multi-view 3D pose estimation with multi-agent reinforcement learning.
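
To make the framework concrete, here is a minimal sketch of the abstract's idea, not the authors' code: each camera is an independent agent whose observation is augmented with the relative positions of the other cameras (the communication step) before selecting a discrete move. The action set, reward, and the linear Q-function standing in for the deep network are illustrative assumptions.

```python
import numpy as np

ACTIONS = ["stay", "left", "right", "up", "down"]  # assumed discrete camera moves

class CameraAgent:
    def __init__(self, obs_dim, n_agents, lr=1e-2, gamma=0.95):
        # input = own observation + (n_agents - 1) relative 3D camera positions
        self.in_dim = obs_dim + 3 * (n_agents - 1)
        self.W = np.zeros((len(ACTIONS), self.in_dim))  # linear Q-function (stand-in for a deep net)
        self.lr, self.gamma = lr, gamma

    def communicate(self, own_pos, other_positions):
        # inter-agent message: relative positions of the other cameras
        return np.concatenate([p - own_pos for p in other_positions])

    def q_values(self, obs, messages):
        x = np.concatenate([obs, messages])
        return self.W @ x, x

    def act(self, obs, messages, eps=0.1):
        # epsilon-greedy view selection over the discrete camera moves
        q, _ = self.q_values(obs, messages)
        return np.random.randint(len(ACTIONS)) if np.random.rand() < eps else int(q.argmax())

    def update(self, obs, messages, a, reward, next_obs, next_messages):
        # one-step TD(0) update; reward would reflect 3D pose estimation quality
        q, x = self.q_values(obs, messages)
        next_q, _ = self.q_values(next_obs, next_messages)
        td_error = reward + self.gamma * next_q.max() - q[a]
        self.W[a] += self.lr * td_error * x
```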

2021, pp. 100162
Author(s): Guanghui Wen, Junjie Fu, Pengcheng Dai, Jialing Zhou

Author(s): Wei Feng, Wentao Liu, Tong Li, Jing Peng, Chen Qian, ...

Human-object interaction (HOI) recognition and pose estimation are two closely related tasks. Human pose is an essential cue for recognizing actions and localizing the interacted objects; in turn, the recognized action and the locations of the interacted objects provide guidance for pose estimation. In this paper, we propose a turbo learning framework that performs HOI recognition and pose estimation simultaneously. First, two modules are designed to enforce message passing between the tasks: a pose-aware HOI recognition module and an HOI-guided pose estimation module. These two modules then form a closed loop that exploits the complementary information iteratively and can be trained in an end-to-end manner. The proposed method achieves state-of-the-art performance on two public benchmarks, the Verbs in COCO (V-COCO) and HICO-DET datasets.
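
The closed-loop message passing can be sketched as follows; this is a minimal illustration of the turbo idea, not the released model, and the two linear heads, feature dimensions, and number of refinement iterations T are placeholder assumptions.

```python
import torch
import torch.nn as nn

class TurboHOI(nn.Module):
    def __init__(self, feat_dim=256, n_actions=29, n_joints=17, T=3):
        super().__init__()
        self.T = T
        # pose-aware HOI recognition: image features + current pose -> action scores
        self.hoi_head = nn.Linear(feat_dim + n_joints * 2, n_actions)
        # HOI-guided pose estimation: image features + current action scores -> 2D joints
        self.pose_head = nn.Linear(feat_dim + n_actions, n_joints * 2)

    def forward(self, feats):
        b = feats.size(0)
        pose = feats.new_zeros(b, self.pose_head.out_features)  # initial pose guess
        for _ in range(self.T):  # closed loop: each task refines the other
            hoi = self.hoi_head(torch.cat([feats, pose], dim=1))
            pose = self.pose_head(torch.cat([feats, hoi], dim=1))
        return hoi, pose

# usage: both outputs are produced jointly and the loop is trainable end-to-end
model = TurboHOI()
hoi_scores, joints = model(torch.randn(4, 256))
```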


Author(s):  
Victor Gallego ◽  
Roi Naveiro ◽  
David Rios Insua

In several reinforcement learning (RL) scenarios, mainly in security settings, adversaries may try to interfere with the reward-generating process. In such non-stationary environments, however, Q-learning leads to suboptimal results (Busoniu, Babuska, and De Schutter 2010). Previous game-theoretic approaches to this problem have modeled the whole multi-agent system as a game. Instead, we face the problem of prescribing decisions to a single agent (the supported decision maker, DM) against a potential threat model (the adversary). We augment the MDP to account for this threat, introducing Threatened Markov Decision Processes (TMDPs). Furthermore, we propose a level-k thinking scheme that yields a new learning framework for TMDPs. We empirically test our framework, showing the benefits of opponent modeling.
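
A minimal sketch of Q-learning in a TMDP, following the abstract's idea rather than the authors' exact algorithm: the DM keeps a tabular Q(s, a, b) over its own action a and the adversary's action b, plus a Dirichlet-style count model of the adversary's behavior, and acts by maximizing the expected Q-value under that opponent model. State and action space sizes and hyperparameters are illustrative assumptions.

```python
import numpy as np

class TMDPAgent:
    def __init__(self, n_states, n_actions, n_threats, lr=0.1, gamma=0.9):
        self.Q = np.zeros((n_states, n_actions, n_threats))
        self.counts = np.ones((n_states, n_threats))  # opponent model (Dirichlet prior)
        self.lr, self.gamma = lr, gamma

    def threat_probs(self, s):
        # predictive distribution over the adversary's next action in state s
        return self.counts[s] / self.counts[s].sum()

    def act(self, s, eps=0.1):
        if np.random.rand() < eps:
            return np.random.randint(self.Q.shape[1])
        expected_q = self.Q[s] @ self.threat_probs(s)  # average over predicted threats
        return int(expected_q.argmax())

    def update(self, s, a, b, reward, s_next):
        self.counts[s, b] += 1  # learn the adversary's empirical policy
        next_v = (self.Q[s_next] @ self.threat_probs(s_next)).max()
        td_error = reward + self.gamma * next_v - self.Q[s, a, b]
        self.Q[s, a, b] += self.lr * td_error
```

A level-k scheme would nest this construction: the DM (level k) predicts the adversary's action by simulating an opponent that itself reasons at level k-1, rather than relying only on the empirical counts above.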

