Image skin segmentation based on multi-agent learning Bayesian and neural network

2014, Vol 32, pp. 136-150
Author(s): A.A. Zaidan, N.N. Ahmad, H. Abdul Karim, M. Larbani, B.B. Zaidan, ...

2019, Vol 32 (12), pp. 8315-8366
Author(s): A. A. Zaidan, B. B. Zaidan, M. A. Alsalem, O. S. Albahri, A. S. Albahri, ...

IEEE Access, 2021, pp. 1-1
Author(s): Giuseppe Caso, Ozgu Alay, Guido Carlo Ferrante, Luca De Nardis, Maria-Gabriella Di Benedetto, ...

2021, Vol 54 (5), pp. 1-35
Author(s): Shubham Pateria, Budhitama Subagdja, Ah-hwee Tan, Chai Quek

Hierarchical Reinforcement Learning (HRL) enables the autonomous decomposition of challenging long-horizon decision-making tasks into simpler subtasks. In recent years, the landscape of HRL research has expanded considerably, producing a wide variety of approaches. A comprehensive overview of this landscape is needed to study HRL in an organized manner. We provide a survey of the diverse HRL approaches, organized around the challenges of learning hierarchical policies, subtask discovery, transfer learning, and multi-agent learning using HRL. The survey is structured according to a novel taxonomy of the approaches. Based on the survey, we propose a set of important open problems to motivate future research in HRL. Furthermore, we outline a few suitable task domains for evaluating HRL approaches and a few interesting examples of practical HRL applications in the Supplementary Material.
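To make the core idea concrete, the sketch below shows two-level control in the options framework, one common formalism behind the hierarchical policies such surveys cover. The toy corridor environment, the hand-crafted options, and every name in it (CorridorEnv, Option, run_episode) are illustrative assumptions, not code from the paper.

```python
class CorridorEnv:
    """Toy task: start at cell 0, reach cell `goal` on a 1-D corridor."""
    def __init__(self, goal=8):
        self.goal = goal

    def reset(self):
        self.pos = 0
        return self.pos

    def step(self, action):                 # action: -1 (left) or +1 (right)
        self.pos = max(0, self.pos + action)
        done = self.pos >= self.goal
        reward = 1.0 if done else -0.01     # small step cost, goal bonus
        return self.pos, reward, done

class Option:
    """A temporally extended action: an intra-option policy plus a
    termination condition (initiation sets omitted for brevity)."""
    def __init__(self, name, policy, terminates):
        self.name = name
        self.policy = policy                # state -> primitive action
        self.terminates = terminates        # state -> bool

# Two hand-crafted options ("subtasks") that walk to the next multiple of 4.
# In HRL these would come from subtask discovery rather than by hand.
go_right = Option("go_right", lambda s: +1, lambda s: s % 4 == 0)
go_left  = Option("go_left",  lambda s: -1, lambda s: s % 4 == 0)

def high_level_policy(state, options):
    return options[0]                       # fixed choice; normally learned

def run_episode(env, options, max_steps=50):
    """Two-level control: the high level picks an option; the option's
    policy emits primitive actions until its termination condition fires."""
    state, done, steps = env.reset(), False, 0
    while not done and steps < max_steps:
        option = high_level_policy(state, options)
        while True:
            state, reward, done = env.step(option.policy(state))
            steps += 1
            if done or option.terminates(state) or steps >= max_steps:
                break
    return state, steps

print(run_episode(CorridorEnv(), [go_right, go_left]))   # -> (8, 8)
```

In a full HRL agent, the fixed high-level choice would be a learned policy over options, and the hand-crafted options would be replaced by discovered subtasks.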


Author(s): Yanlin Han, Piotr Gmytrasiewicz

This paper introduces the IPOMDP-net, a neural network architecture for multi-agent planning under partial observability. It embeds an interactive partially observable Markov decision process (I-POMDP) model and a QMDP planning algorithm that solves the model within a neural network architecture. The IPOMDP-net is fully differentiable and allows end-to-end training. In the learning phase, we train an IPOMDP-net on various fixed and randomly generated environments in a reinforcement learning setting, assuming observable reinforcements and unknown (randomly initialized) model functions. In the planning phase, we test the trained network on new, unseen variants of the environments, using the learned model to plan without reinforcements. Empirical results show that our model-based IPOMDP-net outperforms a state-of-the-art model-free network and generalizes better to larger, unseen environments. Our approach provides a general neural computing architecture for multi-agent planning using I-POMDPs. It suggests that, in a multi-agent setting, having a model of other agents benefits our decision-making, resulting in higher-quality and more generalizable policies.
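As a rough illustration of the planning component, here is a plain-NumPy sketch of the QMDP approximation on a tiny two-state "tiger"-style problem. The IPOMDP-net performs an analogous computation (extended with models of other agents) inside differentiable network layers; this sketch makes no attempt at that, and all numbers and names in it are assumptions for illustration.

```python
import numpy as np

# Tiny two-state "tiger"-style POMDP (all numbers are illustrative).
# Actions: 0 = listen, 1 = open-left, 2 = open-right.
S, A, O = 2, 3, 2
T = np.empty((A, S, S))                    # T[a, s, s'] transition model
T[0] = np.eye(S)                           # listening leaves the state alone
T[1] = T[2] = np.full((S, S), 0.5)         # opening a door resets the problem
R = np.array([[-1.0, -1.0],                # listen: small cost
              [-100.0, 10.0],              # open-left: bad if tiger is left
              [10.0, -100.0]])             # open-right: bad if tiger is right
Z = np.empty((A, S, O))                    # Z[a, s', o] observation model
Z[0] = np.array([[0.85, 0.15],
                 [0.15, 0.85]])            # listening is 85% accurate
Z[1] = Z[2] = np.full((S, O), 0.5)         # opening is uninformative

def qmdp_values(T, R, gamma=0.95, iters=200):
    """Value iteration on the underlying (fully observable) MDP."""
    V = np.zeros(T.shape[1])
    Q = np.zeros_like(R)
    for _ in range(iters):
        Q = R + gamma * np.einsum('ast,t->as', T, V)   # Q[a, s]
        V = Q.max(axis=0)
    return Q

def belief_update(b, a, o, T, Z):
    """Bayes filter: predict through T[a], then weight by Z[a, :, o]."""
    b_pred = b @ T[a]                      # sum_s b(s) T[a, s, s']
    b_new = Z[a, :, o] * b_pred
    return b_new / b_new.sum()

def qmdp_action(b, Q):
    """QMDP rule: argmax_a sum_s b(s) Q(s, a)."""
    return int(np.argmax(b @ Q.T))

Q = qmdp_values(T, R)
b = np.array([0.5, 0.5])                   # start fully uncertain
b = belief_update(b, a=0, o=0, T=T, Z=Z)   # listen, hear "tiger left"
print(b, qmdp_action(b, Q))                # belief shifts; agent keeps listening
```

QMDP scores actions as if the state became fully observable after one step, which keeps the planner simple enough to unroll inside a differentiable architecture.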

