Cooperative Multi-Agent Reinforcement Learning for Multi-Component Robotic Systems: guidelines for future research

Abstract: Reinforcement Learning (RL) is a paradigm that aims to develop algorithms for training an agent to achieve a goal optimally with minimal feedback about the desired behavior, which is not precisely specified. Scalar rewards are returned to the agent in response to its actions, endorsing or opposing them. RL algorithms have been successfully applied to robot control design. Extending the RL paradigm to the design of control systems for Multi-Component Robotic Systems (MCRS) poses new challenges, mainly related to the scaling up of complexity due to exponential state space growth, coordination issues, and the propagation of rewards among agents. In this paper, we identify the main issues that offer opportunities to develop innovative solutions towards fully scalable cooperative multi-agent systems.

Output feedback reinforcement learning based optimal output synchronisation of heterogeneous discrete-time multi-agent systems

IET Control Theory and Applications ◽

10.1049/iet-cta.2018.6266 ◽

2019 ◽

Vol 13 (17) ◽

pp. 2866-2876

Author(s):

Syed Ali Asad Rizvi ◽

Zongli Lin

Keyword(s):

Reinforcement Learning ◽

Discrete Time ◽

Output Feedback ◽

Multi Agent Systems ◽

Agent Systems ◽

Optimal Output ◽

Multi Agent

A novel optimal bipartite consensus control scheme for unknown multi-agent systems via model-free reinforcement learning

Applied Mathematics and Computation ◽

10.1016/j.amc.2019.124821 ◽

2020 ◽

Vol 369 ◽

pp. 124821 ◽

Cited By ~ 10

Author(s):

Zhinan Peng ◽

Jiangping Hu ◽

Kaibo Shi ◽

Rui Luo ◽

Rui Huang ◽

...

Keyword(s):

Reinforcement Learning ◽

Multi Agent Systems ◽

Consensus Control ◽

Agent Systems ◽

Model Free ◽

Control Scheme ◽

Multi Agent ◽

Bipartite Consensus

Formation Control using Simplified Reinforcement Learning for Multi-agent systems with State Delay

10.23919/ccc52363.2021.9549357 ◽

2021 ◽

Author(s):

Wentai Shao ◽

Yutao Chen ◽

Jie Huang

Keyword(s):

Reinforcement Learning ◽

Formation Control ◽

Multi Agent Systems ◽

State Delay ◽

Agent Systems ◽

Multi Agent

Optimized Backstepping Consensus Control Using Reinforcement Learning for a Class of Nonlinear Strict-Feedback-Dynamic Multi-Agent Systems

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2021.3105548 ◽

2021 ◽

pp. 1-13

Author(s):

Guoxing Wen ◽

C. L. Philip Chen

Keyword(s):

Reinforcement Learning ◽

Multi Agent Systems ◽

Consensus Control ◽

Agent Systems ◽

Multi Agent ◽

Strict Feedback

Optimal robust formation control for heterogeneous multi‐agent systems based on reinforcement learning

International Journal of Robust and Nonlinear Control ◽

10.1002/rnc.5828 ◽

2021 ◽

Author(s):

Bing Yan ◽

Peng Shi ◽

Cheng‐Chew Lim ◽

Zhiyuan Shi

Keyword(s):

Reinforcement Learning ◽

Formation Control ◽

Multi Agent Systems ◽

Agent Systems ◽

Multi Agent

Improvement on Supporting Machine Learning Algorithm for Solving Problem in Immediate Decision Making

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.566.572 ◽

2012 ◽

Vol 566 ◽

pp. 572-579

Author(s):

Abdolkarim Niazi ◽

Norizah Redzuan ◽

Raja Ishak Raja Hamzah ◽

Sara Esfandiari

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent Systems ◽

Combined Model ◽

Q Learning ◽

Agent Systems ◽

Multi Agent ◽

Case Base ◽

Case Base Reasoning ◽

Robotic Tool

In this paper, a new algorithm based on case base reasoning and reinforcement learning (RL) is proposed to increase the convergence rate of reinforcement learning algorithms. RL algorithms are very useful for solving a wide variety of decision problems when models are not available and a correct decision must be made in every state of the system, as in multi-agent systems, artificial control systems, robotics, tool condition monitoring, etc. The proposed method investigates how to improve action selection in RL algorithms. A new combined model, using case base reasoning systems and a new optimized function, is proposed to select the action, which led to an improvement in algorithms based on Q-learning. The algorithm was used to solve the problem of cooperative Markov games, one of the models of Markov-based multi-agent systems. The experimental results indicated that the proposed algorithms perform better than the existing algorithms in terms of speed and accuracy of reaching the optimal policy.
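The general idea of letting a case base guide action selection in Q-learning can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the "optimized function" or the case representation, so the sketch assumes a single-agent toy chain environment, a case base that stores one (action, value) pair per state, and a fixed reuse probability. All names (`step`, `select_action`, `train`, `reuse_prob`) are hypothetical.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)  # 0 = left, 1 = right (hypothetical toy actions)
N_STATES = 6      # chain of states 0..5, goal at state 5


def step(state, action):
    """Toy chain environment: reward 1.0 only on reaching the last state."""
    nxt = min(state + 1, N_STATES - 1) if action == 1 else max(state - 1, 0)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def select_action(q, case_base, state, rng, epsilon=0.2, reuse_prob=0.5):
    """Reuse a stored case with probability reuse_prob, else act epsilon-greedily."""
    if state in case_base and rng.random() < reuse_prob:
        return case_base[state][0]
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state][a] for a in ACTIONS)
    # Break ties at random so the untrained agent still explores.
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=200, alpha=0.5, gamma=0.9, seed=0):
    rng = random.Random(seed)
    q = defaultdict(lambda: {a: 0.0 for a in ACTIONS})
    case_base = {}  # state -> (action, Q-value when the case was stored)
    for _ in range(episodes):
        state, done, steps = 0, False, 0
        while not done and steps < 100:
            action = select_action(q, case_base, state, rng)
            nxt, reward, done = step(state, action)
            # Standard tabular Q-learning update.
            target = reward + (0.0 if done else gamma * max(q[nxt].values()))
            q[state][action] += alpha * (target - q[state][action])
            # Retain the case only if it beats the one stored for this state.
            if q[state][action] > case_base.get(state, (None, 0.0))[1]:
                case_base[state] = (action, q[state][action])
            state, steps = nxt, steps + 1
    return q, case_base


if __name__ == "__main__":
    q, case_base = train()
    for s in range(N_STATES - 1):
        print(s, case_base[s][0], round(q[s][1], 3))
```

In this sketch the case base only biases which action is tried; the value update itself is ordinary Q-learning. A cooperative Markov game, as studied in the paper, would additionally require joint actions and a shared reward, which are omitted here.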
