On the rationality of profit sharing in multi-agent reinforcement learning

Exploitation-oriented Learning XoL is a new framework of reinforcement learning. XoL aims to learn a rational policy whose expected reward per an action is larger than zero, and does not require a sophisticated design of the value of a reward signal. In this chapter, as examples of learning systems that belongs in XoL, we introduce the rationality theorem of profit Sharing (PS), the rationality theorem of reward sharing in multi-agent PS, and PS-r*. XoL has several features. (1) Though traditional RL systems require appropriate reward and penalty values, XoL only requires an order of importance among them. (2) XoL can learn more quickly since it traces successful experiences very strongly. (3) XoL may be unsuitable for pursuing an optimal policy. The optimal policy can be acquired by the multi-start method that needs to reset all memories to get a better policy. (4) XoL is effective on the classes beyond MDPs, since it is a Bellman-free method that does not depend on DP. We show several numerical examples to confirm these features.

Download Full-text

Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control

CICTP 2020 ◽

10.1061/9780784483053.039 ◽

2020 ◽

Author(s):

Yang Zhao ◽

Jian-Ming Hu ◽

Ming-Yang Gao ◽

Zuo Zhang

Keyword(s):

Reinforcement Learning ◽

Traffic Signal ◽

Signal Control ◽

Traffic Signal Control ◽

Multi Agent

Download Full-text

Output feedback reinforcement learning based optimal output synchronisation of heterogeneous discrete-time multi-agent systems

IET Control Theory and Applications ◽

10.1049/iet-cta.2018.6266 ◽

2019 ◽

Vol 13 (17) ◽

pp. 2866-2876

Author(s):

Syed Ali Asad Rizvi ◽

Zongli Lin

Keyword(s):

Reinforcement Learning ◽

Discrete Time ◽

Output Feedback ◽

Multi Agent Systems ◽

Agent Systems ◽

Optimal Output ◽

Multi Agent

Download Full-text

Multi-agent deep reinforcement learning with type-based hierarchical group communication

Applied Intelligence ◽

10.1007/s10489-020-02065-9 ◽

2021 ◽

Author(s):

Hao Jiang ◽

Dianxi Shi ◽

Chao Xue ◽

Yajie Wang ◽

Gongju Wang ◽

...

Keyword(s):

Reinforcement Learning ◽

Group Communication ◽

Multi Agent ◽

Hierarchical Group

Download Full-text

Multi-Agent Deep Reinforcement Learning Based Cooperative Edge Caching for Ultra-Dense Next-Generation Networks

IEEE Transactions on Communications ◽

10.1109/tcomm.2020.3044298 ◽

2020 ◽

pp. 1-1

Author(s):

Shuangwu Chen ◽

Zhen Yao ◽

Xiaofeng Jiang ◽

Jian Yang ◽

Lajos Hanzo

Keyword(s):

Reinforcement Learning ◽

Next Generation Networks ◽

Next Generation ◽

Multi Agent ◽

Edge Caching

Download Full-text

Coordinated Ramp Metering Control Based on Multi-Agent Reinforcement Learning

2020 35th Youth Academic Annual Conference of Chinese Association of Automation (YAC) ◽

10.1109/yac51587.2020.9337711 ◽

2020 ◽

Author(s):

Jiyuan Tan ◽

Qianqian Qiu ◽

Weiwei Guo

Keyword(s):

Reinforcement Learning ◽

Ramp Metering ◽

Multi Agent

Download Full-text

Multi-Agent Deep Reinforcement Learning for Vehicular Computation Offloading in IoT

IEEE Internet of Things Journal ◽

10.1109/jiot.2020.3040768 ◽

2020 ◽

pp. 1-1 ◽

Cited By ~ 1

Author(s):

Xiaoyu Zhu ◽

Yueyi Luo ◽

Anfeng Liu ◽

Md Zakirul Alam Bhuiyan ◽

Shaobo Zhang

Keyword(s):

Reinforcement Learning ◽

Computation Offloading ◽

Multi Agent

Download Full-text

Multi-Agent Reinforcement Learning: A Review of Challenges and Applications

Applied Sciences ◽

10.3390/app11114948 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4948

Author(s):

Lorenzo Canese ◽

Gian Carlo Cardarilli ◽

Luca Di Di Nunzio ◽

Rocco Fazzolari ◽

Daniele Giardino ◽

...

Keyword(s):

Reinforcement Learning ◽

Mathematical Models ◽

Learning Algorithms ◽

Single Agent ◽

Critical Issues ◽

Multi Agent ◽

Pros And Cons ◽

Application Fields

In this review, we present an analysis of the most used multi-agent reinforcement learning algorithms. Starting with the single-agent reinforcement learning algorithms, we focus on the most critical issues that must be taken into account in their extension to multi-agent scenarios. The analyzed algorithms were grouped according to their features. We present a detailed taxonomy of the main multi-agent approaches proposed in the literature, focusing on their related mathematical models. For each algorithm, we describe the possible application fields, while pointing out its pros and cons. The described multi-agent algorithms are compared in terms of the most important characteristics for multi-agent reinforcement learning applications—namely, nonstationarity, scalability, and observability. We also describe the most common benchmark environments used to evaluate the performances of the considered methods.

Download Full-text

Multi-Agent Deep Reinforcement Learning based Interdependent Critical Infrastructure Simulation Model for Situational Awareness during a Flood Event

IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss39084.2020.9323380 ◽

2020 ◽

Author(s):

Parashuram Shourya Rajulapati ◽

Nivedita Nukavarapu ◽

Surya Durbha

Keyword(s):

Reinforcement Learning ◽

Simulation Model ◽

Situational Awareness ◽

Critical Infrastructure ◽

Flood Event ◽

Multi Agent

Download Full-text