A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System

In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve the effective control of the learning agent for the confrontation in the multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to achieve confrontation decision-making of multi-agent. In the process of training, the information of other agents is introduced to the critic network to improve the strategy of confrontation. The parameter sharing mechanism can reduce the loss of experience storage. In the DDPG algorithm, we use four neural networks to generate real-time action and Q-value function respectively and use a momentum mechanism to optimize the training process to accelerate the convergence rate for the neural network. Secondly, this paper introduces an auxiliary controller using a policy-based reinforcement learning (RL) method to achieve the assistant decision-making for the game agent. In addition, an effective reward function is used to help agents balance losses of enemies and our side. Furthermore, this paper also uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent can successfully learn to fight with the competitors and achieve a good winning rate. For large-scale confrontation scenarios, the knowledge transfer method can gradually improve the decision-making level of the learning agent.

Download Full-text

Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning

2020 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata50022.2020.9378191 ◽

2020 ◽

Author(s):

Chi Zhang ◽

Philip Odonkor ◽

Shuai Zheng ◽

Hamed Khorasgani ◽

Susumu Serita ◽

...

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Heterogeneous Fleet ◽

Multi Agent ◽

Dynamic Dispatching

Download Full-text

Adaptive Traffic Signal Control for large-scale scenario with Cooperative Group-based Multi-agent reinforcement learning

Transportation Research Part C Emerging Technologies ◽

10.1016/j.trc.2021.103046 ◽

2021 ◽

Vol 125 ◽

pp. 103046

Author(s):

Tong Wang ◽

Jiahua Cao ◽

Azhar Hussain

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Traffic Signal ◽

Signal Control ◽

Traffic Signal Control ◽

Cooperative Group ◽

Adaptive Traffic Signal Control ◽

Multi Agent

Download Full-text

Parameter Sharing Reinforcement Learning Architecture for Multi Agent Driving

Proceedings of the Advances in Robotics 2019 ◽

10.1145/3352593.3352625 ◽

2019 ◽

Author(s):

Meha Kaushik ◽

Nirvan Singhania ◽

Phaniteja S. ◽

K. Madhava Krishna

Keyword(s):

Reinforcement Learning ◽

Multi Agent ◽

Parameter Sharing

Download Full-text

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining ◽

10.1145/3219819.3219993 ◽

2018 ◽

Cited By ~ 31

Author(s):

Kaixiang Lin ◽

Renyu Zhao ◽

Zhe Xu ◽

Jiayu Zhou

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Fleet Management ◽

Multi Agent

Download Full-text

A Negotiation Model for Collaborative Decision Making in Large-Scale Multi-Agent Systems

Innovations and Advanced Techniques in Computer and Information Sciences and Engineering ◽

10.1007/978-1-4020-6268-1_82 ◽

2007 ◽

pp. 463-468

Author(s):

Tom Wanyama

Keyword(s):

Decision Making ◽

Large Scale ◽

Multi Agent Systems ◽

Collaborative Decision Making ◽

Negotiation Model ◽

Agent Systems ◽

Multi Agent ◽

Collaborative Decision

Download Full-text

A Distributed Multi-Agent Reinforcement Learning With Graph Decomposition Approach for Large-Scale Adaptive Traffic Signal Control

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2021.3131596 ◽

2021 ◽

pp. 1-13

Author(s):

Shan Jiang ◽

Yufei Huang ◽

Mohsen Jafari ◽

Mohammad Jalayer

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Graph Decomposition ◽

Traffic Signal ◽

Signal Control ◽

Traffic Signal Control ◽

Decomposition Approach ◽

Adaptive Traffic Signal Control ◽

Multi Agent

Download Full-text

Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning

2020 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn48605.2020.9206765 ◽

2020 ◽

Author(s):

Jun Yamada ◽

John Shawe-Taylor ◽

Zafeirios Fountas

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Predator Prey ◽

Multi Agent

Download Full-text

Development of Reinforcement Learning Methods in Control and Decision Making in the Large Scale Dynamic Game Environments

IEEE International Symposium on Intelligent Control ◽

10.1109/isic.2006.285608 ◽

2006 ◽

Author(s):

S. Orafa ◽

M.J. Yazdanpanah ◽

C. Lucas ◽

A. Rahimikian ◽

M. Ahmadabadi

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Large Scale ◽

Dynamic Game ◽

Learning Methods ◽

Game Environments

Download Full-text

Who defines the need for fishery reform? Participants, discourses and networks in the reform of the Greenland fishery

Polar Record ◽

10.1017/s0032247414000126 ◽

2014 ◽

Vol 50 (4) ◽

pp. 391-402 ◽

Cited By ~ 3

Author(s):

Rikke Becker Jacobsen ◽

Jesper Raakjær

Keyword(s):

Decision Making ◽

Large Scale ◽

Policy Networks ◽

Small Scale ◽

Policy Network ◽

Informal Networks ◽

Fisheries Governance ◽

Coastal Fisheries ◽

Institutional Learning ◽

Actual Decision

ABSTRACTThis article investigates recent reforms of the Greenland coastal fisheries in order to contribute to the general lessons on reform and policy networks in the context of a changing Arctic stakeholdership. It analyses participation in fisheries governance decision-making by examining the emergence of discourses and policy networks that come to define the very need for reform. A policy network is identified across state ministries, powerful officials, banks and large scale industry that defined the need for fisheries reform within a ‘grand reform’ discourse. But inertia characterised the actual decision-making process as reform according to this ‘grand reform’ discourse was blocked by a combination of small-scale fishers’ informal networks and the power of the parliamentary majority. After a parliamentary shift in power the new government implemented the ‘grand reform’ gradually whilst new patterns of participation and exclusion emerged. In this process, the identities of the participating participants were reinterpreted to fit the new patterns of influence and participation. The article argues that fishery reform does not necessarily start with the collective recognition of a problem in marine resource use and a power-neutral process of institutional learning. Instead, it argues that fishery reform is likely to be the ‘reform of somebody’ and that this ‘somebody’ is itself a changing identity.

Download Full-text

Comparative Study on Decision-making Process of Governmental Reorganization: With Special Reference to Large-scale and Small-scale Reorganization

Korean Review of Organizational Studies ◽

10.21484/kros.2009.6.3.143 ◽

2009 ◽

Vol 6 (3) ◽

pp. 143-172 ◽

Cited By ~ 1

Author(s):

Dae Shik Park

Keyword(s):

Decision Making ◽

Comparative Study ◽

Special Reference ◽

Large Scale ◽

Small Scale ◽

Decision Making Process

Download Full-text