Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning

In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve the effective control of the learning agent for the confrontation in the multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to achieve confrontation decision-making of multi-agent. In the process of training, the information of other agents is introduced to the critic network to improve the strategy of confrontation. The parameter sharing mechanism can reduce the loss of experience storage. In the DDPG algorithm, we use four neural networks to generate real-time action and Q-value function respectively and use a momentum mechanism to optimize the training process to accelerate the convergence rate for the neural network. Secondly, this paper introduces an auxiliary controller using a policy-based reinforcement learning (RL) method to achieve the assistant decision-making for the game agent. In addition, an effective reward function is used to help agents balance losses of enemies and our side. Furthermore, this paper also uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent can successfully learn to fight with the competitors and achieve a good winning rate. For large-scale confrontation scenarios, the knowledge transfer method can gradually improve the decision-making level of the learning agent.

Download Full-text

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining ◽

10.1145/3219819.3219993 ◽

2018 ◽

Cited By ~ 31

Author(s):

Kaixiang Lin ◽

Renyu Zhao ◽

Zhe Xu ◽

Jiayu Zhou

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Fleet Management ◽

Multi Agent

Download Full-text

A Distributed Multi-Agent Reinforcement Learning With Graph Decomposition Approach for Large-Scale Adaptive Traffic Signal Control

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2021.3131596 ◽

2021 ◽

pp. 1-13

Author(s):

Shan Jiang ◽

Yufei Huang ◽

Mohsen Jafari ◽

Mohammad Jalayer

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Graph Decomposition ◽

Traffic Signal ◽

Signal Control ◽

Traffic Signal Control ◽

Decomposition Approach ◽

Adaptive Traffic Signal Control ◽

Multi Agent

Download Full-text

Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning

2020 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn48605.2020.9206765 ◽

2020 ◽

Author(s):

Jun Yamada ◽

John Shawe-Taylor ◽

Zafeirios Fountas

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Predator Prey ◽

Multi Agent

Download Full-text

Engineering A Large-Scale Traffic Signal Control: A Multi-Agent Reinforcement Learning Approach

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) ◽

10.1109/infocomwkshps51825.2021.9484451 ◽

2021 ◽

Author(s):

Yue Chen ◽

Changle Li ◽

Wenwei Yue ◽

Hehe Zhang ◽

Guoqiang Mao

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Traffic Signal ◽

Signal Control ◽

Traffic Signal Control ◽

Learning Approach ◽

Multi Agent

Download Full-text

Multi-Agent Deep Reinforcement Learning for Solving Large-scale Air Traffic Flow Management Problem: A Time-Step Sequential Decision Approach

10.1109/dasc52595.2021.9594329 ◽

2021 ◽

Author(s):

Yifan Tang ◽

Yan Xu

Keyword(s):

Reinforcement Learning ◽

Traffic Flow ◽

Large Scale ◽

Management Problem ◽

Sequential Decision ◽

Time Step ◽

Flow Management ◽

Air Traffic Flow Management ◽

Traffic Flow Management ◽

Multi Agent

Download Full-text

Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6212 ◽

2020 ◽

Vol 34 (05) ◽

pp. 7219-7226

Author(s):

Hangyu Mao ◽

Wulong Liu ◽

Jianye Hao ◽

Jun Luo ◽

Dong Li ◽

...

Keyword(s):

Reinforcement Learning ◽

Large Scale ◽

Football Player ◽

Superior Performance ◽

Human Society ◽

Packet Routing ◽

Q Learning ◽

Cognitive Consistency ◽

Challenging Tasks ◽

Multi Agent

Social psychology and real experiences show that cognitive consistency plays an important role to keep human society in order: if people have a more consistent cognition about their environments, they are more likely to achieve better cooperation. Meanwhile, only cognitive consistency within a neighborhood matters because humans only interact directly with their neighbors. Inspired by these observations, we take the first step to introduce neighborhood cognitive consistency (NCC) into multi-agent reinforcement learning (MARL). Our NCC design is quite general and can be easily combined with existing MARL methods. As examples, we propose neighborhood cognition consistent deep Q-learning and Actor-Critic to facilitate large-scale multi-agent cooperations. Extensive experiments on several challenging tasks (i.e., packet routing, wifi configuration and Google football player control) justify the superior performance of our methods compared with state-of-the-art MARL approaches.

Download Full-text