Multi-Robot Collision Avoidance with Map-based Deep Reinforcement Learning

Developing a safe and efficient collision-avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generates its paths with limited observation of other robots’ states and intentions. Prior distributed multi-robot collision-avoidance systems often require frequent inter-robot communication or agent-level features to plan a local collision-free action, which is not robust and computationally prohibitive. In addition, the performance of these methods is not comparable with their centralized counterparts in practice. In this article, we present a decentralized sensor-level collision-avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent’s steering commands in terms of the movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to learn an optimal policy. The policy is trained over a large number of robots in rich, complex environments simultaneously using a policy-gradient-based reinforcement-learning algorithm. The learning algorithm is also integrated into a hybrid control framework to further improve the policy’s robustness and effectiveness. We validate the learned sensor-level collision-3avoidance policy in a variety of simulated and real-world scenarios with thorough performance evaluations for large-scale multi-robot systems. The generalization of the learned policy is verified in a set of unseen scenarios including the navigation of a group of heterogeneous robots and a large-scale scenario with 100 robots. Although the policy is trained using simulation data only, we have successfully deployed it on physical robots with shapes and dynamics characteristics that are different from the simulated agents, in order to demonstrate the controller’s robustness against the simulation-to-real modeling error. Finally, we show that the collision-avoidance policy learned from multi-robot navigation tasks provides an excellent solution for safe and effective autonomous navigation for a single robot working in a dense real human crowd. Our learned policy enables a robot to make effective progress in a crowd without getting stuck. More importantly, the policy has been successfully deployed on different types of physical robot platforms without tedious parameter tuning. Videos are available at https://sites.google.com/view/hybridmrca .

Download Full-text

Collision avoidance in multi-robot systems based on multi-layered reinforcement learning

Robotics and Autonomous Systems ◽

10.1016/s0921-8890(99)00035-4 ◽

1999 ◽

Vol 29 (1) ◽

pp. 21-32 ◽

Cited By ~ 11

Author(s):

Yoshikazu Arai ◽

Teruo Fujii ◽

Hajime Asama ◽

Hayato Kaetsu ◽

Isao Endo

Keyword(s):

Reinforcement Learning ◽

Collision Avoidance ◽

Robot Systems ◽

Multi Robot

Download Full-text

Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning

Sensors ◽

10.3390/s20174836 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4836

Author(s):

Guangda Chen ◽

Shunyi Yao ◽

Jun Ma ◽

Lifan Pan ◽

Yu’an Chen ◽

...

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Mobile Robots ◽

Collision Avoidance ◽

Positive Effects ◽

Movement Data ◽

Multi Stage ◽

Local Grid ◽

Multiple Scenarios ◽

Multi Robot

It is challenging to avoid obstacles safely and efficiently for multiple robots of different shapes in distributed and communication-free scenarios, where robots do not communicate with each other and only sense other robots’ positions and obstacles around them. Most existing multi-robot collision avoidance systems either require communication between robots or require expensive movement data of other robots, like velocities, accelerations and paths. In this paper, we propose a map-based deep reinforcement learning approach for multi-robot collision avoidance in a distributed and communication-free environment. We use the egocentric local grid map of a robot to represent the environmental information around it including its shape and observable appearances of other robots and obstacles, which can be easily generated by using multiple sensors or sensor fusion. Then we apply the distributed proximal policy optimization (DPPO) algorithm to train a convolutional neural network that directly maps three frames of egocentric local grid maps and the robot’s relative local goal positions into low-level robot control commands. Compared to other methods, the map-based approach is more robust to noisy sensor data, does not require robots’ movement data and considers sizes and shapes of related robots, which make it to be more efficient and easier to be deployed to real robots. We first train the neural network in a specified simulator of multiple mobile robots using DPPO, where a multi-stage curriculum learning strategy for multiple scenarios is used to improve the performance. Then we deploy the trained model to real robots to perform collision avoidance in their navigation without tedious parameter tuning. We evaluate the approach with multiple scenarios both in the simulator and on four differential-drive mobile robots in the real world. Both qualitative and quantitative experiments show that our approach is efficient and outperforms existing DRL-based approaches in many indicators. We also conduct ablation studies showing the positive effects of using egocentric grid maps and multi-stage curriculum learning.

Download Full-text

Integral Reinforcement Learning-Based Multi-Robot Minimum Time-Energy Path Planning Subject to Collision Avoidance and Unknown Environmental Disturbances

IEEE Control Systems Letters ◽

10.1109/lcsys.2020.3007663 ◽

2021 ◽

Vol 5 (3) ◽

pp. 983-988

Author(s):

Chenyuan He ◽

Yan Wan ◽

Yixin Gu ◽

Frank L. Lewis

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Collision Avoidance ◽

Minimum Time ◽

Environmental Disturbances ◽

Multi Robot

Download Full-text

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

2018 IEEE International Conference on Robotics and Automation (ICRA) ◽

10.1109/icra.2018.8461113 ◽

2018 ◽

Cited By ~ 52

Author(s):

Pinxin Long ◽

Tingxiang Fanl ◽

Xinyi Liao ◽

Wenxi Liu ◽

Hao Zhang ◽

...

Keyword(s):

Reinforcement Learning ◽

Collision Avoidance ◽

Multi Robot

Download Full-text

Distributed Reinforcement Learning for Multi-robot Decentralized Collective Construction

Distributed Autonomous Robotic Systems - Springer Proceedings in Advanced Robotics ◽

10.1007/978-3-030-05816-6_3 ◽

2019 ◽

pp. 35-49 ◽

Cited By ~ 10

Author(s):

Guillaume Sartoretti ◽

Yue Wu ◽

William Paivine ◽

T. K. Satish Kumar ◽

Sven Koenig ◽

...

Keyword(s):

Reinforcement Learning ◽

Collective Construction ◽

Distributed Reinforcement ◽

Multi Robot

Download Full-text

Collision Avoidance in IEEE 802.11 DCF using a Reinforcement Learning Method

2020 International Conference on Information and Communication Technology Convergence (ICTC) ◽

10.1109/ictc49870.2020.9289402 ◽

2020 ◽

Author(s):

Chang Kyu Lee ◽

Seung Hyong Rhee

Keyword(s):

Reinforcement Learning ◽

Collision Avoidance ◽

Ieee 802.11 ◽

Learning Method ◽

Ieee 802.11 Dcf ◽

802.11 Dcf

Download Full-text

Reinforcement-Learning-Based Asynchronous Formation Control Scheme for Multiple Unmanned Surface Vehicles

Applied Sciences ◽

10.3390/app11020546 ◽

2021 ◽

Vol 11 (2) ◽

pp. 546

Author(s):

Jiajia Xie ◽

Rui Zhou ◽

Yuan Liu ◽

Jun Luo ◽

Shaorong Xie ◽

...

Keyword(s):

Reinforcement Learning ◽

Formation Control ◽

Rapid Development ◽

Gradient Algorithm ◽

Robot System ◽

Physical Relationship ◽

Unmanned Surface Vehicles ◽

Main Challenge ◽

Control Scheme ◽

Multi Robot

The high performance and efficiency of multiple unmanned surface vehicles (multi-USV) promote the further civilian and military applications of coordinated USV. As the basis of multiple USVs’ cooperative work, considerable attention has been spent on developing the decentralized formation control of the USV swarm. Formation control of multiple USV belongs to the geometric problems of a multi-robot system. The main challenge is the way to generate and maintain the formation of a multi-robot system. The rapid development of reinforcement learning provides us with a new solution to deal with these problems. In this paper, we introduce a decentralized structure of the multi-USV system and employ reinforcement learning to deal with the formation control of a multi-USV system in a leader–follower topology. Therefore, we propose an asynchronous decentralized formation control scheme based on reinforcement learning for multiple USVs. First, a simplified USV model is established. Simultaneously, the formation shape model is built to provide formation parameters and to describe the physical relationship between USVs. Second, the advantage deep deterministic policy gradient algorithm (ADDPG) is proposed. Third, formation generation policies and formation maintenance policies based on the ADDPG are proposed to form and maintain the given geometry structure of the team of USVs during movement. Moreover, three new reward functions are designed and utilized to promote policy learning. Finally, various experiments are conducted to validate the performance of the proposed formation control scheme. Simulation results and contrast experiments demonstrate the efficiency and stability of the formation control scheme.

Download Full-text

Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player

2021 IEEE/SICE International Symposium on System Integration (SII) ◽

10.1109/ieeeconf49454.2021.9382693 ◽

2021 ◽

Author(s):

Hanlin Niu ◽

Ze Ji ◽

Farshad Arvin ◽

Barry Lennox ◽

Hujun Yin ◽

...

Keyword(s):

Reinforcement Learning ◽

Collision Avoidance ◽

Human Player

Download Full-text

Multi-Robot Collision Avoidance with Map-based Deep Reinforcement Learning

Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning

Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios

Collision avoidance in multi-robot systems based on multi-layered reinforcement learning

Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning

Integral Reinforcement Learning-Based Multi-Robot Minimum Time-Energy Path Planning Subject to Collision Avoidance and Unknown Environmental Disturbances

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

Distributed Reinforcement Learning for Multi-robot Decentralized Collective Construction

Collision Avoidance in IEEE 802.11 DCF using a Reinforcement Learning Method

Reinforcement-Learning-Based Asynchronous Formation Control Scheme for Multiple Unmanned Surface Vehicles

Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player

Export Citation Format