Asynchronous reinforcement learning algorithms for solving discrete space path planning problems

Background: Unmanned systems have been widely used in multiple fields. Many algorithms have been proposed to solve path planning problems. Each algorithm has its advantages and defects and cannot adapt to all kinds of requirements. An appropriate path planning method is needed for various applications. Objective: To select an appropriate algorithm fastly in a given application. This could be helpful for improving the efficiency of path planning for Unmanned systems. Methods: This paper proposes to represent and quantify the features of algorithms based on the physical indicators of results. At the same time, an algorithmic collaborative scheme is developed to search the appropriate algorithm according to the requirement of the application. As an illustration of the scheme, four algorithms, including the A-star (A*) algorithm, reinforcement learning, genetic algorithm, and ant colony optimization algorithm, are implemented in the representation of their features. Results: In different simulations, the algorithmic collaborative scheme can select an appropriate algorithm in a given application based on the representation of algorithms. And the algorithm could plan a feasible and effective path. Conclusion: An algorithmic collaborative scheme is proposed, which is based on the representation of algorithms and requirement of the application. The simulation results prove the feasibility of the scheme and the representation of algorithms.

Download Full-text

Reinforcement learning-based radar-evasive path planning: a comparative analysis

The Aeronautical Journal ◽

10.1017/aer.2021.85 ◽

2021 ◽

pp. 1-18

Author(s):

R.U. Hameed ◽

A. Maqsood ◽

A.J. Hashmi ◽

M.T. Saeed ◽

R. Riaz

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Learning Algorithms ◽

Optimal Path ◽

Trust Region ◽

Optimal Path Planning ◽

Optimal Paths ◽

Policy Gradient ◽

Model Aircraft ◽

Tracking Model

Abstract This paper discusses the utilisation of deep reinforcement learning algorithms to obtain optimal paths for an aircraft to avoid or minimise radar detection and tracking. A modular approach is adopted to formulate the problem, including the aircraft kinematics model, aircraft radar cross-section model and radar tracking model. A virtual environment is designed for single and multiple radar cases to obtain optimal paths. The optimal trajectories are generated through deep reinforcement learning in this study. Specifically, three algorithms, namely deep deterministic policy gradient, trust region policy optimisation and proximal policy optimisation, are used to find optimal paths for five test cases. The comparison is carried out based on six performance indicators. The investigation proves the importance of these reinforcement learning algorithms in optimal path planning. The results indicate that the proximal policy optimisation approach performed better for optimal paths in general.

Download Full-text

Reinforcement Learning Based Approach For Urban Resource Allocation and Path Planning Problems

2020 International Conference on Intelligent Data Science Technologies and Applications (IDSTA) ◽

10.1109/idsta50958.2020.9264062 ◽

2020 ◽

Author(s):

Muhammad Fazalul Rahman ◽

Naveen Sharma

Keyword(s):

Resource Allocation ◽

Reinforcement Learning ◽

Path Planning ◽

Planning Problems

Download Full-text

Reinforcement Learning Algorithms in Global Path Planning for Mobile Robot

2019 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM) ◽

10.1109/icieam.2019.8742915 ◽

2019 ◽

Cited By ~ 3

Author(s):

Valentyn N. Sichkar

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Mobile Robot ◽

Learning Algorithms ◽

Global Path Planning

Download Full-text

Simulation-Based Evaluations of Reinforcement Learning Algorithms for Autonomous Mobile Robot Path Planning

Lecture Notes in Electrical Engineering - IT Convergence and Services ◽

10.1007/978-94-007-2598-0_49 ◽

2011 ◽

pp. 467-476 ◽

Cited By ~ 6

Author(s):

Hoang Huu Viet ◽

Phyo Htet Kyaw ◽

TaeChoong Chung

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Mobile Robot ◽

Learning Algorithms ◽

Autonomous Mobile Robot ◽

Robot Path Planning ◽

Simulation Based ◽

Robot Path

Download Full-text

Cognitive Radio Networks with Reinforcement Learning Algorithms for Spectrum Allocation: A Survey

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2020/211952020 ◽

2020 ◽

Vol 9 (5) ◽

pp. 8371-8384

Keyword(s):

Reinforcement Learning ◽

Cognitive Radio ◽

Cognitive Radio Networks ◽

Learning Algorithms ◽

Radio Networks ◽

Spectrum Allocation

Download Full-text

Multi-Agent Reinforcement Learning: A Review of Challenges and Applications

Applied Sciences ◽

10.3390/app11114948 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4948

Author(s):

Lorenzo Canese ◽

Gian Carlo Cardarilli ◽

Luca Di Di Nunzio ◽

Rocco Fazzolari ◽

Daniele Giardino ◽

...

Keyword(s):

Reinforcement Learning ◽

Mathematical Models ◽

Learning Algorithms ◽

Single Agent ◽

Critical Issues ◽

Multi Agent ◽

Pros And Cons ◽

Application Fields

In this review, we present an analysis of the most used multi-agent reinforcement learning algorithms. Starting with the single-agent reinforcement learning algorithms, we focus on the most critical issues that must be taken into account in their extension to multi-agent scenarios. The analyzed algorithms were grouped according to their features. We present a detailed taxonomy of the main multi-agent approaches proposed in the literature, focusing on their related mathematical models. For each algorithm, we describe the possible application fields, while pointing out its pros and cons. The described multi-agent algorithms are compared in terms of the most important characteristics for multi-agent reinforcement learning applications—namely, nonstationarity, scalability, and observability. We also describe the most common benchmark environments used to evaluate the performances of the considered methods.

Download Full-text