Hybrid Bidirectional Rapidly Exploring Random Tree Path Planning Algorithm with Reinforcement Learning

Randomness in path generation and slow convergence to the optimal path are two major problems of the rapidly exploring random tree (RRT) path planning algorithm. Herein, a novel reinforcement-learning-based hybrid bidirectional rapidly exploring random tree (H-BRRT) is presented to solve these problems. To model the random exploration process, a target gravitational strategy is introduced. Reinforcement learning is applied to the improved target gravitational strategy using two operations: random exploration and target gravitational exploration. The algorithm switches between these operations adaptively according to the accumulated performance. Applied to a bidirectional rapidly exploring random tree (BRRT), the proposed strategy both improves search efficiency and shortens the generated path. In addition, to keep the traditional RRT from repeatedly falling into local optima, an improved exploration strategy with a collision weight is applied to the BRRT. Experimental results obtained in a robot operating system indicate that the proposed H-BRRT significantly outperforms alternative approaches such as the RRT and BRRT. The proposed algorithm enhances the capability of identifying unknown spaces and avoiding local optima.
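The two operations named in the abstract, and the adaptive switch between them, can be illustrated with a minimal sketch. The abstract does not give the paper's update rule, so everything here is an assumption: `extend`, `ModeSelector`, the gain `k_goal`, and the epsilon-greedy rule are hypothetical stand-ins, not the authors' method.

```python
import math
import random


def unit(dx, dy):
    """Return the unit vector of (dx, dy), or (0, 0) for a zero vector."""
    n = math.hypot(dx, dy)
    return (0.0, 0.0) if n == 0.0 else (dx / n, dy / n)


def extend(nearest, sample, goal, step=1.0, k_goal=0.0):
    """One tree extension from `nearest`.

    k_goal = 0 is plain random exploration toward `sample`;
    k_goal > 0 adds a pull toward `goal` (assumed gravitational term).
    """
    rx, ry = unit(sample[0] - nearest[0], sample[1] - nearest[1])
    gx, gy = unit(goal[0] - nearest[0], goal[1] - nearest[1])
    dx, dy = unit(rx + k_goal * gx, ry + k_goal * gy)
    return (nearest[0] + step * dx, nearest[1] + step * dy)


class ModeSelector:
    """Epsilon-greedy choice between the two operations (an assumed rule),
    driven by the average reward each operation has accumulated."""

    MODES = ("random", "gravity")

    def __init__(self, epsilon=0.1, seed=0):
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.total = {m: 0.0 for m in self.MODES}
        self.count = {m: 0 for m in self.MODES}

    def value(self, mode):
        return self.total[mode] / self.count[mode] if self.count[mode] else 0.0

    def choose(self):
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.MODES)
        return max(self.MODES, key=self.value)

    def update(self, mode, reward):
        self.total[mode] += reward
        self.count[mode] += 1
```

In a planner loop, one would call `choose()`, extend with `k_goal=0` or a positive gain accordingly, and then call `update()` with a reward such as the progress made toward the goal, or a penalty when the new node collides.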

A 2D Optimal Path Planning Algorithm for Autonomous Underwater Vehicle Driving in Unknown Underwater Canyons

Journal of Marine Science and Engineering ◽ 10.3390/jmse9030252 ◽ 2021 ◽ Vol 9 (3) ◽ pp. 252

Author(s): Yushan Sun ◽ Xiaokun Luo ◽ Xiangrui Ran ◽ Guocheng Zhang

Keyword(s): Path Planning ◽ Obstacle Avoidance ◽ Autonomous Underwater Vehicles ◽ Optimal Path ◽ Small Scale ◽ Target Point ◽ Safe Driving ◽ Policy Gradient ◽ Planning Algorithm ◽ Path Planning Algorithm

This research addresses the safe navigation of autonomous underwater vehicles (AUVs) in the deep ocean, a complex and changeable environment with mountainous terrain. When navigating in the deep sea, an AUV encounters many underwater canyons whose hard valley walls seriously threaten its safety. To enable safe driving of AUVs in underwater canyons and autonomous obstacle avoidance in uncertain environments, this work proposes an improved AUV path planning algorithm based on the deep deterministic policy gradient (DDPG) algorithm. The method is an end-to-end path planning algorithm that optimizes the policy directly: it takes sensor information as input and outputs driving speed and yaw angle. The planner can reach the predetermined target point while avoiding large-scale static obstacles, such as valley walls in the simulated underwater canyon environment, as well as sudden small-scale dynamic obstacles, such as marine life and other vehicles. In addition, to handle the multi-objective structure of obstacle-avoiding path planning, the reward function is designed in a modular way and combined with the artificial potential field method to provide continuous rewards. This research also proposes a new algorithm called the deep SumTree-deterministic policy gradient algorithm (SumTree-DDPG), which improves on DDPG's random storage and extraction of experience samples. Samples are classified by importance and stored in a SumTree structure so that high-quality samples are extracted continuously, which speeds up model convergence.
Finally, an underwater canyon simulation environment is written in Python, and a deep reinforcement learning simulation platform is built on a high-performance computer to train the AUV in simulation. Simulation results verify that the proposed path planning method can guide the under-actuated underwater robot to the target without colliding with any obstacle. Compared with the DDPG algorithm, the improved SumTree-DDPG planner achieves better stability, total training reward, and robustness.
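The SumTree structure mentioned in the abstract is commonly used for priority-proportional sampling of replay experience. The following is a minimal sketch of that general data structure, not the paper's implementation; the class layout and method names are assumptions chosen for illustration.

```python
class SumTree:
    """Binary tree stored in a flat list: leaves hold sample priorities,
    every internal node holds the sum of its two children."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.tree = [0.0] * (2 * capacity - 1)  # internal nodes, then leaves
        self.data = [None] * capacity           # stored experience samples
        self.write = 0                          # next slot to overwrite

    def total(self):
        """Sum of all priorities (value at the root)."""
        return self.tree[0]

    def add(self, priority, item):
        """Store `item` with `priority`, overwriting the oldest slot when full."""
        leaf = self.write + self.capacity - 1
        self.data[self.write] = item
        self.update(leaf, priority)
        self.write = (self.write + 1) % self.capacity

    def update(self, leaf, priority):
        """Set a leaf priority and propagate the change up to the root."""
        change = priority - self.tree[leaf]
        self.tree[leaf] = priority
        while leaf > 0:
            leaf = (leaf - 1) // 2
            self.tree[leaf] += change

    def get(self, s):
        """Descend from the root to the leaf whose cumulative range contains s."""
        idx = 0
        while 2 * idx + 1 < len(self.tree):
            left = 2 * idx + 1
            if s <= self.tree[left]:
                idx = left
            else:
                s -= self.tree[left]
                idx = left + 1
        return idx, self.tree[idx], self.data[idx - self.capacity + 1]
```

Drawing `s` uniformly from the interval between 0 and `total()` and calling `get(s)` returns each stored sample with probability proportional to its priority, so high-priority experience is replayed more often than under uniform sampling.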

UCAV Path Planning Algorithm Based on Deep Reinforcement Learning

Lecture Notes in Computer Science - Image and Graphics ◽ 10.1007/978-3-030-34110-7_59 ◽ 2019 ◽ pp. 702-714

Author(s): Kaiyuan Zheng ◽ Jingpeng Gao ◽ Liangxi Shen

Keyword(s): Reinforcement Learning ◽ Path Planning ◽ Planning Algorithm ◽ Path Planning Algorithm

Optimal Collision-Free Path Planning for an Autonomous Multi-Wheeled Combat Vehicle

Volume 3: 19th International Conference on Advanced Vehicle Technologies; 14th International Conference on Design Education; 10th Frontiers in Biomedical Devices ◽ 10.1115/detc2017-67025 ◽ 2017

Author(s): Amr Mohamed ◽ Moustafa El-Gindy ◽ Jing Ren ◽ Haoxiang Lang

Keyword(s): Path Planning ◽ Potential Field ◽ Vehicle Dynamics ◽ Autonomous Vehicle ◽ Optimal Path ◽ Vehicle Model ◽ Optimal Path Planning ◽ Combat Vehicle ◽ Planning Algorithm ◽ Path Planning Algorithm

This paper presents an optimal collision-free path planning algorithm for an autonomous multi-wheeled combat vehicle using optimal control theory and an artificial potential field (APF) function. The optimal path of the autonomous vehicle between given start and goal points is generated by an optimal path planning algorithm. The cost function of the path planning problem is solved together with the vehicle dynamics equations to satisfy the vehicle dynamics constraints and the boundary conditions. For this purpose, a simplified four-axle bicycle model of the actual vehicle is used, which considers the lateral and yaw dynamics of the vehicle body while neglecting roll dynamics. Obstacle avoidance is modeled mathematically with the proposed sigmoid function serving as the artificial potential field. This potential function is assigned to each obstacle as a repulsive potential field. Including these potential fields yields a new APF that controls the steering angle of the autonomous vehicle to reach the goal point. A full nonlinear multi-wheeled combat vehicle model in TruckSim is used for validation: the optimal path data generated by the MATLAB path planning algorithm are imported, and lateral acceleration, yaw rate, and curvature are compared at different speeds (9 km/h, 28 km/h) for the simplified and TruckSim vehicle models. The simulation results show that the obtained optimal path satisfies all vehicle dynamics constraints and is successfully validated against the TruckSim vehicle model.
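The abstract does not state the exact sigmoid used for the repulsive field, so the sketch below assumes one plausible form: a logistic function of the distance to the obstacle center that is close to `amplitude` inside the obstacle radius and decays to zero outside it. The function names, the `sharpness` parameter, and the quadratic attractive term are all illustrative assumptions, and the sketch only evaluates potentials rather than steering a vehicle model.

```python
import math


def sigmoid_repulsion(pos, obstacle, radius, amplitude=1.0, sharpness=4.0):
    """Assumed repulsive potential: a logistic step centered on the obstacle
    boundary, equal to amplitude / 2 exactly at distance `radius`."""
    d = math.hypot(pos[0] - obstacle[0], pos[1] - obstacle[1])
    z = min(sharpness * (d - radius), 700.0)  # clamp so exp() cannot overflow
    return amplitude / (1.0 + math.exp(z))


def total_potential(pos, goal, obstacles, k_att=0.5):
    """Quadratic attraction to the goal plus one repulsive field per obstacle.

    `obstacles` is a list of ((x, y), radius) pairs.
    """
    attract = 0.5 * k_att * ((pos[0] - goal[0]) ** 2 + (pos[1] - goal[1]) ** 2)
    repel = sum(sigmoid_repulsion(pos, center, r) for center, r in obstacles)
    return attract + repel
```

Unlike the classical inverse-distance repulsive term, a sigmoid of this kind stays bounded at the obstacle center, which keeps the field and its gradient finite everywhere.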

Wind farm water area path planning algorithm based on A* and reinforcement learning

2019 5th International Conference on Transportation Information and Safety (ICTIS) ◽ 10.1109/ictis.2019.8883718 ◽ 2019

Author(s): Tianqi Zha ◽ Lei Xie ◽ Jiliang Chang

Keyword(s): Reinforcement Learning ◽ Path Planning ◽ Wind Farm ◽ Water Area ◽ Planning Algorithm ◽ Path Planning Algorithm

Collision-Free Path Planning Method for Robots Based on an Improved Rapidly-Exploring Random Tree Algorithm

Applied Sciences ◽ 10.3390/app10041381 ◽ 2020 ◽ Vol 10 (4) ◽ pp. 1381 ◽ Cited By ~ 4

Author(s): Xinda Wang ◽ Xiao Luo ◽ Baoling Han ◽ Yuhan Chen ◽ Guanhao Liang ◽ ...

Keyword(s): Path Planning ◽ Control Mechanism ◽ Random Tree ◽ High Dimensional ◽ Control Factor ◽ Sampling Area ◽ Planning Algorithm ◽ Search Speed ◽ The Many ◽ Path Planning Algorithm

Sampling-based methods are popular in robot motion planning, especially in high-dimensional spaces. Among the many such methods, the Rapidly-exploring Random Tree (RRT) algorithm has been widely used for multi-degree-of-freedom manipulators and has yielded good results. However, existing RRT planners suffer from low exploration efficiency and slow convergence, and they cannot meet the level of intelligence required in the Industry 4.0 setting. To solve these problems, this paper proposes a general autonomous path planning algorithm with node control (NC-RRT), built on the architecture of the RRT algorithm. First, a method of gradually changing the sampling area is proposed to guide exploration, thereby effectively improving the search speed. In addition, a node control mechanism is introduced to constrain the extended nodes of the tree, which reduces the extension of invalid nodes and extracts boundary nodes (or near-boundary nodes). By changing the value of the node control factor, the random tree is prevented from falling into a so-called “local trap”, and boundary nodes are selected as extended nodes. The proposed algorithm is simulated in different environments. The results show that the algorithm greatly reduces invalid exploration in the configuration space and significantly improves planning efficiency. In addition, because the method uses boundary nodes efficiently, it is better suited to narrow environments than existing RRT algorithms and can effectively improve the success rate of exploration.

Heuristically arrival time field-biased (HeAT) random tree: An online path planning algorithm for mobile robot considering kinodynamic constraints

2011 IEEE International Conference on Robotics and Biomimetics ◽ 10.1109/robio.2011.6181312 ◽ 2011 ◽ Cited By ~ 5

Author(s): Igi Ardiyanto ◽ Jun Miura

Keyword(s): Path Planning ◽ Mobile Robot ◽ Arrival Time ◽ Random Tree ◽ Planning Algorithm ◽ Path Planning Algorithm

Research on optimal path planning algorithm of task-oriented optical remote sensing satellites

10.1117/12.2204736 ◽ 2015

Author(s): Yunhe Liu ◽ Shengli Xu ◽ Fengjing Liu ◽ Jingpeng Yuan

Keyword(s): Remote Sensing ◽ Path Planning ◽ Optimal Path ◽ Optical Remote Sensing ◽ Optimal Path Planning ◽ Planning Algorithm ◽ Path Planning Algorithm ◽ Task Oriented

NPQ-RRT*: An Improved RRT* Approach to Hybrid Path Planning

Complexity ◽ 10.1155/2021/6633878 ◽ 2021 ◽ Vol 2021 ◽ pp. 1-10

Author(s): Zihan Yu ◽ Linying Xiang

Keyword(s): Path Planning ◽ Optimal Path ◽ Research Direction ◽ Random Trees ◽ Target Point ◽ Evaluation Function ◽ Local Planning ◽ Planning Algorithm ◽ Window Approach ◽ Path Planning Algorithm

In recent years, robot path planning has been an active research direction, and multirobot formation has practical application prospects. This article proposes a hybrid path planning algorithm for robot formations. The improved Rapidly Exploring Random Trees algorithm PQ-RRT* with a new distance evaluation function is used as the global planning algorithm to generate the initial global path. The determined parent nodes and child nodes are used as the starting points and target points of the local planning algorithm, respectively. The dynamic window approach is used as the local planning algorithm to avoid dynamic obstacles. At the same time, the algorithm restricts the movement of robots inside the formation to avoid internal collisions. The locally optimal path is selected by an evaluation function that accounts for the possibility of formation collision. As a result, multiple mobile robots can quickly and safely reach the global target point in a complex environment with dynamic and static obstacles. Numerical simulations verify the effectiveness and superiority of the proposed hybrid path planning algorithm.
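The local planning step can be illustrated with a minimal single-robot sketch of the dynamic window approach. It does not reproduce the paper's evaluation function or its formation-collision term; the unicycle model, the weights in `score`, and all function and parameter names are assumptions made for illustration.

```python
import math


def rollout(x, y, theta, v, w, dt=0.1, horizon=1.0):
    """Forward-simulate a unicycle model and return the visited (x, y) points."""
    points = []
    for _ in range(int(round(horizon / dt))):
        theta += w * dt
        x += v * math.cos(theta) * dt
        y += v * math.sin(theta) * dt
        points.append((x, y))
    return points


def linspace(lo, hi, n):
    """n evenly spaced values from lo to hi inclusive (n >= 2)."""
    return [lo + (hi - lo) * k / (n - 1) for k in range(n)]


def choose_velocity(state, goal, obstacles, v_max=1.0, w_max=1.0,
                    a_v=1.0, a_w=2.0, dt=0.1, robot_radius=0.2, samples=5):
    """Pick (v, w) from the dynamic window, or None if every sample collides.

    state = (x, y, theta, v, w); obstacles is a list of (x, y) points.
    """
    x, y, theta, v0, w0 = state
    # Dynamic window: velocities reachable within one control step dt.
    v_lo, v_hi = max(0.0, v0 - a_v * dt), min(v_max, v0 + a_v * dt)
    w_lo, w_hi = max(-w_max, w0 - a_w * dt), min(w_max, w0 + a_w * dt)
    best, best_score = None, -math.inf
    for v in linspace(v_lo, v_hi, samples):
        for w in linspace(w_lo, w_hi, samples):
            points = rollout(x, y, theta, v, w, dt)
            clearance = min((math.hypot(px - ox, py - oy)
                             for px, py in points for ox, oy in obstacles),
                            default=math.inf)
            if clearance <= robot_radius:
                continue  # trajectory would hit an obstacle
            end_x, end_y = points[-1]
            # Assumed weights: goal distance dominates, then clearance and speed.
            score = (-math.hypot(goal[0] - end_x, goal[1] - end_y)
                     + 0.1 * min(clearance, 1.0) + 0.1 * v)
            if score > best_score:
                best, best_score = (v, w), score
    return best
```

In a hybrid planner of the kind described, each consecutive pair of nodes on the global path would supply the start and `goal` for this local step, and the obstacle list would be refreshed from sensor data at every control cycle.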
