Research on Performance Evaluation of Autonomous Underwater Vehicle’ Path Planning in Complex Scenes

2020 ◽

pp. 095965182093708

Author(s):

Zhuo Wang ◽

Shiwei Zhang ◽

Xiaoning Feng ◽

Yancheng Sui

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Value Function ◽

Autonomous Underwater Vehicle ◽

Autonomous Underwater Vehicles ◽

Underwater Vehicle ◽

Learning Efficiency ◽

Environmental Adaptability ◽

Vehicle Path ◽

The Value Function

Environmental adaptability is a persistent problem in path planning for autonomous underwater vehicles. Although reinforcement learning can improve environmental adaptability, multi-behavior coupling slows its convergence, which makes it difficult for an autonomous underwater vehicle to avoid moving obstacles. This article proposes a multi-behavior critic reinforcement learning algorithm for autonomous underwater vehicle path planning. The algorithm addresses the oscillating amplitudes and low learning efficiency in the early stages of training that are common in traditional actor–critic algorithms. Behavior critic reinforcement learning assesses the actor's actions from several perspectives, such as energy saving and security, and combines these assessments into an overall evaluation of the actor. The policy gradient method is used for the actor and the value function method for the critic. The policy gradient and value function methods are approximated by a backpropagation neural network, the parameters of which are updated using gradient descent. The simulation results show that the method can optimize its learning in the environment and improves learning efficiency, meeting the real-time and adaptability requirements of dynamic obstacle avoidance for autonomous underwater vehicles.
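The idea of several behavior critics jointly evaluating one actor can be illustrated with a minimal sketch. It is not the authors' implementation: lookup tables stand in for the backpropagation neural networks, and the class name `MultiCriticAgent`, the critic names `energy` and `safety`, the weights, the learning rates and the toy one-state task are all assumptions made for illustration. Each critic computes a temporal-difference error from its own reward signal, and a weighted sum of those errors drives a softmax policy gradient step.

```python
import math
import random


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


class MultiCriticAgent:
    """Tabular actor with one value table per behavior critic (hypothetical sketch)."""

    def __init__(self, n_states, n_actions, weights, alpha=0.1, beta=0.1, gamma=0.9):
        self.prefs = [[0.0] * n_actions for _ in range(n_states)]   # actor parameters
        self.values = {name: [0.0] * n_states for name in weights}  # one critic per behavior
        self.weights = weights  # assumed fixed weighting of the critics
        self.alpha, self.beta, self.gamma = alpha, beta, gamma

    def policy(self, s):
        return softmax(self.prefs[s])

    def act(self, s, rng=random):
        return rng.choices(range(len(self.prefs[s])), weights=self.policy(s))[0]

    def update(self, s, a, rewards, s_next, done):
        # Each critic forms a TD error from its own reward and value table.
        total_delta = 0.0
        for name, w in self.weights.items():
            v = self.values[name]
            target = rewards[name] + (0.0 if done else self.gamma * v[s_next])
            delta = target - v[s]
            v[s] += self.beta * delta
            total_delta += w * delta  # combine into one overall evaluation
        # Softmax policy gradient: d log pi(a|s) / d pref_b = 1[a == b] - pi(b|s).
        probs = self.policy(s)
        for b in range(len(probs)):
            grad = (1.0 if b == a else 0.0) - probs[b]
            self.prefs[s][b] += self.alpha * total_delta * grad
        return total_delta


def train_demo(seed=0, steps=2000):
    """Toy task: action 0 is safe, action 1 collides; both cost the same energy."""
    rng = random.Random(seed)
    agent = MultiCriticAgent(n_states=1, n_actions=2,
                             weights={"energy": 0.3, "safety": 0.7})
    for _ in range(steps):
        a = agent.act(0, rng)
        rewards = {"energy": -0.1, "safety": 0.0 if a == 0 else -1.0}
        agent.update(0, a, rewards, s_next=0, done=True)
    return agent.policy(0)
```

Calling `train_demo()` returns the learned action probabilities for the single state. In this toy setting the safety critic dominates the combined error, so the policy shifts toward the collision-free action.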


Water wave optimization algorithm for autonomous underwater vehicle path planning problem

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201544 ◽

2021 ◽

pp. 1-15

Author(s):

Zheping Yan ◽

Jinzhong Zhang ◽

Jia Zeng ◽

Jialing Tang

Keyword(s):

Path Planning ◽

Water Wave ◽

Autonomous Underwater Vehicle ◽

Optimal Path ◽

Wave Theory ◽

Underwater Vehicle ◽

Local Optimum ◽

Planning Problem ◽

Vehicle Path ◽

Path Planning Problem

In this paper, a water wave optimization (WWO) algorithm is proposed to solve the autonomous underwater vehicle (AUV) path planning problem and obtain an optimal or near-optimal path in the marine environment. Path planning is a prerequisite for submarine reconnaissance, surveillance, combat and other underwater tasks. The WWO algorithm, based on shallow water wave theory, is a novel evolutionary algorithm that mimics the wave motions of propagation, refraction and breaking to search for the global optimum. The WWO algorithm not only escapes local optima and avoids premature convergence but also has a faster convergence speed and higher calculation accuracy. To verify its effectiveness and feasibility, the WWO algorithm is applied to scenarios with randomly generated threat areas and with fixed threat areas. Compared with other algorithms, the WWO algorithm effectively balances exploration and exploitation to avoid threat areas and reach the intended target with minimum fuel cost. The experimental results demonstrate that the WWO algorithm has better optimization performance and is robust.
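The three wave operators named above (propagation, refraction and breaking) can be sketched as follows. This is a simplified illustration of the commonly described form of WWO, not the paper's implementation. The cost function `path_cost`, the single circular threat area, the parameter values, and the choice to reset the wavelength after refraction instead of rescaling it are all assumptions.

```python
import math
import random


def path_cost(ys, threat=(5.0, 0.0, 1.5)):
    """Length of a path from (0, 0) to (10, 0) through evenly spaced waypoints,
    plus a penalty for waypoints inside one circular threat area (hypothetical).
    Only waypoints are checked against the threat, not the segments between them."""
    n = len(ys)
    pts = [(0.0, 0.0)]
    pts += [(10.0 * (i + 1) / (n + 1), y) for i, y in enumerate(ys)]
    pts += [(10.0, 0.0)]
    length = sum(math.dist(p, q) for p, q in zip(pts, pts[1:]))
    cx, cy, r = threat
    penalty = sum(max(0.0, r - math.dist(p, (cx, cy))) for p in pts)
    return length + 10.0 * penalty


def wwo(cost, dim, lo, hi, n_waves=10, iters=200, h_max=6,
        alpha=1.0026, beta=0.01, seed=0, eps=1e-12):
    """Simplified water wave optimization for minimizing `cost` on [lo, hi]^dim."""
    rng = random.Random(seed)
    span = hi - lo

    def clip(x):
        return [min(hi, max(lo, v)) for v in x]

    waves = []
    for _ in range(n_waves):
        x = [rng.uniform(lo, hi) for _ in range(dim)]
        waves.append({"x": x, "c": cost(x), "h": h_max, "lam": 0.5})
    start = min(waves, key=lambda w: w["c"])
    best_x, best_c = list(start["x"]), start["c"]
    history = [best_c]

    for _ in range(iters):
        for w in waves:
            # Propagation: shift every dimension by up to wavelength * range.
            x_new = clip([v + rng.uniform(-1, 1) * w["lam"] * span for v in w["x"]])
            c_new = cost(x_new)
            if c_new < w["c"]:
                w["x"], w["c"], w["h"] = x_new, c_new, h_max
                if c_new < best_c:
                    best_x, best_c = list(x_new), c_new
                    # Breaking: probe a few single-dimension perturbations of the new best.
                    for d in rng.sample(range(dim), rng.randint(1, min(3, dim))):
                        x_b = list(x_new)
                        x_b[d] = min(hi, max(lo, x_b[d] + rng.gauss(0, 1) * beta * span))
                        c_b = cost(x_b)
                        if c_b < best_c:
                            best_x, best_c = list(x_b), c_b
                            w["x"], w["c"] = x_b, c_b
            else:
                w["h"] -= 1
                if w["h"] == 0:
                    # Refraction: resample between this wave and the best one.
                    w["x"] = clip([rng.gauss((b + v) / 2, abs(b - v) / 2)
                                   for b, v in zip(best_x, w["x"])])
                    # Assumption: wavelength is reset here, not rescaled.
                    w["c"], w["h"], w["lam"] = cost(w["x"]), h_max, 0.5
                    if w["c"] < best_c:
                        best_x, best_c = list(w["x"]), w["c"]
        # Wavelength update: lower-cost waves get shorter wavelengths (finer search).
        c_min = min(w["c"] for w in waves)
        c_max = max(w["c"] for w in waves)
        for w in waves:
            w["lam"] *= alpha ** (-(c_max - w["c"] + eps) / (c_max - c_min + eps))
        history.append(best_c)
    return best_x, best_c, history
```

For example, `wwo(path_cost, dim=5, lo=-5.0, hi=5.0)` returns the best waypoint offsets found, their cost, and the best cost after each iteration. The best cost never increases, because the stored best is replaced only when a cheaper path is found.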
