Hybrid Path Planning of a Quadrotor UAV Based on Q-Learning Algorithm

Author(s):  
Tianze Zhang ◽  
Xin Huo ◽  
Songlin Chen ◽  
Baoqing Yang ◽  
Guojiang Zhang
2012 ◽  
Vol 51 (9) ◽  
pp. 40-46 ◽  
Author(s):  
Pradipta K. Das ◽  
S. C. Mandhata ◽  
H. S. Behera ◽  
S. N. Patro

2016 ◽  
Vol 16 (4) ◽  
pp. 113-125
Author(s):  
Jianxian Cai ◽  
Xiaogang Ruan ◽  
Pengxuan Li

Abstract An autonomous path-planning strategy based on the Skinner operant conditioning principle and the reinforcement learning principle is developed in this paper. Its core components are a tendency cell and a cognitive learning cell, which simulate bionic orientation and asymptotic learning ability, respectively. The cognitive learning cell is designed on the basis of a Boltzmann machine and an improved Q-learning algorithm, and performs operant action learning to approximate the operative part of the robot system. The tendency cell adjusts the network weights by using information entropy to evaluate the effect of each operant action. Simulation experiments on a mobile robot show that the designed strategy enables the robot to plan navigation paths autonomously: the robot learns to select actions on its own according to the bionic orientation behavior, with a fast convergence rate and higher adaptability.
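The abstract gives no equations or code; a minimal sketch of one ingredient it names, tabular Q-learning with Boltzmann (softmax) action selection, might look like the following. The grid size, rewards, and temperature schedule are illustrative assumptions, not the authors' settings.

```python
import numpy as np

# Tabular Q-learning with Boltzmann (softmax) exploration, loosely in the
# spirit of the "cognitive learning cell" described above. The 5x5 grid,
# reward values, and temperature schedule are illustrative assumptions.

rng = np.random.default_rng(0)
GRID = 5
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right
GOAL = (4, 4)
Q = np.zeros((GRID, GRID, len(ACTIONS)))

def boltzmann_action(state, temperature):
    """Sample an action with probability exp(Q/T) / sum exp(Q/T)."""
    q = Q[state] / temperature
    p = np.exp(q - q.max())          # subtract max for numerical stability
    p /= p.sum()
    return rng.choice(len(ACTIONS), p=p)

def step(state, action):
    dr, dc = ACTIONS[action]
    r = min(max(state[0] + dr, 0), GRID - 1)
    c = min(max(state[1] + dc, 0), GRID - 1)
    nxt = (r, c)
    reward = 1.0 if nxt == GOAL else -0.04      # small per-step penalty
    return nxt, reward, nxt == GOAL

alpha, gamma = 0.1, 0.95
for episode in range(500):
    state, done = (0, 0), False
    temperature = max(0.05, 0.99 ** episode)    # slowly cool exploration
    for _ in range(200):                        # cap episode length
        a = boltzmann_action(state, temperature)
        nxt, reward, done = step(state, a)
        td_target = reward + gamma * Q[nxt].max() * (not done)
        Q[state][a] += alpha * (td_target - Q[state][a])
        state = nxt
        if done:
            break
```

Cooling the temperature trades early exploration for late exploitation, which is one common way to obtain the fast convergence the abstract claims.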


2019 ◽  
Vol 9 (15) ◽  
pp. 3057 ◽  
Author(s):  
Hyansu Bae ◽  
Gidong Kim ◽  
Jonguk Kim ◽  
Dianwei Qian ◽  
Sukgyu Lee

This paper proposes a novel multi-robot path planning algorithm that combines deep Q-learning with a convolutional neural network (CNN). In conventional path planning algorithms, robots must search a comparatively wide area for navigation and move in a predesigned formation in a given environment. Each robot in a multi-robot system is required to navigate independently while collaborating with the other robots for efficient performance. In addition, the collaboration scheme depends strongly on the condition of each robot, such as its position and velocity. However, conventional methods do not cope actively with changing situations, since each robot has difficulty recognizing whether a moving robot nearby is an obstacle or a cooperating robot. To compensate for these shortcomings, deep Q-learning is combined with a CNN, which is needed to analyze the situation efficiently: the CNN assesses the current situation from image information about the environment, and the robot navigates based on the situation analyzed through deep Q-learning. Simulation results show that the proposed algorithm produces more flexible and efficient robot movement than conventional methods across various environments.
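The abstract does not specify the network; a minimal PyTorch sketch of the named combination, a CNN encoder producing per-action Q-values trained with a one-step temporal-difference update, might look as follows. The input shape (four stacked 84x84 frames), the five-action space, and all hyperparameters are assumptions.

```python
import torch
import torch.nn as nn

# Minimal CNN-based deep Q-network: a convolutional encoder reads an
# image of the environment and a fully connected head outputs one
# Q-value per discrete action. Shapes and sizes are assumptions.

class CnnDQN(nn.Module):
    def __init__(self, in_channels=4, n_actions=5):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
        )
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 512), nn.ReLU(),   # 7x7 from 84x84 input
            nn.Linear(512, n_actions),
        )

    def forward(self, x):
        return self.head(self.features(x))

# One temporal-difference update on a dummy batch:
net, target = CnnDQN(), CnnDQN()
target.load_state_dict(net.state_dict())
opt = torch.optim.Adam(net.parameters(), lr=1e-4)

obs = torch.rand(8, 4, 84, 84)            # batch of stacked frames
actions = torch.randint(0, 5, (8,))
rewards = torch.rand(8)
next_obs = torch.rand(8, 4, 84, 84)

q = net(obs).gather(1, actions.unsqueeze(1)).squeeze(1)
with torch.no_grad():
    q_next = target(next_obs).max(dim=1).values
loss = nn.functional.mse_loss(q, rewards + 0.99 * q_next)
opt.zero_grad(); loss.backward(); opt.step()
```

A separate target network, as sketched here, is the standard stabilization trick for deep Q-learning; the paper's replay buffer and multi-robot environment are omitted.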


2020 ◽  
Author(s):  
Josias G. Batista ◽  
Felipe J. S. Vasconcelos ◽  
Kaio M. Ramos ◽  
Darielson A. Souza ◽  
José L. N. Silva

Industrial robots have advanced over the years, making production systems ever more efficient and creating a need for trajectory generation algorithms that optimize and, where possible, generate collision-free trajectories without interrupting the production process. This work presents the use of Reinforcement Learning (RL), based on the Q-learning algorithm, for trajectory generation of a robotic manipulator, and compares its use with and without constraints on the manipulator kinematics in order to generate collision-free trajectories. Simulation results are presented regarding the efficiency of the algorithm and its use in trajectory generation, together with a comparison of the computational cost of enforcing the constraints.
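The abstract does not detail how the kinematic constraints enter the learning loop; one common formulation, sketched below under assumptions, is Q-learning over a discretized joint space in which constrained runs simply reject transitions that violate a joint-coupling limit. The 20x20 discretization, obstacle region, and the specific constraint are illustrative.

```python
import numpy as np

# Q-learning over a discretized 2-joint configuration space, run once
# without and once with an assumed kinematic constraint, echoing the
# comparison described above. All numbers here are illustrative.

rng = np.random.default_rng(1)
N = 20                                   # grid points per joint
GOAL = (18, 18)
OBSTACLE = {(r, c) for r in range(8, 12) for c in range(8, 12)}
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]

def valid(q1, q2, constrained):
    if not (0 <= q1 < N and 0 <= q2 < N) or (q1, q2) in OBSTACLE:
        return False
    # Assumed constraint: keep joint 2 within a band around joint 1.
    return not constrained or abs(q1 - q2) <= 6

def train(constrained, episodes=2000, alpha=0.2, gamma=0.95, eps=0.1):
    Q = np.zeros((N, N, len(MOVES)))
    for _ in range(episodes):
        s = (0, 0)
        for _ in range(200):             # cap episode length
            if rng.random() < eps:
                a = rng.integers(len(MOVES))
            else:
                a = int(Q[s].argmax())
            nxt = (s[0] + MOVES[a][0], s[1] + MOVES[a][1])
            if not valid(*nxt, constrained):
                nxt, r = s, -1.0         # penalize invalid transitions
            else:
                r = 10.0 if nxt == GOAL else -0.1
            Q[s][a] += alpha * (r + gamma * Q[nxt].max() - Q[s][a])
            s = nxt
            if s == GOAL:
                break
    return Q

Q_free = train(constrained=False)
Q_constr = train(constrained=True)       # constraint checks add cost
```

Timing the two `train` calls gives a rough analogue of the computational-cost comparison the abstract mentions.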


2021 ◽  
Author(s):  
Xiaowei Guo

Abstract Product assembly is an important stage in complex product manufacturing, and how to plan the assembly process intelligently from dynamic product and environment information has become a pressing issue. To this end, this research constructs a digital twin assembly system comprising modules for virtual-real interactive feedback, data fusion analysis, and iterative decision-making optimization. In the virtual space, a modified Q-learning algorithm is proposed to solve the path planning problem in product assembly. The proposed algorithm accelerates convergence by adding a dynamic reward function, optimizes the initial Q-table by introducing knowledge and experience through a case-based reasoning (CBR) algorithm, and prevents entry into trapped areas through an obstacle-avoidance method. Finally, the six-joint UR10 robot is taken as an example to verify the performance of the algorithm in a three-dimensional pathfinding space. Experimental results show that the modified Q-learning algorithm's pathfinding performance is significantly better than that of the original Q-learning algorithm.
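The abstract names the modifications but not their form; a sketch of two of them under assumed forms follows: a "dynamic" reward built from a distance-based shaping term, and a Q-table warm-started from stored cases as a stand-in for the paper's CBR step. The shaping weight, grid size, and case format are all assumptions.

```python
import numpy as np

# Two assumed-form modifications echoing the ones named above:
# (1) a dynamic reward adding a shaping bonus for moving toward the goal,
# (2) a Q-table seeded from previously solved cases (CBR stand-in).

GRID, GOAL = 10, np.array([9, 9])

def dynamic_reward(state, next_state):
    """Base reward plus a shaping bonus for reducing distance to goal."""
    base = 10.0 if tuple(next_state) == tuple(GOAL) else -0.1
    shaping = (np.linalg.norm(state - GOAL) -
               np.linalg.norm(next_state - GOAL))
    return base + 0.5 * shaping          # 0.5 weight is an assumption

def warm_start_q(cases, n_actions=4):
    """Seed the Q-table from (state, action, value) triples retrieved
    from similar, previously solved planning problems."""
    Q = np.zeros((GRID, GRID, n_actions))
    for (r, c), a, v in cases:
        Q[r, c, a] = v
    return Q

# Hypothetical retrieved cases: along the diagonal, one action was
# previously found valuable, so learning starts from useful estimates.
cases = [((i, i), 1, 5.0) for i in range(GRID - 1)]
Q = warm_start_q(cases)
print(dynamic_reward(np.array([0, 0]), np.array([0, 1])))
```

Warm-starting the table is what lets the modified algorithm skip the long uniform-zero exploration phase that slows plain Q-learning.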


Filomat ◽  
2018 ◽  
Vol 32 (5) ◽  
pp. 1797-1807 ◽  
Author(s):  
Niu Guochen ◽  
Xu Kailu

To realize path planning for a continuum robot inspecting defects in the aircraft fuel tank compartment, an approach based on Q-learning and the Three-segment Method is proposed, which plans a robot posture satisfying both the robot's inherent constraints and the spatial structure constraints. Firstly, a simulation model of the aircraft fuel tank is established, and the workspace is rasterized to reduce computational complexity. Secondly, the Q-learning algorithm is applied to generate a path from the initial point to the target; using the target guidance angle and the Three-segment Method, the joint variables corresponding to each transition point on the path are obtained. Finally, the robot reaches the target by progressively updating the joint variables. Simulation experiments verify the effectiveness and feasibility of the algorithm.
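Once Q-learning has converged on the rasterized workspace, the transition points are read off greedily from the Q-table; a sketch of that extraction step is below. The 10x10 raster and the pre-filled table are assumptions, and the mapping of each transition point to continuum-robot joint variables (the Three-segment Method) is not reproduced.

```python
import numpy as np

# Greedy path extraction from a learned Q-table on a rasterized
# workspace. The dummy table below is an assumption standing in for
# a table trained as described in the abstract.

MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]

def extract_path(Q, start, goal, max_steps=200):
    """Follow argmax-Q actions from start until the goal is reached."""
    path, s = [start], start
    for _ in range(max_steps):
        if s == goal:
            break
        a = int(Q[s].argmax())
        r = min(max(s[0] + MOVES[a][0], 0), Q.shape[0] - 1)
        c = min(max(s[1] + MOVES[a][1], 0), Q.shape[1] - 1)
        s = (r, c)
        path.append(s)
    return path

# Usage with a hand-built table that prefers "down", then "right":
Q = np.zeros((10, 10, len(MOVES)))
Q[..., 1] = 1.0                          # favor moving down
Q[9, :, 3] = 2.0                         # on the last row, move right
print(extract_path(Q, (0, 0), (9, 9)))   # list of transition points
```

Each point in the returned list would then be converted to joint variables and tracked by the continuum robot, as the abstract describes.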

