A Novel Q-Learning Algorithm Based on the Stochastic Environment Path Planning Problem

A Modified Q-Learning Algorithm for Robot Path Planning in a Digital Twin Assembly System

10.21203/rs.3.rs-825772/v1 ◽

2021 ◽

Author(s):

Xiaowei Guoa

Keyword(s):

Path Planning ◽

Learning Algorithm ◽

Three Dimensional ◽

Virtual Space ◽

Assembly System ◽

Case Based Reasoning ◽

Planning Problem ◽

Q Learning ◽

Digital Twin ◽

Product Assembly

Abstract Product assembly is an important stage in complex product manufacturing. How to intelligently plan the assembly process based on dynamic product and environment information has become an pressing issue needs to be addressed. For this reason, this research has constructed a digital twin assembly system, including virtual and real interactive feedback, data fusion analysis and decision-making iterative optimization modules. In the virtual space, a modified Q-learning algorithm is proposed to solve the path planning problem in product assembly. The proposed algorithm speeds up the convergence speed by adding dynamic reward function, optimizes the initial Q table by introducing knowledge and experience through the case-based reasoning (CBR) algorithm, and prevents entry into the trapped area through the obstacle avoiding method. Finally, take the six-joint robot UR10 as an example to verify the performance of the algorithm in the three-dimensional pathfinding space. The experimental results show that the modified Q-learning algorithm's pathfinding performance is significantly better than the original Q-learning algorithm.

Download Full-text

Constrained Path Planning for Unmanned Aerial Vehicle in 3D Terrain Using Modified Multi-Objective Particle Swarm Optimization

Actuators ◽

10.3390/act10100255 ◽

2021 ◽

Vol 10 (10) ◽

pp. 255

Author(s):

Shuang Xia ◽

Xiangyin Zhang

Keyword(s):

Particle Swarm Optimization ◽

Path Planning ◽

Unmanned Aerial Vehicle ◽

Gaussian Distribution ◽

Planning Problem ◽

Swarm Optimization ◽

Q Learning ◽

Multi Objective ◽

Aerial Vehicle ◽

Path Planning Problem

This paper considered the constrained unmanned aerial vehicle (UAV) path planning problem as the multi-objective optimization problem, in which both costs and constraints are treated as the objective functions. A novel multi-objective particle swarm optimization algorithm based on the Gaussian distribution and the Q-Learning technique (GMOPSO-QL) is proposed and applied to determine the feasible and optimal path for UAV. In GMOPSO-QL, the Gaussian distribution based updating operator is adopted to generate new particles, and the exploration and exploitation modes are introduced to enhance population diversity and convergence speed, respectively. Moreover, the Q-Learning based mode selection logic is introduced to balance the global search with the local search in the evolution process. Simulation results indicate that our proposed GMOPSO-QL can deal with the constrained UAV path planning problem and is superior to existing optimization algorithms in terms of efficiency and robustness.

Download Full-text

Autonomous Vehicle Path Planning using Q-Learning

Journal of Physics Conference Series ◽

10.1088/1742-6596/2128/1/012018 ◽

2021 ◽

Vol 2128 (1) ◽

pp. 012018

Author(s):

Mohammed M S Ibrahim ◽

Mostafa Rostom Atia ◽

MW Fakhr

Keyword(s):

Path Planning ◽

Autonomous Vehicle ◽

Optimal Path ◽

Lookup Table ◽

Planning Problem ◽

Training Time ◽

Q Learning ◽

Starting Position ◽

Planning Algorithm ◽

Path Planning Problem

Abstract Path planning is vital in autonomous vehicle technology, from robots to self-driving cars and driverless trucks, it is impossible to navigate without a proper path planning algorithm, various algorithms exist Q-learning being one of them. Q-learning is used extensively in discrete applications as it is effective in finding solutions to these problems. This research investigates the possibility of using Q-learning for solving the local path planning problem with obstacle avoidance. Q-learning is split into two phases, the first being the training phase, and the second being the application phase. During training, Q-learning requires exponentially increasing training time based on the system’s state space. However, when Q-learning is applied it becomes as simple as a lookup table which allows it to run on even the simplest microcontrollers. Two simulations are conducted with varying environments. One to showcase the ability to learn the optimal path, the other to showcase the ability for learning navigation in variable environments. The first simulation was run on a static environment with one obstacle, with enough training episodes, Q-learning could solve the path planning problem with minimal movement steps. The second simulation focuses on a randomized environment, obstacles and the agent’s starting position are randomly chosen at the start of every episode. During testing, Q-learning was able to find a path to the target when a path did exist, as It was possible in certain configurations for the vehicle to be stuck in between obstacles with no feasible path or solution.

Download Full-text

An Optimized Artificial Bee Colony Algorithm for the Shortest Path Planning Problem

CICTP 2018 ◽

10.1061/9780784481523.262 ◽

2018 ◽

Author(s):

Jian Zheng ◽

Zhen Zhang

Keyword(s):

Path Planning ◽

Shortest Path ◽

Artificial Bee Colony Algorithm ◽

Artificial Bee Colony ◽

Planning Problem ◽

Bee Colony ◽

Path Planning Problem

Download Full-text

Reliability oriented multi-AGVs online scheduling and path planning problem of automated sorting warehouse system

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1043/2/022035 ◽

2021 ◽

Vol 1043 (2) ◽

pp. 022035

Author(s):

N N Yu ◽

T K Li ◽

B L Wang ◽

S P Yuan ◽

Y Wang

Keyword(s):

Path Planning ◽

Online Scheduling ◽

Planning Problem ◽

Path Planning Problem

Download Full-text

An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot

International Journal of Computer Applications ◽

10.5120/8073-1468 ◽

2012 ◽

Vol 51 (9) ◽

pp. 40-46 ◽

Cited By ~ 3

Author(s):

Pradipta KDas ◽

S. C. Mandhata ◽

H. S. Behera ◽

S. N. Patro

Keyword(s):

Path Planning ◽

Mobile Robot ◽

Learning Algorithm ◽

Q Learning

Download Full-text

Solving the multi-objective path planning problem in mobile robotics with a firefly-based approach

Soft Computing ◽

10.1007/s00500-015-1825-z ◽

2015 ◽

Vol 21 (4) ◽

pp. 949-964 ◽

Cited By ~ 30

Author(s):

Alejandro Hidalgo-Paniagua ◽

Miguel A. Vega-Rodríguez ◽

Joaquín Ferruz ◽

Nieves Pavón

Keyword(s):

Path Planning ◽

Mobile Robotics ◽

Planning Problem ◽

Multi Objective ◽

Path Planning Problem

Download Full-text

Collision-Free Motion Planning for an Aligned Multiple-turret System Operating in Extreme Environment

Robotica ◽

10.1017/s026357472100076x ◽

2021 ◽

pp. 1-30

Author(s):

Ümit Yerlikaya ◽

R.Tuna Balkan

Keyword(s):

Path Planning ◽

Extreme Environment ◽

Configuration Spaces ◽

Planning Problem ◽

Different Types ◽

Planning Algorithm ◽

Line Path ◽

Path Planning Algorithm ◽

Path Planning Problem ◽

System Operating

Abstract Instead of using the tedious process of manual positioning, an off-line path planning algorithm has been developed for military turrets to improve their accuracy and efficiency. In the scope of this research, an algorithm is proposed to search a path in three different types of configuration spaces which are rectangular-, circular-, and torus-shaped by providing three converging options named as fast, medium, and optimum depending on the application. With the help of the proposed algorithm, 4-dimensional (D) path planning problem was realized as 2-D + 2-D by using six sequences and their options. The results obtained were simulated and no collision was observed between any bodies in these three options.

Download Full-text

Path Planning for Spheres in Three Dimensional Environments With Low Interference Index

20th Design Automation Conference: Volume 1 — Dynamic Mechanical Systems; Geometric Modeling and Features; Concurrent Engineering ◽

10.1115/detc1994-0041 ◽

1994 ◽

Author(s):

Duane W. Storti ◽

Debasish Dutta

Keyword(s):

Path Planning ◽

Free Path ◽

Configuration Space ◽

Local Knowledge ◽

Three Dimensional ◽

Target Point ◽

Planning Problem ◽

Spherical Object ◽

Starting Point ◽

Path Planning Problem

Abstract We consider the path planning problem for a spherical object moving through a three-dimensional environment composed of spherical obstacles. Given a starting point and a terminal or target point, we wish to determine a collision free path from start to target for the moving sphere. We define an interference index to count the number of configuration space obstacles whose surfaces interfere simultaneously. In this paper, we present algorithms for navigating the sphere when the interference index is ≤ 2. While a global calculation is necessary to characterize the environment as a whole, only local knowledge is needed for path construction.

Download Full-text

Hybrid Path Planning of A Quadrotor UAV Based on Q-Learning Algorithm

2018 37th Chinese Control Conference (CCC) ◽

10.23919/chicc.2018.8482604 ◽

2018 ◽

Cited By ~ 1

Author(s):

Tianze Zhang ◽

Xin Huo ◽

Songlin Chen ◽

Baoqing Yang ◽

Guojiang Zhang

Keyword(s):

Path Planning ◽

Learning Algorithm ◽

Q Learning ◽

Quadrotor Uav

Download Full-text