A Novel Q-Learning Algorithm Based on the Stochastic Environment Path Planning Problem

Author(s):  
Jian Li ◽  
Fei Rong ◽  
Yu Tang
2021 ◽  
Author(s):  
Xiaowei Guoa

Abstract Product assembly is an important stage in complex product manufacturing. How to intelligently plan the assembly process based on dynamic product and environment information has become an pressing issue needs to be addressed. For this reason, this research has constructed a digital twin assembly system, including virtual and real interactive feedback, data fusion analysis and decision-making iterative optimization modules. In the virtual space, a modified Q-learning algorithm is proposed to solve the path planning problem in product assembly. The proposed algorithm speeds up the convergence speed by adding dynamic reward function, optimizes the initial Q table by introducing knowledge and experience through the case-based reasoning (CBR) algorithm, and prevents entry into the trapped area through the obstacle avoiding method. Finally, take the six-joint robot UR10 as an example to verify the performance of the algorithm in the three-dimensional pathfinding space. The experimental results show that the modified Q-learning algorithm's pathfinding performance is significantly better than the original Q-learning algorithm.


Actuators ◽  
2021 ◽  
Vol 10 (10) ◽  
pp. 255
Author(s):  
Shuang Xia ◽  
Xiangyin Zhang

This paper considered the constrained unmanned aerial vehicle (UAV) path planning problem as the multi-objective optimization problem, in which both costs and constraints are treated as the objective functions. A novel multi-objective particle swarm optimization algorithm based on the Gaussian distribution and the Q-Learning technique (GMOPSO-QL) is proposed and applied to determine the feasible and optimal path for UAV. In GMOPSO-QL, the Gaussian distribution based updating operator is adopted to generate new particles, and the exploration and exploitation modes are introduced to enhance population diversity and convergence speed, respectively. Moreover, the Q-Learning based mode selection logic is introduced to balance the global search with the local search in the evolution process. Simulation results indicate that our proposed GMOPSO-QL can deal with the constrained UAV path planning problem and is superior to existing optimization algorithms in terms of efficiency and robustness.


2021 ◽  
Vol 2128 (1) ◽  
pp. 012018
Author(s):  
Mohammed M S Ibrahim ◽  
Mostafa Rostom Atia ◽  
MW Fakhr

Abstract Path planning is vital in autonomous vehicle technology, from robots to self-driving cars and driverless trucks, it is impossible to navigate without a proper path planning algorithm, various algorithms exist Q-learning being one of them. Q-learning is used extensively in discrete applications as it is effective in finding solutions to these problems. This research investigates the possibility of using Q-learning for solving the local path planning problem with obstacle avoidance. Q-learning is split into two phases, the first being the training phase, and the second being the application phase. During training, Q-learning requires exponentially increasing training time based on the system’s state space. However, when Q-learning is applied it becomes as simple as a lookup table which allows it to run on even the simplest microcontrollers. Two simulations are conducted with varying environments. One to showcase the ability to learn the optimal path, the other to showcase the ability for learning navigation in variable environments. The first simulation was run on a static environment with one obstacle, with enough training episodes, Q-learning could solve the path planning problem with minimal movement steps. The second simulation focuses on a randomized environment, obstacles and the agent’s starting position are randomly chosen at the start of every episode. During testing, Q-learning was able to find a path to the target when a path did exist, as It was possible in certain configurations for the vehicle to be stuck in between obstacles with no feasible path or solution.


2012 ◽  
Vol 51 (9) ◽  
pp. 40-46 ◽  
Author(s):  
Pradipta KDas ◽  
S. C. Mandhata ◽  
H. S. Behera ◽  
S. N. Patro

2015 ◽  
Vol 21 (4) ◽  
pp. 949-964 ◽  
Author(s):  
Alejandro Hidalgo-Paniagua ◽  
Miguel A. Vega-Rodríguez ◽  
Joaquín Ferruz ◽  
Nieves Pavón

Robotica ◽  
2021 ◽  
pp. 1-30
Author(s):  
Ümit Yerlikaya ◽  
R.Tuna Balkan

Abstract Instead of using the tedious process of manual positioning, an off-line path planning algorithm has been developed for military turrets to improve their accuracy and efficiency. In the scope of this research, an algorithm is proposed to search a path in three different types of configuration spaces which are rectangular-, circular-, and torus-shaped by providing three converging options named as fast, medium, and optimum depending on the application. With the help of the proposed algorithm, 4-dimensional (D) path planning problem was realized as 2-D + 2-D by using six sequences and their options. The results obtained were simulated and no collision was observed between any bodies in these three options.


Author(s):  
Duane W. Storti ◽  
Debasish Dutta

Abstract We consider the path planning problem for a spherical object moving through a three-dimensional environment composed of spherical obstacles. Given a starting point and a terminal or target point, we wish to determine a collision free path from start to target for the moving sphere. We define an interference index to count the number of configuration space obstacles whose surfaces interfere simultaneously. In this paper, we present algorithms for navigating the sphere when the interference index is ≤ 2. While a global calculation is necessary to characterize the environment as a whole, only local knowledge is needed for path construction.


Author(s):  
Tianze Zhang ◽  
Xin Huo ◽  
Songlin Chen ◽  
Baoqing Yang ◽  
Guojiang Zhang

Sign in / Sign up

Export Citation Format

Share Document