Conditional Q-learning algorithm for path-planning of a mobile robot

Author(s): Indrani Goswami, Pradipta Kumar Das, Amit Konar, R. Janarthanan
2012, Vol 51 (9), pp. 40-46
Author(s): Pradipta K. Das, S. C. Mandhata, H. S. Behera, S. N. Patro

2016, Vol 16 (4), pp. 113-125
Author(s): Jianxian Cai, Xiaogang Ruan, Pengxuan Li

Abstract An autonomous path-planning strategy based on Skinner's operant conditioning principle and reinforcement learning is developed in this paper. Its core components are a tendency cell and a cognitive learning cell, which simulate bionic orientation and asymptotic learning ability, respectively. The cognitive learning cell is designed on the basis of a Boltzmann machine and an improved Q-learning algorithm; it carries out operant action learning and approximates the operative part of the robot system. The tendency cell adjusts the network weights, using information entropy to evaluate the effect of each operant action. Simulation experiments on a mobile robot show that the designed strategy enables the robot to carry out autonomous navigation path planning: the robot learns to select actions autonomously according to the bionic orientation behavior, with a fast convergence rate and high adaptability.
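
The abstract does not spell out the algorithm, but the combination it names is well known: Boltzmann (softmax) action selection layered on a one-step Q-learning update. The sketch below illustrates that combination in Python; the grid size, the reward handling, and all parameter values (ALPHA, GAMMA, TAU) are illustrative assumptions, not values from the paper.

    import numpy as np

    # Illustrative sketch only: softmax (Boltzmann) action selection over
    # Q-values plus a standard one-step Q-learning backup. Grid size and
    # all constants are assumptions, not taken from the paper.
    N_STATES, N_ACTIONS = 25, 4        # e.g. a 5x5 grid, 4 move directions
    ALPHA, GAMMA, TAU = 0.1, 0.9, 0.5  # learning rate, discount, temperature

    Q = np.zeros((N_STATES, N_ACTIONS))

    def boltzmann_action(state, tau=TAU):
        """Sample an action with probability proportional to exp(Q/tau)."""
        prefs = Q[state] / tau
        prefs -= prefs.max()                         # numerical stability
        probs = np.exp(prefs) / np.exp(prefs).sum()
        return np.random.choice(N_ACTIONS, p=probs)

    def q_update(s, a, r, s_next):
        """One-step Q-learning update toward r + gamma * max_a' Q(s', a')."""
        Q[s, a] += ALPHA * (r + GAMMA * Q[s_next].max() - Q[s, a])

Lower values of TAU make the selection greedier; the entropy of the softmax distribution is one natural way to realize the information-entropy evaluation the abstract mentions.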


2021, Vol 09 (06), pp. 138-157
Author(s): Guoming Liu, Caihong Li, Tengteng Gao, Yongdi Li, Xiaopei He

2018, Vol 7 (4.27), p. 57
Author(s): Ee Soong Low, Pauline Ong, Cheng Yee Low

In path planning for mobile robots, the classical Q-learning algorithm requires a high iteration count and a long time to converge, because its early stage consists mostly of exploration through random direction decisions. This paper proposes adding a distance criterion to the direction decision making in Q-learning, intended to reduce the time the algorithm takes to converge fully. In addition, random direction decision making is retained and activated when the mobile robot becomes trapped in a local optimum, enabling it to escape. The results show that the improved Q-learning with distance guidance takes longer to converge than the classical Q-learning; however, the total number of steps used is lower.
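
The paper's exact decision rule is not reproduced here; the sketch below shows one plausible reading of it: exploration is biased toward the action that most reduces the straight-line distance to the goal, and a purely random choice takes over when the robot appears trapped. The grid geometry, the revisit-count trap test, and the threshold value are assumptions for illustration.

    import math
    import random
    from collections import defaultdict

    # Illustrative sketch only: distance-guided action choice with a random
    # fallback for escaping local traps. The trap test (frequent revisits)
    # and all constants are assumptions, not the authors' exact formulation.
    ACTIONS = {0: (0, 1), 1: (0, -1), 2: (-1, 0), 3: (1, 0)}  # N, S, W, E

    def choose_action(pos, goal, visit_counts, trap_threshold=4):
        """Prefer the move that gets closest to the goal; go random if trapped."""
        if visit_counts[pos] >= trap_threshold:    # assumed trap indicator
            return random.randrange(len(ACTIONS))
        def next_dist(a):
            dx, dy = ACTIONS[a]
            return math.hypot(goal[0] - (pos[0] + dx), goal[1] - (pos[1] + dy))
        return min(ACTIONS, key=next_dist)

    # Usage: track revisits per cell and pick the next move.
    visits = defaultdict(int)
    pos, goal = (0, 0), (4, 4)
    visits[pos] += 1
    action = choose_action(pos, goal, visits)

One reading of the reported result is that the extra distance computation adds per-step cost (hence the longer convergence time) while the guidance removes wasted random moves (hence the lower total step count).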

