A Novel Hybrid Path Planning Method Based on Q-Learning and Neural Network for Robot Arm

2021 ◽  
Vol 11 (15) ◽  
pp. 6770
Author(s):  
Ali Abdi ◽  
Dibash Adhikari ◽  
Ju Hong Park

Path planning for robot arms to reach a target while avoiding obstacles plays a crucial role in manufacturing automation. Although many path planning algorithms, including RRT, APF, PRM, and RL-based methods, have been presented, they suffer from various problems: time-consuming processing, high computational cost, slowness, non-optimal or irregular paths, failure to find a path, and complexity. Scholars have tried to address some of these issues; however, existing methods still suffer from slowness and complexity. To address these two limitations, this paper presents a new hybrid path planning method composed of two separate parts: action-finding (active approach) and angle-finding (passive approach). In the active phase, the Q-learning algorithm finds a sequence of simple actions (up, down, left, and right) that reaches the target cell in a gridded workspace. In the passive phase, a trained neural network computes the joint angles of the robot arm corresponding to the found actions. Simulation and test results show that this hybrid approach significantly reduces slowness and complexity, owing to the simplified agent-environment interaction in the active phase and the straightforward computation of joint angles in the passive phase.
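The abstract does not include the authors' code, but the active phase it describes is standard tabular Q-learning over a gridded workspace with four primitive actions. Below is a minimal Python sketch of that idea; the grid size, obstacle cells, rewards, and hyperparameters are illustrative assumptions, not values from the paper.

```python
# Sketch of the "active" phase: tabular Q-learning on a gridded workspace
# with four primitive actions. All constants here are assumptions.
import random

ROWS, COLS = 10, 10
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right
GOAL = (9, 9)
OBSTACLES = {(3, 3), (3, 4), (6, 7)}           # hypothetical obstacle cells

ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1
Q = {((r, c), a): 0.0 for r in range(ROWS) for c in range(COLS)
     for a in range(len(ACTIONS))}

def step(state, a):
    """Apply one primitive action; illegal moves keep the current cell."""
    r, c = state
    dr, dc = ACTIONS[a]
    nr, nc = r + dr, c + dc
    if not (0 <= nr < ROWS and 0 <= nc < COLS) or (nr, nc) in OBSTACLES:
        return state, -1.0            # bump penalty for walls and obstacles
    if (nr, nc) == GOAL:
        return (nr, nc), 10.0         # reward for reaching the target cell
    return (nr, nc), -0.1             # small step cost favors short paths

for episode in range(3000):
    state = (0, 0)
    for _ in range(400):                              # cap episode length
        if state == GOAL:
            break
        if random.random() < EPSILON:                 # explore
            a = random.randrange(len(ACTIONS))
        else:                                         # exploit
            a = max(range(len(ACTIONS)), key=lambda x: Q[(state, x)])
        nxt, reward = step(state, a)
        best_next = max(Q[(nxt, x)] for x in range(len(ACTIONS)))
        Q[(state, a)] += ALPHA * (reward + GAMMA * best_next - Q[(state, a)])
        state = nxt
```

In the method described above, the greedy action sequence read out of the learned Q-table would then be handed to the passive phase, where the trained neural network maps the resulting waypoints to joint angles.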

2012 ◽  
Vol 51 (9) ◽  
pp. 40-46 ◽  
Author(s):  
Pradipta K. Das ◽  
S. C. Mandhata ◽  
H. S. Behera ◽  
S. N. Patro

Author(s):  
Tianze Zhang ◽  
Xin Huo ◽  
Songlin Chen ◽  
Baoqing Yang ◽  
Guojiang Zhang

2016 ◽  
Vol 16 (4) ◽  
pp. 113-125
Author(s):  
Jianxian Cai ◽  
Xiaogang Ruan ◽  
Pengxuan Li

Abstract An autonomous path-planning strategy based on the Skinner operant conditioning principle and the reinforcement learning principle is developed in this paper. The core of the strategy is the use of a tendency cell and a cognitive learning cell, which simulate bionic orientation and asymptotic learning ability. The cognitive learning cell is designed on the basis of a Boltzmann machine and an improved Q-learning algorithm, and it executes the operant action learning function to approximate the operative part of the robot system. The tendency cell adjusts network weights using information entropy to evaluate the effect of operant actions. Simulation experiments on a mobile robot showed that the designed strategy enables the robot to achieve autonomous navigation path planning: the robot learns to select actions autonomously according to the bionic orientation mechanism and exhibits a fast convergence rate and high adaptability.
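The cognitive learning cell described above rests on Boltzmann-distributed action selection. As a rough illustration (not the authors' implementation), the following Python snippet samples actions with probability proportional to exp(Q/T); the Q-values and temperature schedule are made up for demonstration.

```python
# Boltzmann (softmax) action selection over Q-values. A high temperature T
# spreads probability mass (exploration); as T decays, selection
# concentrates on the highest-valued action (exploitation).
import math
import random

def boltzmann_select(q_values, temperature):
    """Sample an action index with probability proportional to exp(Q/T)."""
    m = max(q_values)  # subtract the max for numerical stability
    weights = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(weights)
    r, cum = random.random() * total, 0.0
    for action, w in enumerate(weights):
        cum += w
        if r <= cum:
            return action
    return len(weights) - 1

# Demonstration with invented Q-values: count how often each action is
# picked as the temperature anneals.
q = [0.2, 0.5, 0.1, 0.4]
for T in (2.0, 0.5, 0.05):
    counts = [0] * len(q)
    for _ in range(1000):
        counts[boltzmann_select(q, T)] += 1
    print(T, counts)
```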


2018 ◽  
Vol 11 (1) ◽  
pp. 146-157 ◽  
Author(s):  
Akash Dutt Dubey ◽  
Ravi Bhushan Mishra

In this article, we apply cognition to a mobile robot using a Q-learning-based situation-operator model. The Situation-Operator Model (SOM) takes the initial situation of the mobile robot and applies a set of operators to move the robot to the destination. The initial situation is defined by a set of characteristics inferred from the sensor inputs. The SOM comprises a planning and learning module, which uses certain heuristics for learning through the mobile robot, and a knowledge base, which stores the robot's experiences. Control and learning of the robot are performed using Q-learning. A camera and an ultrasonic sensor serve as the sensory inputs for the mobile robot; they define the initial situation, which the learning module then uses to apply a valid operator. The results obtained by the proposed method were compared with those obtained by a reinforcement-based artificial neural network for path planning.
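As a loose illustration of the situation-operator loop described above, the following Python sketch keys a Q-table by (situation, operator) pairs and restricts selection to operators valid in the current situation. The situation features and operator names are hypothetical stand-ins, not the paper's actual sensor-derived situations.

```python
# Situation-operator loop driven by Q-learning: sense a situation, pick a
# valid operator, act, then update Q. Features and operators are invented.
import random
from collections import defaultdict

OPERATORS = ["move_forward", "turn_left", "turn_right"]
ALPHA, GAMMA, EPSILON = 0.2, 0.9, 0.1
Q = defaultdict(float)  # keyed by (situation, operator)

def valid_operators(situation):
    """Only operators applicable in the current situation are considered."""
    obstacle_ahead, _goal_direction = situation
    return ["turn_left", "turn_right"] if obstacle_ahead else OPERATORS

def choose_operator(situation):
    ops = valid_operators(situation)
    if random.random() < EPSILON:
        return random.choice(ops)
    return max(ops, key=lambda op: Q[(situation, op)])

def update(situation, op, reward, next_situation):
    best_next = max(Q[(next_situation, o)]
                    for o in valid_operators(next_situation))
    Q[(situation, op)] += ALPHA * (reward + GAMMA * best_next
                                   - Q[(situation, op)])

# One interaction step; in the paper's setting, the situation would be
# inferred from the camera and ultrasonic sensor rather than hard-coded.
s0 = (True, "left")            # obstacle ahead, goal to the left
op = choose_operator(s0)
update(s0, op, reward=-0.1, next_situation=(False, "left"))
```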


2019 ◽  
Vol 9 (15) ◽  
pp. 3057 ◽  
Author(s):  
Hyansu Bae ◽  
Gidong Kim ◽  
Jonguk Kim ◽  
Dianwei Qian ◽  
Sukgyu Lee

This paper proposes a novel multi-robot path planning algorithm using deep Q-learning combined with a convolutional neural network (CNN). In conventional path planning algorithms, robots need to search a comparatively wide area for navigation and move in a predesigned formation in a given environment. Each robot in a multi-robot system is inherently required to navigate independently while collaborating with other robots for efficient performance. In addition, the robot collaboration scheme highly depends on the condition of each robot, such as its position and velocity. However, conventional methods do not actively cope with variable situations, since each robot has difficulty recognizing whether a moving robot around it is an obstacle or a cooperative robot. To compensate for these shortcomings, we apply deep Q-learning combined with a CNN, which is needed to analyze the situation efficiently. The CNN assesses the exact situation from image information about the environment, and the robot navigates based on the situation analyzed through deep Q-learning. Simulation results show that robots using the proposed algorithm move more flexibly and efficiently than with conventional methods across various environments.
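The core idea, a CNN that maps camera images to per-action Q-values trained with temporal-difference targets, can be sketched briefly. The following PyTorch snippet is a generic deep Q-learning skeleton, not the authors' network; the layer sizes, four-action move set, and 84x84 input resolution are assumptions.

```python
# Generic deep Q-learning skeleton: a small CNN scores each discrete move
# from an image observation, trained on a single TD transition.
import torch
import torch.nn as nn

class CNNQNetwork(nn.Module):
    def __init__(self, n_actions=4, in_channels=3):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=4, stride=2), nn.ReLU(),
        )
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * 9 * 9, 256), nn.ReLU(),  # 9x9 from 84x84 input
            nn.Linear(256, n_actions),
        )

    def forward(self, x):
        return self.head(self.features(x))

net = CNNQNetwork()
target_net = CNNQNetwork()                 # slowly-updated target copy
target_net.load_state_dict(net.state_dict())
opt = torch.optim.Adam(net.parameters(), lr=1e-4)

# One TD step on a single transition (s, a, r, s') with dummy image data.
s = torch.randn(1, 3, 84, 84)              # current camera frame
s_next = torch.randn(1, 3, 84, 84)         # next camera frame
a, r, gamma = 2, 1.0, 0.99

q_sa = net(s)[0, a]                        # Q(s, a) from the online net
with torch.no_grad():
    target = r + gamma * target_net(s_next).max()
loss = nn.functional.mse_loss(q_sa, target)
opt.zero_grad()
loss.backward()
opt.step()
```

In practice a replay buffer and periodic target-network updates would wrap this step; the sketch shows only the update itself.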


2020 ◽  
Author(s):  
Josias G. Batista ◽  
Felipe J. S. Vasconcelos ◽  
Kaio M. Ramos ◽  
Darielson A. Souza ◽  
José L. N. Silva

Industrial robots have grown over the years, making production systems more and more efficient and creating a need for trajectory generation algorithms that optimize and, if possible, generate collision-free trajectories without interrupting the production process. This work presents the use of Reinforcement Learning (RL), based on the Q-learning algorithm, for trajectory generation of a robotic manipulator, together with a comparison of its use with and without constraints on the manipulator kinematics, in order to generate collision-free trajectories. Simulation results are presented with respect to the efficiency of the algorithm and its use in trajectory generation; a comparison of the computational cost of using the constraints is also presented.
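A plausible reading of the constrained-versus-unconstrained comparison is that kinematic constraints act as an action mask on the Q-learning agent's discrete joint increments. The Python sketch below illustrates that interpretation with a hypothetical 2-DOF arm; the joint limits and step size are invented for the example.

```python
# Q-learning over discrete joint-angle increments, with and without
# kinematic constraints. The 2-DOF arm and its limits are assumptions.
import itertools

JOINT_LIMITS = [(-1.57, 1.57), (-2.0, 2.0)]   # radians, per joint
STEP = 0.1                                     # discrete angle increment
# Each action changes each joint by +STEP, -STEP, or 0.
ACTIONS = list(itertools.product((-STEP, 0.0, STEP), repeat=2))

def constrained_actions(angles):
    """Mask increments that would violate joint limits (constrained case)."""
    ok = []
    for a in ACTIONS:
        new = [th + d for th, d in zip(angles, a)]
        if all(lo <= th <= hi
               for th, (lo, hi) in zip(new, JOINT_LIMITS)):
            ok.append(a)
    return ok

def unconstrained_actions(_angles):
    """The unconstrained variant considers every increment combination."""
    return ACTIONS

# The Q-learning loop itself is identical in both variants; only the action
# set handed to the epsilon-greedy selector changes, which is where a
# computational-cost difference between the two cases would arise.
print(len(unconstrained_actions([0.0, 0.0])),   # 9 actions
      len(constrained_actions([1.55, 0.0])))    # fewer near a joint limit
```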

