The optimization of path planning for multi-robot system using Boltzmann Policy based Q-learning algorithm

This paper proposes a noble multi-robot path planning algorithm using Deep q learning combined with CNN (Convolution Neural Network) algorithm. In conventional path planning algorithms, robots need to search a comparatively wide area for navigation and move in a predesigned formation under a given environment. Each robot in the multi-robot system is inherently required to navigate independently with collaborating with other robots for efficient performance. In addition, the robot collaboration scheme is highly depends on the conditions of each robot, such as its position and velocity. However, the conventional method does not actively cope with variable situations since each robot has difficulty to recognize the moving robot around it as an obstacle or a cooperative robot. To compensate for these shortcomings, we apply Deep q learning to strengthen the learning algorithm combined with CNN algorithm, which is needed to analyze the situation efficiently. CNN analyzes the exact situation using image information on its environment and the robot navigates based on the situation analyzed through Deep q learning. The simulation results using the proposed algorithm shows the flexible and efficient movement of the robots comparing with conventional methods under various environments.

Download Full-text

Cloud-Based Multi-Robot Path Planning in Complex and Crowded Environment Using Fuzzy Logic and Online Learning

Information Technology And Control ◽

10.5755/j01.itc.50.2.28234 ◽

2021 ◽

Vol 50 (2) ◽

pp. 357-374

Author(s):

Novak Zagradjanin ◽

Aleksandar Rodic ◽

Dragan Pamucar ◽

Bojan Pavkovic

Keyword(s):

Path Planning ◽

High Efficiency ◽

Fuzzy Inference ◽

Learning Algorithm ◽

Dynamic Environment ◽

Cloud Services ◽

Robot System ◽

Inference System ◽

Crowded Environment ◽

Multi Robot

This paper considers an autonomous cloud-based multi-robot system designed to execute highly repetitive tasksin a dynamic environment such as a modern megastore. Cloud level is intended for performing the most demandingoperations in order to unload the robots that are users of cloud services in this architecture. For path planningon global level D* Lite algorithm is applied, bearing in mind its high efficiency in dynamic environments. In orderto introduce smart cost map for further improvement of path planning in complex and crowded environment, implementationof fuzzy inference system and learning algorithm is proposed. The results indicate the possibility ofapplying a similar concept in different real-world robotics applications, in order to reduce the total paths length,as well as to minimize the risk in path planning related to the human-robot interactions.

Download Full-text

A Novel Q-Learning Algorithm Based on the Stochastic Environment Path Planning Problem

2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) ◽

10.1109/trustcom50675.2020.00270 ◽

2020 ◽

Author(s):

Jian Li ◽

Fei Rong ◽

Yu Tang

Keyword(s):

Path Planning ◽

Learning Algorithm ◽

Planning Problem ◽

Stochastic Environment ◽

Q Learning ◽

Path Planning Problem

Download Full-text

Research Progress on Synergistic Technologies of Agricultural Multi-Robots

Applied Sciences ◽

10.3390/app11041448 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1448

Author(s):

Wenju Mao ◽

Zhijie Liu ◽

Heng Liu ◽

Fuzeng Yang ◽

Meirong Wang

Keyword(s):

Path Planning ◽

Formation Control ◽

Research Progress ◽

Hybrid Architecture ◽

Labor Costs ◽

Robot System ◽

System Architectures ◽

Research Results ◽

Robot Systems ◽

Multi Robot

Multi-robots have shown good application prospects in agricultural production. Studying the synergistic technologies of agricultural multi-robots can not only improve the efficiency of the overall robot system and meet the needs of precision farming but also solve the problems of decreasing effective labor supply and increasing labor costs in agriculture. Therefore, starting from the point of view of an agricultural multiple robot system architectures, this paper reviews the representative research results of five synergistic technologies of agricultural multi-robots in recent years, namely, environment perception, task allocation, path planning, formation control, and communication, and summarizes the technological progress and development characteristics of these five technologies. Finally, because of these development characteristics, it is shown that the trends and research focus for agricultural multi-robots are to optimize the existing technologies and apply them to a variety of agricultural multi-robots, such as building a hybrid architecture of multi-robot systems, SLAM (simultaneous localization and mapping), cooperation learning of robots, hybrid path planning and formation reconstruction. While synergistic technologies of agricultural multi-robots are extremely challenging in production, in combination with previous research results for real agricultural multi-robots and social development demand, we conclude that it is realistic to expect automated multi-robot systems in the future.

Download Full-text

An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot

International Journal of Computer Applications ◽

10.5120/8073-1468 ◽

2012 ◽

Vol 51 (9) ◽

pp. 40-46 ◽

Cited By ~ 3

Author(s):

Pradipta KDas ◽

S. C. Mandhata ◽

H. S. Behera ◽

S. N. Patro

Keyword(s):

Path Planning ◽

Mobile Robot ◽

Learning Algorithm ◽

Q Learning

Download Full-text

Dynamic correlation matrix based multi-Q learning for a multi-robot system

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems ◽

10.1109/iros.2008.4651021 ◽

2008 ◽

Cited By ~ 1

Author(s):

Hongliang Guo ◽

Yan Meng

Keyword(s):

Correlation Matrix ◽

Robot System ◽

Q Learning ◽

Dynamic Correlation ◽

Multi Robot

Download Full-text

Hybrid Path Planning of A Quadrotor UAV Based on Q-Learning Algorithm

2018 37th Chinese Control Conference (CCC) ◽

10.23919/chicc.2018.8482604 ◽

2018 ◽

Cited By ~ 1

Author(s):

Tianze Zhang ◽

Xin Huo ◽

Songlin Chen ◽

Baoqing Yang ◽

Guojiang Zhang

Keyword(s):

Path Planning ◽

Learning Algorithm ◽

Q Learning ◽

Quadrotor Uav

Download Full-text

Autonomous Path Planning Scheme Research for Mobile Robot

Cybernetics and Information Technologies ◽

10.1515/cait-2016-0072 ◽

2016 ◽

Vol 16 (4) ◽

pp. 113-125

Author(s):

Jianxian Cai ◽

Xiaogang Ruan ◽

Pengxuan Li

Keyword(s):

Path Planning ◽

Mobile Robot ◽

Autonomous Navigation ◽

Learning Algorithm ◽

Action Learning ◽

Cognitive Learning ◽

Learning Ability ◽

Q Learning ◽

Planning Strategy ◽

Navigation Path

Abstract An autonomous path-planning strategy based on Skinner operant conditioning principle and reinforcement learning principle is developed in this paper. The core strategies are the use of tendency cell and cognitive learning cell, which simulate bionic orientation and asymptotic learning ability. Cognitive learning cell is designed on the base of Boltzmann machine and improved Q-Learning algorithm, which executes operant action learning function to approximate the operative part of robot system. The tendency cell adjusts network weights by the use of information entropy to evaluate the function of operate action. The results of the simulation experiment in mobile robot showed that the designed autonomous path-planning strategy lets the robot realize autonomous navigation path planning. The robot learns to select autonomously according to the bionic orientate action and have fast convergence rate and higher adaptability.

Download Full-text

Path Planning for a Multi-robot System with Decentralized Control Architecture

New Trends in Robot Control - Studies in Systems, Decision and Control ◽

10.1007/978-981-15-1819-5_12 ◽

2020 ◽

pp. 229-259 ◽

Cited By ~ 2

Author(s):

Fethi Metoui ◽

Boumedyen Boussaid ◽

Mohamed Naceur Abdelkrim

Keyword(s):

Path Planning ◽

Decentralized Control ◽

Control Architecture ◽

Robot System ◽

Multi Robot

Download Full-text

Multi-Robot Q-Learning over Community Perception Network with Homogeneous Delays

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.823.321 ◽

2013 ◽

Vol 823 ◽

pp. 321-325

Author(s):

Lu Jin ◽

Yue Quan Yang ◽

Chun Bo Ni ◽

Zhi Qiang Cao ◽

Yi Fei Kong

Keyword(s):

Robot Learning ◽

Learning Method ◽

Community Perception ◽

Q Value ◽

Robot System ◽

Q Learning ◽

Information Interaction ◽

Community Information ◽

Information Sharing Mechanism ◽

Multi Robot

With the more robots, the information interaction of multi-robot system becomes more sophisticated and important in a community perception network environment. By exploiting and fusing the learning information of robots in a perception community, the community information sharing mechanism is proposed, as well as updating rules of the community Q-value table. Moreover, considering the existence of delays of learning information transmission, an improved Q-learning method based on homogeneous delays is presented to improve the robot learning efficiency over the community perception network. Finally, the test experiments demonstrate the effectiveness of the proposed scheme.

Download Full-text