Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot

One of the critical challenges in deploying cleaning robots is covering the entire area. Current tiling robots for area coverage have fixed forms and are limited to cleaning only certain areas. A reconfigurable system is a creative answer to this optimal coverage problem. A tiling robot aims to achieve complete coverage of the entire area by reconfiguring into different shapes according to the area’s needs. When sequencing navigation, it is essential to have a structure that allows the robot to extend its coverage range while saving energy during navigation. This means that the robot should cover larger areas entirely with the fewest required actions. This paper presents a complete path planning (CPP) approach for hTetran, a polyabolo-tiled robot, based on traveling salesman problem (TSP)-based reinforcement learning optimization. This structure simultaneously produces robot shapes and sequential trajectories whilst maximizing the reward of the trained reinforcement learning (RL) model within the predefined polyabolo-based tileset. To this end, an RL-based TSP model with the proximal policy optimization (PPO) algorithm was trained using the complementary learning computation of the TSP sequencing. The results of the proposed RL-TSP-based CPP for hTetran were compared, in terms of energy and time spent, with conventional tiled hypothetical models that incorporate TSP solved through an evolutionary ant colony optimization (ACO) approach. The CPP demonstrates an ability to generate a Pareto-optimal trajectory that enhances the robot’s navigation inside the real environment, with less energy and time spent than conventional techniques.
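
The TSP framing in this abstract can be illustrated with a minimal sketch. It is not the paper's RL/PPO or ACO solver: it assumes, hypothetically, that each tile is reduced to a 2D center point, that travel cost is Euclidean distance, and that the route is an open path (no return to the start). The names `tour_length`, `nearest_neighbor_order`, `brute_force_order`, and `tile_centers` are illustrative only. A nearest-neighbor heuristic is compared against an exhaustive search, which is feasible only for a handful of tiles.

```python
import itertools
import math


def tour_length(points, order):
    """Total Euclidean length of an open path visiting points in the given order."""
    return sum(math.dist(points[a], points[b]) for a, b in zip(order, order[1:]))


def nearest_neighbor_order(points, start=0):
    """Greedy baseline: always move to the closest unvisited point.

    Ties are broken by the lower index so the result is deterministic.
    """
    unvisited = set(range(len(points))) - {start}
    order = [start]
    while unvisited:
        last = points[order[-1]]
        nxt = min(unvisited, key=lambda i: (math.dist(last, points[i]), i))
        unvisited.remove(nxt)
        order.append(nxt)
    return order


def brute_force_order(points, start=0):
    """Exhaustive search over all visiting orders (small inputs only)."""
    rest = [i for i in range(len(points)) if i != start]
    best = min(
        itertools.permutations(rest),
        key=lambda p: tour_length(points, [start, *p]),
    )
    return [start, *best]


# Hypothetical tile centers; in the paper's setting these would come from the tileset.
tile_centers = [(0, 0), (2, 0), (2, 2), (0, 2), (1, 1)]
greedy = nearest_neighbor_order(tile_centers)
optimal = brute_force_order(tile_centers)
print("greedy :", greedy, round(tour_length(tile_centers, greedy), 3))
print("optimal:", optimal, round(tour_length(tile_centers, optimal), 3))
```

A learned policy or a metaheuristic such as ACO would replace the ordering step; the cost function is the part the compared methods share.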

Reinforcement Learning-Based Complete Area Coverage Path Planning for a Modified hTrihex Robot

Sensors ◽

10.3390/s21041067 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1067 ◽

Cited By ~ 2

Author(s):

Koppaka Ganesh Sai Apuroop ◽

Anh Vu Le ◽

Mohan Rajesh Elara ◽

Bing J. Sheu

Keyword(s):

Reinforcement Learning ◽

Short Term Memory ◽

Learning Algorithm ◽

Area Coverage ◽

Current Paper ◽

Coverage Problem ◽

Entire Area ◽

Maximum Area ◽

Coverage Path Planning ◽

Cleaning Robot

One of the essential attributes of a cleaning robot is the ability to achieve complete area coverage. Current commercial indoor cleaning robots have fixed morphology and are restricted to cleaning only specific areas in a house, so the maximum area coverage they achieve is sub-optimal. Tiling robots are innovative solutions for such a coverage problem. These new kinds of robots can be deployed for cleaning, painting, maintenance, and inspection tasks, which require complete area coverage. A tiling robot’s objective is to cover the entire area by reconfiguring into different shapes as the area requires. In this context, it is vital to have a framework that enables the robot to maximize the area coverage while minimizing energy consumption. That means the robot must cover the maximum area with the fewest shape reconfigurations possible. The current paper proposes a complete area coverage planning module for the modified hTrihex, a honeycomb-shaped tiling robot, based on deep reinforcement learning. This framework simultaneously generates the tiling shapes and the trajectory with minimum overall cost. To this end, a convolutional neural network (CNN) with a long short-term memory (LSTM) layer was trained using the actor-critic with experience replay (ACER) reinforcement learning algorithm. The simulation results obtained from the current implementation were compared against results generated through traditional tiling theory models, including zigzag, spiral, and greedy search schemes. The model presented in the current paper was also compared against other methods in which the problem was treated as a traveling salesman problem (TSP) solved through genetic algorithm (GA) and ant colony optimization (ACO) approaches. The proposed scheme generates a path with a lower cost in less time.
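
As a rough illustration of the zigzag baseline named in this abstract, the sketch below generates a boustrophedon sweep over a rectangular grid and scores it with a simple cost. It makes several assumptions that are not from the paper: the workspace is an obstacle-free square grid rather than a hexagonal tiling, cost is the number of unit moves, and shape reconfiguration is ignored. The names `zigzag_path` and `path_cost` are hypothetical.

```python
def zigzag_path(rows, cols):
    """Visit every cell of a rows x cols grid, sweeping alternate rows in opposite directions."""
    path = []
    for r in range(rows):
        # Even rows go left to right, odd rows go right to left.
        columns = range(cols) if r % 2 == 0 else range(cols - 1, -1, -1)
        path.extend((r, c) for c in columns)
    return path


def path_cost(path):
    """Sum of Manhattan distances between consecutive cells (unit moves)."""
    return sum(
        abs(r1 - r0) + abs(c1 - c0)
        for (r0, c0), (r1, c1) in zip(path, path[1:])
    )


path = zigzag_path(3, 4)
print(path)
print("cells covered:", len(set(path)), "cost:", path_cost(path))
```

On an obstacle-free grid this sweep visits each cell once, so its move count is one less than the number of cells; the learned planner in the abstract is evaluated against baselines of this kind under a richer cost model.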

UAV Coverage Path Planning under Varying Power Constraints using Deep Reinforcement Learning

2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros45743.2020.9340934 ◽

2020 ◽

Author(s):

Mirco Theile ◽

Harald Bayerlein ◽

Richard Nai ◽

David Gesbert ◽

Marco Caccamo

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Coverage Path Planning ◽

Power Constraints

Coverage Path Planning with Proximal Policy Optimization in a Grid-based Environment

2020 59th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc42340.2020.9304030 ◽

2020 ◽

Author(s):

Aleksandr Ianenko ◽

Alexander Artamonov ◽

Georgii Sarapulov ◽

Alexey Safaraleev ◽

Sergey Bogomolov ◽

...

Keyword(s):

Path Planning ◽

Coverage Path Planning ◽

Policy Optimization ◽

Grid Based

Complete coverage path planning and guidance for cleaning robots

ISIE '97 Proceeding of the IEEE International Symposium on Industrial Electronics ◽

10.1109/isie.1997.649051 ◽

2002 ◽

Cited By ~ 57

Author(s):

R.N. De Carvalho ◽

H.A. Vidal ◽

P. Vieira ◽

M.I. Ribeiro

Keyword(s):

Path Planning ◽

Complete Coverage ◽

Coverage Path Planning ◽

Cleaning Robots

Coverage path planning for maritime search and rescue using reinforcement learning

Ocean Engineering ◽

10.1016/j.oceaneng.2021.110098 ◽

2021 ◽

Vol 241 ◽

pp. 110098

Author(s):

Bo Ai ◽

Maoxin Jia ◽

Hanwen Xu ◽

Jiangling Xu ◽

Zhen Wen ◽

...

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Search And Rescue ◽

Coverage Path Planning ◽

Maritime Search And Rescue

Combined coverage path planning for autonomous cleaning robots in unstructured environments

2008 7th World Congress on Intelligent Control and Automation ◽

10.1109/wcica.2008.4594223 ◽

2008 ◽

Cited By ~ 8

Author(s):

Yu Liu ◽

Xiaoyong Lin ◽

Shiqiang Zhu

Keyword(s):

Path Planning ◽

Coverage Path Planning ◽

Unstructured Environments ◽

Cleaning Robots

Proposed Smooth-STC Algorithm for Enhanced Coverage Path Planning Performance in Mobile Robot Applications

Robotics ◽

10.3390/robotics8020044 ◽

2019 ◽

Vol 8 (2) ◽

pp. 44 ◽

Cited By ~ 4

Author(s):

Hai Van Pham ◽

Philip Moore ◽

Dinh Xuan Truong

Keyword(s):

Path Planning ◽

Optimal Path ◽

Forest Monitoring ◽

Planning Problem ◽

Operational Environment ◽

Uncertain Environments ◽

Coverage Path Planning ◽

Agricultural Robots ◽

Cleaning Robots ◽

Path Planning Problem

Robotic path planning is a field of research that is gaining traction because path planning is an important systemic requirement across a broad range of domains. The aim of path planning is to optimise the efficacy of robotic movement in a defined operational environment. For example, robots have been employed in many domains, including cleaning robots (such as vacuum cleaners), automated paint spraying robots, window cleaning robots, forest monitoring robots, and agricultural robots (often driven using satellite and geostationary positional satellite data). Additionally, mobile robotic systems have been utilised in disaster areas and locations hazardous to humans (such as mine clearance in war zones). The coverage path planning problem is that of determining a path that traverses all points in a defined operational environment while avoiding static and dynamic (moving) obstacles. In this paper we present our proposed Smooth-STC model, which aims to identify an optimal path, avoid all obstacles, prevent (or at least minimise) backtracking, and maximise the coverage in any defined operational environment. The experimental results in a simulation show that, in uncertain environments, our proposed Smooth-STC method achieves a near-complete coverage rate and demonstrates improvement when measured against alternative conventional algorithms.
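
The two quantities this abstract emphasises, coverage rate and backtracking, can be made concrete with a small sketch. It is not the Smooth-STC algorithm and does not reproduce the paper's metrics: it assumes a static grid map in which `#` marks an obstacle and `.` a free cell, defines coverage rate as visited free cells over all free cells, and counts a backtrack whenever the path re-enters a cell it has already visited. The function name `coverage_stats` and the example map are hypothetical.

```python
def coverage_stats(grid, path):
    """Return (coverage_rate, revisits) for a path over a grid map.

    grid: list of equal-length strings, '#' = obstacle, '.' = free cell.
    path: list of (row, col) cells in visiting order.
    """
    free = {
        (r, c)
        for r, row in enumerate(grid)
        for c, ch in enumerate(row)
        if ch != "#"
    }
    visited = set()
    revisits = 0
    for cell in path:
        if cell not in free:
            raise ValueError(f"path enters obstacle or leaves map at {cell}")
        if cell in visited:
            revisits += 1  # re-entering a covered cell counts as backtracking
        visited.add(cell)
    rate = len(visited) / len(free) if free else 1.0
    return rate, revisits


grid = [
    "..#",
    "...",
    "#..",
]
path = [(0, 0), (0, 1), (1, 1), (1, 0), (1, 1), (1, 2), (2, 2), (2, 1)]
rate, revisits = coverage_stats(grid, path)
print(f"coverage rate: {rate:.2f}, revisits: {revisits}")
```

In this example the dead-end cell at (1, 0) forces one revisit of (1, 1), which is the kind of backtracking a coverage planner tries to minimise.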
