Reinforcement Learning for Uncooperative Space Objects Smart Imaging Path-Planning

Author(s):  
Andrea Brandonisio ◽  
Michèle Lavagna ◽  
Davide Guzzetti

Abstract Leading space agencies are increasingly investing in the gradual automation of space missions. Autonomous flight operations may be a key enabler for on-orbit servicing, assembly, and manufacturing (OSAM) missions, carrying inherent benefits such as cost and risk reduction. Within the spectrum of proximity operations, this work focuses on autonomous path-planning for the reconstruction of the geometric properties of an uncooperative target. This autonomous navigation problem is known as the active Simultaneous Localization and Mapping (SLAM) problem, and it has been studied extensively in robotics. The active SLAM problem may be formulated as a Partially Observable Markov Decision Process (POMDP). Previous works in astrodynamics have demonstrated that it is possible to use Reinforcement Learning (RL) techniques to teach an agent moving along a pre-determined orbit when to collect measurements to optimize a given mapping goal. In this work, different RL methods are explored to develop an artificial intelligence agent capable of planning sub-optimal paths for autonomous shape reconstruction of an unknown and uncooperative object via imaging. Proximity orbit dynamics are linearized and include orbit eccentricity. The geometry of the target object is rendered as a polyhedron shaped with a triangular mesh. Artificial intelligence agents are created using both the Deep Q-Network (DQN) and the Advantage Actor-Critic (A2C) methods. State-action value functions are approximated using Artificial Neural Networks (ANN) and trained according to RL principles. Training of the RL agent architecture occurs under fixed or random initial environment conditions, and a large database of training tests has been collected. Trained agents show promising performance in achieving extended coverage of the target. Policy learning is demonstrated by showing that RL agents, at minimum, achieve higher mapping performance than agents that behave randomly. Furthermore, RL agents may learn to maneuver the spacecraft to control target lighting conditions as a function of the Sun's location. This work therefore preliminarily demonstrates the applicability of RL to autonomous imaging of an uncooperative space object, setting a baseline for future works.
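
A minimal sketch of the kind of DQN setup the abstract describes: an ANN approximating the state-action value function over a discrete set of candidate maneuvers, trained against a one-step TD target. All dimensions, layer sizes, and hyperparameters below are illustrative assumptions, not the authors' values.

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """ANN approximating the state-action value function Q(s, a).
    state_dim might hold relative position/velocity plus Sun direction
    (an assumption for illustration)."""
    def __init__(self, state_dim: int, n_actions: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, n_actions),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def dqn_td_target(q_target: QNetwork, reward: torch.Tensor,
                  next_state: torch.Tensor, done: torch.Tensor,
                  gamma: float = 0.99) -> torch.Tensor:
    """One-step TD target: r + gamma * max_a' Q_target(s', a')."""
    with torch.no_grad():
        max_next_q = q_target(next_state).max(dim=1).values
    return reward + gamma * (1.0 - done) * max_next_q
```

The reward would encode the mapping goal (e.g., newly imaged mesh faces under favorable lighting), but the abstract does not specify its exact form.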

2021 ◽  
Vol 2138 (1) ◽  
pp. 012011
Author(s):  
Yanwei Zhao ◽  
Yinong Zhang ◽  
Shuying Wang

Abstract Path planning refers to a mobile robot's ability to obtain surrounding environment information and its own state information through onboard sensors, so that it can avoid obstacles and move toward a target point. Deep reinforcement learning, which combines reinforcement learning and deep learning and is mainly used to handle perception and decision-making problems, has become an important research branch in the field of artificial intelligence. This paper first introduces the basics of deep learning and reinforcement learning. Then, the research status of value-function-based and policy-gradient-based deep reinforcement learning algorithms in path planning is described, along with applications of deep reinforcement learning in computer games, video games, and autonomous navigation. Finally, a brief summary and outlook on the algorithms and applications of deep reinforcement learning are given.
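
To make the survey's two algorithm families concrete, here is a minimal sketch contrasting a value-function update (one-step Q-learning) with a policy-gradient loss (REINFORCE). Shapes and hyperparameters are illustrative placeholders, not tied to any specific paper.

```python
import numpy as np
import torch

def q_learning_update(Q: np.ndarray, s: int, a: int, r: float,
                      s_next: int, alpha: float = 0.1,
                      gamma: float = 0.99) -> None:
    """Value-function family: nudge Q(s, a) toward the bootstrapped
    TD target r + gamma * max_a' Q(s', a')."""
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])

def reinforce_loss(log_probs: torch.Tensor,
                   returns: torch.Tensor) -> torch.Tensor:
    """Policy-gradient family: maximize expected return by minimizing
    the negative of sum_t log pi(a_t | s_t) * G_t over one episode."""
    return -(log_probs * returns).sum()
```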


2021 ◽  
Author(s):  
Salvador Ortiz ◽  
Wen Yu

In this paper, sliding mode control is combined with the classical simultaneous localization and mapping (SLAM) method. This combination can overcome the problem of bounded uncertainties in SLAM. With the help of a genetic algorithm, our novel path planning method shows advantages over other popular methods.
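
A minimal sketch of genetic-algorithm path optimization in the spirit of this abstract, assuming a 2D workspace, a fixed start and goal, point obstacles, and a fitness that trades off path length against obstacle clearance. All names and parameters here are hypothetical illustrations, not the paper's method.

```python
import numpy as np

rng = np.random.default_rng(0)
START, GOAL = np.array([0.0, 0.0]), np.array([10.0, 10.0])
OBSTACLES = [np.array([5.0, 5.0])]  # point obstacles for simplicity

def fitness(waypoints: np.ndarray) -> float:
    """Lower is better: path length plus a penalty for points that
    pass within 1 m of an obstacle (assumed clearance threshold)."""
    path = np.vstack([START, waypoints, GOAL])
    length = np.sum(np.linalg.norm(np.diff(path, axis=0), axis=1))
    penalty = sum(np.sum(np.linalg.norm(path - ob, axis=1) < 1.0)
                  for ob in OBSTACLES)
    return length + 10.0 * penalty

def evolve(pop_size=50, n_way=5, generations=100, sigma=0.5):
    """Evolve waypoint sequences: keep the fittest half (selection),
    then perturb copies of them with Gaussian noise (mutation)."""
    pop = rng.uniform(0, 10, size=(pop_size, n_way, 2))
    for _ in range(generations):
        scores = np.array([fitness(ind) for ind in pop])
        parents = pop[np.argsort(scores)[: pop_size // 2]]
        children = parents + rng.normal(0, sigma, parents.shape)
        pop = np.vstack([parents, children])
    return min(pop, key=fitness)

best_path = evolve()
```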


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Shuhuan Wen ◽  
Xiaohan Lv ◽  
Hak Keung Lam ◽  
Shaokang Fan ◽  
Xiao Yuan ◽  
...  

Purpose This paper aims to use the Monodepth method to improve the prediction speed of obstacle identification and proposes a Probability Dueling DQN algorithm to optimize the agent's path, allowing it to reach the destination more quickly than the Dueling DQN algorithm. The path planning algorithm based on Probability Dueling DQN is then combined with FastSLAM to accomplish autonomous navigation and map the environment.

Design/methodology/approach This paper proposes an active simultaneous localization and mapping (SLAM) framework for autonomous navigation in indoor environments with static and dynamic obstacles. It integrates a path planning algorithm with visual SLAM to decrease navigation uncertainty and build an environment map.

Findings The results show that the proposed method outperforms the existing Dueling DQN in reducing navigation uncertainty in real-world indoor environments with varying numbers and shapes of static and dynamic obstacles.

Originality/value This paper proposes a novel active SLAM framework composed of Probability Dueling DQN, an improved path planning algorithm based on Dueling DQN, and FastSLAM. The framework is used with the Monodepth depth-image prediction method, which offers faster prediction speed, to realize autonomous navigation in indoor environments with varying numbers and shapes of static and dynamic obstacles.
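
For context, here is a minimal sketch of the dueling network head that "Dueling DQN" refers to: separate value and advantage streams recombined as Q(s, a) = V(s) + A(s, a) - mean_a A(s, a). The feature extractor and layer sizes are illustrative assumptions; the paper's "Probability" variant is not reproduced here.

```python
import torch
import torch.nn as nn

class DuelingHead(nn.Module):
    """Dueling architecture head on top of shared features."""
    def __init__(self, feat_dim: int, n_actions: int):
        super().__init__()
        self.value = nn.Sequential(nn.Linear(feat_dim, 64), nn.ReLU(),
                                   nn.Linear(64, 1))
        self.advantage = nn.Sequential(nn.Linear(feat_dim, 64), nn.ReLU(),
                                       nn.Linear(64, n_actions))

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        v = self.value(features)                    # (batch, 1)
        a = self.advantage(features)                # (batch, n_actions)
        # Subtract the mean advantage for identifiability of V and A.
        return v + a - a.mean(dim=1, keepdim=True)
```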


Author(s):  
Jiaxuan Fan ◽  
Zhenya Wang ◽  
Jinlei Ren ◽  
Ying Lu ◽  
Yiheng Liu

Author(s):  
Jie Zhong ◽  
Tao Wang ◽  
Lianglun Cheng

Abstract In actual welding scenarios, an effective path planner is needed to find a collision-free path in the configuration space for a welding manipulator surrounded by obstacles. However, state-of-the-art sampling-based planners satisfy only probabilistic completeness, and their computational complexity is sensitive to the state dimension. In this paper, we propose a path planner for welding manipulators based on deep reinforcement learning that solves path planning problems in high-dimensional continuous state and action spaces. Compared with sampling-based methods, it is more robust and less sensitive to the state dimension. In detail, to improve learning efficiency, we introduce an inverse kinematics module to provide prior knowledge, and we design a gain module to avoid locally optimal policies; both are integrated into the training algorithm. To evaluate the proposed planning algorithm across multiple dimensions, we conducted several sets of path planning experiments for welding manipulators. The results show that our method not only improves convergence performance but is also superior in terms of planning optimality and robustness compared with most other planning algorithms.
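
One plausible reading of the inverse-kinematics prior, sketched minimally below: the IK solution for the goal pose supplies a reference joint configuration, and the learned policy contributes a residual around it, with a gain weighting how strongly the prior guides exploration. The IK routine, joint dimension, and blending gain are hypothetical stand-ins, not the paper's actual modules.

```python
import numpy as np

def ik_reference(goal_pose: np.ndarray) -> np.ndarray:
    """Placeholder IK: in practice this would call the manipulator's
    inverse kinematics solver for the goal end-effector pose."""
    return np.zeros(6)  # assumed 6-DoF joint angles, dummy value

def blended_action(policy_residual: np.ndarray,
                   goal_pose: np.ndarray,
                   gain: float = 0.5) -> np.ndarray:
    """Combine the IK prior with the learned residual; `gain` is a
    hypothetical weight on how strongly the prior steers the action."""
    q_ref = ik_reference(goal_pose)
    return gain * q_ref + policy_residual
```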

