Goal-Oriented Navigation with Avoiding Obstacle based on Deep Reinforcement Learning in Continuous Action Space

Author(s):  
Pham Xuan Hien ◽  
Gon-Woo Kim

Author(s):  
Yuntao Han ◽  
Qibin Zhou ◽  
Fuqing Duan

Abstract
The digital curling game is a two-player zero-sum extensive game in a continuous action space. Several challenging problems remain unsolved, such as the uncertainty of strategy, searching the large game tree, and the need for large amounts of supervised data. In this work, we combine Neural Fictitious Self-Play (NFSP) and Kernel Regression UCT (KR-UCT) for digital curling games: NFSP uses two adversarial learning networks and can automatically produce supervised data, while KR-UCT handles the large game tree search in the continuous action space. We propose two reward mechanisms to make reinforcement learning converge quickly. Experimental results validate the proposed method and show that the learned strategy model can reach a Nash equilibrium.
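To make the search step concrete, below is a minimal Python sketch of the KR-UCT selection rule the abstract refers to: visit counts and value estimates are smoothed across nearby continuous actions with a kernel before a UCB score is applied. The Gaussian kernel, the bandwidth, and the `children` node layout are illustrative assumptions for this sketch, not details from the paper.

```python
import math

def gaussian_kernel(a, b, bandwidth=0.1):
    """Similarity between two continuous actions; smooths statistics across neighbors."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2.0 * bandwidth ** 2))

def kr_uct_select(children, c=1.4):
    """Return the child action with the highest kernel-regression UCB score.

    children: list of dicts with keys 'action' (tuple of floats),
    'visits' (int), and 'value' (mean return) -- a hypothetical node
    layout chosen for this illustration.
    """
    densities, values = [], []
    for child in children:
        # Effective visit count: kernel-weighted visits of all sampled actions.
        density = sum(gaussian_kernel(child["action"], o["action"]) * o["visits"]
                      for o in children)
        # Kernel-regression value estimate at this action.
        value = sum(gaussian_kernel(child["action"], o["action"]) * o["visits"] * o["value"]
                    for o in children) / max(density, 1e-9)
        densities.append(density)
        values.append(value)
    total = sum(densities)
    scores = [v + c * math.sqrt(math.log(total) / max(d, 1e-9))
              for v, d in zip(values, densities)]
    return children[max(range(len(children)), key=scores.__getitem__)]

# Toy usage: three candidate shots in a 2-D (angle, velocity) action space.
children = [
    {"action": (0.10, 2.0), "visits": 8, "value": 0.4},
    {"action": (0.12, 2.1), "visits": 3, "value": 0.6},
    {"action": (0.50, 1.5), "visits": 2, "value": 0.1},
]
print(kr_uct_select(children)["action"])
```

The kernel smoothing is what lets the tree search share statistics between nearby shots instead of treating every continuous action as unrelated, which is the point of using KR-UCT over plain UCT here.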


2011 ◽  
Vol 131 (5) ◽  
pp. 976-982
Author(s):  
Masato Nagayoshi ◽  
Hajime Murao ◽  
Hisashi Tamaki

2012 ◽  
Vol 95 (3) ◽  
pp. 37-44
Author(s):  
Masato Nagayoshi ◽  
Hajime Murao ◽  
Hisashi Tamaki

2010 ◽  
Vol 15 (1) ◽  
pp. 97-100
Author(s):  
Masato Nagayoshi ◽  
Hajime Murao ◽  
Hisashi Tamaki

Electronics ◽  
2020 ◽  
Vol 9 (3) ◽  
pp. 411
Author(s):  
Reinis Cimurs ◽  
Jin Han Lee ◽  
Il Hong Suh

In this paper, we propose a goal-oriented obstacle avoidance navigation system based on deep reinforcement learning that uses scene depth information together with the goal position in polar coordinates as state inputs. The control signals for robot motion are output in a continuous action space. We devise a deep deterministic policy gradient (DDPG) network that includes depth-wise separable convolution layers to process the large amount of sequential depth image information. The goal-oriented obstacle avoidance navigation is performed without prior knowledge of the environment or a map. We show that, with the proposed deep reinforcement learning network, a goal-oriented collision avoidance model can be trained end-to-end without manual tuning or supervision by a human operator. We train our model in simulation, and the resulting network transfers directly to other environments. Experiments show the capability of the trained network to navigate safely around obstacles and reach the designated goal positions both in simulation and in the real world. The proposed method exhibits higher reliability than the compared approaches when navigating around obstacles with complex shapes, and it avoids not only static but also dynamic obstacles.
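As a rough illustration of the architecture described above, the following PyTorch sketch builds a DDPG-style actor that encodes a stack of depth frames with depth-wise separable convolutions, concatenates the polar goal coordinates, and emits bounded continuous velocity commands. All layer sizes, frame counts, and names are assumptions made for the sketch, not values taken from the paper.

```python
import torch
import torch.nn as nn

class DepthwiseSeparableConv(nn.Module):
    """Depth-wise conv (one filter per channel) followed by a 1x1 point-wise conv."""
    def __init__(self, in_ch, out_ch, kernel_size=3, stride=2):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size, stride=stride,
                                   padding=kernel_size // 2, groups=in_ch)
        self.pointwise = nn.Conv2d(in_ch, out_ch, 1)

    def forward(self, x):
        return torch.relu(self.pointwise(self.depthwise(x)))

class Actor(nn.Module):
    """Maps stacked depth frames + polar goal (distance, angle) to (linear, angular) velocity."""
    def __init__(self, frames=4, goal_dim=2, action_dim=2):
        super().__init__()
        self.encoder = nn.Sequential(
            DepthwiseSeparableConv(frames, 32),
            DepthwiseSeparableConv(32, 64),
            DepthwiseSeparableConv(64, 64),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
        )
        self.head = nn.Sequential(
            nn.Linear(64 + goal_dim, 128), nn.ReLU(),
            nn.Linear(128, action_dim), nn.Tanh(),  # bounded continuous actions
        )

    def forward(self, depth, goal):
        return self.head(torch.cat([self.encoder(depth), goal], dim=1))

# Usage: a batch with 4 stacked 64x80 depth frames and a polar goal vector.
actor = Actor()
action = actor(torch.randn(1, 4, 64, 80), torch.randn(1, 2))
```

The depth-wise separable block factors a standard convolution into a per-channel spatial filter and a 1x1 channel mixer, which keeps the depth encoder cheap enough to run over sequences of frames.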

