Reinforcement learning versus swarm intelligence for autonomous multi-HAPS coordination

Ogbonnaya Anicho; Philip B. Charlesworth; Gurvinder S. Baicher; Atulya K. Nagar

doi:10.1007/s42452-021-04658-6

Reinforcement learning versus swarm intelligence for autonomous multi-HAPS coordination

SN Applied Sciences ◽

10.1007/s42452-021-04658-6 ◽

2021 ◽

Vol 3 (6) ◽

Author(s):

Ogbonnaya Anicho ◽

Philip B. Charlesworth ◽

Gurvinder S. Baicher ◽

Atulya K. Nagar

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Swarm Intelligence ◽

Performance Indicators ◽

Convergence Rates ◽

Tuning Parameters ◽

Continuous State Space ◽

Continuous State ◽

User Coverage ◽

Better Than

AbstractThis work analyses the performance of Reinforcement Learning (RL) versus Swarm Intelligence (SI) for coordinating multiple unmanned High Altitude Platform Stations (HAPS) for communications area coverage. It builds upon previous work which looked at various elements of both algorithms. The main aim of this paper is to address the continuous state-space challenge within this work by using partitioning to manage the high dimensionality problem. This enabled comparing the performance of the classical cases of both RL and SI establishing a baseline for future comparisons of improved versions. From previous work, SI was observed to perform better across various key performance indicators. However, after tuning parameters and empirically choosing suitable partitioning ratio for the RL state space, it was observed that the SI algorithm still maintained superior coordination capability by achieving higher mean overall user coverage (about 20% better than the RL algorithm), in addition to faster convergence rates. Though the RL technique showed better average peak user coverage, the unpredictable coverage dip was a key weakness, making SI a more suitable algorithm within the context of this work.

Download Full-text

Hyperspace Neighbor Penetration Approach to Dynamic Programming for Model-Based Reinforcement Learning Problems with Slowly Changing Variables in a Continuous State Space

10.1109/iccma53594.2021.00018 ◽

2021 ◽

Author(s):

Vincent Zha ◽

Ivey Chiu

Keyword(s):

Dynamic Programming ◽

Reinforcement Learning ◽

State Space ◽

Learning Problems ◽

Model Based ◽

Continuous State Space ◽

Continuous State

Download Full-text

A State Space Filter for Reinforcement Learning in POMIDPs - Application to a Continuous State Space

2006 SICE-ICASE International Joint Conference ◽

10.1109/sice.2006.315203 ◽

2006 ◽

Cited By ~ 2

Author(s):

Masato Nagayoshi ◽

Hajimne Murao ◽

Hisashi Tamaki

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Continuous State Space ◽

Continuous State ◽

Space Filter

Download Full-text

Application of reinforcement learning with continuous state space to ramp metering in real-world conditions

2012 15th International IEEE Conference on Intelligent Transportation Systems ◽

10.1109/itsc.2012.6338837 ◽

2012 ◽

Cited By ~ 8

Author(s):

Kasra Rezaee ◽

Baher Abdulhai ◽

Hossam Abdelgawad

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Real World ◽

Ramp Metering ◽

Continuous State Space ◽

Continuous State

Download Full-text

Analysis of Reward Functions in Deep Reinforcement Learning for Continuous State Space Control

Journal of KIISE ◽

10.5626/jok.2020.47.1.78 ◽

2020 ◽

Vol 47 (1) ◽

pp. 78-87

Author(s):

MinKu Kang ◽

Kee-Eung Kim

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Continuous State Space ◽

Continuous State ◽

Reward Functions

Download Full-text

Behavior Acquisition on a Mobile Robot Using Reinforcement Learning With Continuous State Space

2019 International Conference on Machine Learning and Cybernetics (ICMLC) ◽

10.1109/icmlc48188.2019.8949181 ◽

2019 ◽

Author(s):

Tomoyuki Arai ◽

Yuichiro Toda ◽

Naoyuki Kubota

Keyword(s):

Reinforcement Learning ◽

Mobile Robot ◽

State Space ◽

Continuous State Space ◽

Continuous State

Download Full-text

A Self-Organized Fuzzy-Neuro Reinforcement Learning System for Continuous State Space for Autonomous Robots

2008 International Conference on Computational Intelligence for Modelling Control & Automation ◽

10.1109/cimca.2008.25 ◽

2008 ◽

Cited By ~ 7

Author(s):

Masanao Obayashi ◽

Takashi Kuremoto ◽

Kunikazu Kobayashi

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Autonomous Robots ◽

Learning System ◽

Self Organized ◽

Continuous State Space ◽

Continuous State

Download Full-text

Pursuit-evasion with Decentralized Robotic Swarm in Continuous State Space and Action Space via Deep Reinforcement Learning

Proceedings of the 12th International Conference on Agents and Artificial Intelligence ◽

10.5220/0008971502260233 ◽

2020 ◽

Author(s):

Gurpreet Singh ◽

Daniel Lofaro ◽

Donald Sofge

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Action Space ◽

Pursuit Evasion ◽

Continuous State Space ◽

Continuous State ◽

Robotic Swarm

Download Full-text

A Study of Continuous Maximum Entropy Deep Inverse Reinforcement Learning

Mathematical Problems in Engineering ◽

10.1155/2019/4834516 ◽

2019 ◽

Vol 2019 ◽

pp. 1-8

Author(s):

Xi-liang Chen ◽

Lei Cao ◽

Zhi-xiong Xu ◽

Jun Lai ◽

Chen-xi Li

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Maximum Entropy ◽

Learning Algorithm ◽

Action Space ◽

Inverse Reinforcement Learning ◽

Reward Function ◽

Continuous State Space ◽

Hot Start ◽

Continuous State

The assumption of IRL is that demonstrations are optimally acting in an environment. In the past, most of the work on IRL needed to calculate optimal policies for different reward functions. However, this requirement is difficult to satisfy in large or continuous state space tasks. Let alone continuous action space. We propose a continuous maximum entropy deep inverse reinforcement learning algorithm for continuous state space and continues action space, which realizes the depth cognition of the environment model by the way of reconstructing the reward function based on the demonstrations, and a hot start mechanism based on demonstrations to make the training process faster and better. We compare this new approach to well-known IRL algorithms using Maximum Entropy IRL, DDPG, hot start DDPG, etc. Empirical results on classical control environments on OpenAI Gym: MountainCarContinues-v0 show that our approach is able to learn policies faster and better.

Download Full-text

Multi-Agent Reinforcement Learning Using Linear Fuzzy Model Applied to Cooperative Mobile Robots

Symmetry ◽

10.3390/sym10100461 ◽

2018 ◽

Vol 10 (10) ◽

pp. 461 ◽

Cited By ~ 7

Author(s):

David Luviano-Cruz ◽

Francesco Garcia-Luna ◽

Luis Pérez-Domínguez ◽

S. Gadi

Keyword(s):

Reinforcement Learning ◽

Mobile Robots ◽

State Space ◽

Fuzzy Model ◽

Discrete State ◽

Continuous State Space ◽

Continuous State ◽

Multi Agent ◽

Successful Approach ◽

Q Function

A multi-agent system (MAS) is suitable for addressing tasks in a variety of domains without any programmed behaviors, which makes it ideal for the problems associated with the mobile robots. Reinforcement learning (RL) is a successful approach used in the MASs to acquire new behaviors; most of these select exact Q-values in small discrete state space and action space. This article presents a joint Q-function linearly fuzzified for a MAS’ continuous state space, which overcomes the dimensionality problem. Also, this article gives a proof for the convergence and existence of the solution proposed by the algorithm presented. This article also discusses the numerical simulations and experimental results that were carried out to validate the proposed algorithm.

Download Full-text