Multirobot coordination with deep reinforcement learning in complex environments

2021 ◽  
pp. 115128
Author(s):  
Di Wang ◽  
Hongbin Deng
2015 ◽  
Vol 25 (3) ◽  
pp. 471-482 ◽  
Author(s):  
Bartłomiej Śnieżyński

In this paper we propose a strategy learning model for autonomous agents based on classification. In the literature, the most commonly used learning method in agent-based systems is reinforcement learning. In our opinion, classification can be considered a good alternative. This type of supervised learning can be used to generate a classifier that allows the agent to choose an appropriate action for execution. Experimental results show that this model can be successfully applied to strategy generation even if rewards are delayed. We compare the efficiency of the proposed model and reinforcement learning using the farmer-pest domain and configurations of varying complexity. In complex environments, supervised learning can improve the performance of agents much faster than reinforcement learning. If an appropriate knowledge representation is used, the learned knowledge may be analyzed by humans, which allows the learning process to be tracked.
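As a rough illustration of the classification-based approach described above, the sketch below trains a decision-tree classifier on state-action pairs collected from successful episodes and then uses it as the agent's policy. The environment interface, feature handling, and hyperparameters are assumptions for illustration, not the paper's farmer-pest implementation.

```python
# A minimal sketch of classification-based strategy learning, assuming a
# generic episodic environment with discrete actions and a delayed reward
# observed only at episode end. The env.reset()/env.step() interface is
# hypothetical, not the paper's farmer-pest domain.
import numpy as np
from sklearn.tree import DecisionTreeClassifier  # trees keep the learned strategy human-readable

def collect_training_data(env, n_episodes, exploratory_policy):
    """Run episodes; keep (state, action) pairs only from episodes whose
    delayed final reward was positive, so the classifier imitates success."""
    states, actions = [], []
    for _ in range(n_episodes):
        episode = []
        s, done = env.reset(), False
        while not done:
            a = exploratory_policy(s)
            episode.append((s, a))
            s, reward, done = env.step(a)
        if reward > 0:                       # delayed reward, seen at episode end
            states.extend(x for x, _ in episode)
            actions.extend(a for _, a in episode)
    return np.array(states), np.array(actions)

def learn_strategy(states, actions):
    """Fit a classifier mapping state features to the action taken in
    successful episodes; at execution time the agent queries it directly."""
    clf = DecisionTreeClassifier(max_depth=5)
    clf.fit(states, actions)
    return clf

def act(clf, state):
    return clf.predict(np.array(state).reshape(1, -1))[0]
```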


Sensors ◽  
2019 ◽  
Vol 19 (18) ◽  
pp. 3837 ◽  
Author(s):  
Junjie Zeng ◽  
Rusheng Ju ◽  
Long Qin ◽  
Yue Hu ◽  
Quanjun Yin ◽  
...  

In this paper, we propose a novel Deep Reinforcement Learning (DRL) algorithm that can navigate non-holonomic robots with continuous control in an unknown dynamic environment with moving obstacles. We call the approach MK-A3C (Memory and Knowledge-based Asynchronous Advantage Actor-Critic) for short. As its first component, MK-A3C builds a GRU-based memory neural network to enhance the robot's capability for temporal reasoning. Robots without it tend to suffer from a lack of rationality in the face of incomplete and noisy estimations in complex environments. Additionally, robots endowed with a degree of memory by MK-A3C can avoid local minima traps by estimating the environmental model. Secondly, MK-A3C combines a domain-knowledge-based reward function with a transfer-learning-based training task architecture, which addresses the policy non-convergence problems caused by sparse rewards. These improvements allow MK-A3C to efficiently navigate robots in unknown dynamic environments and to satisfy kinetic constraints while handling moving objects. Simulation experiments show that, compared with existing methods, MK-A3C achieves successful robotic navigation in unknown and challenging environments by outputting continuous acceleration commands.
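The following is a minimal PyTorch sketch of the memory component: a GRU-based actor-critic that outputs continuous acceleration commands. Layer sizes, the observation dimensionality, and the two-dimensional action are assumptions; the reward shaping and transfer-learning parts of MK-A3C are not reproduced here.

```python
# A hedged sketch of a GRU-based actor-critic in the spirit of MK-A3C.
# All dimensions are illustrative assumptions.
import torch
import torch.nn as nn

class RecurrentActorCritic(nn.Module):
    def __init__(self, obs_dim=36, hidden_dim=128, act_dim=2):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, hidden_dim), nn.ReLU())
        self.gru = nn.GRU(hidden_dim, hidden_dim, batch_first=True)  # memory over past observations
        self.policy_mean = nn.Linear(hidden_dim, act_dim)            # mean of Gaussian acceleration command
        self.policy_logstd = nn.Parameter(torch.zeros(act_dim))      # state-independent exploration noise
        self.value = nn.Linear(hidden_dim, 1)                        # critic head

    def forward(self, obs_seq, h0=None):
        # obs_seq: (batch, time, obs_dim); h0: previous GRU hidden state
        z = self.encoder(obs_seq)
        out, hn = self.gru(z, h0)
        mean = torch.tanh(self.policy_mean(out))        # bounded continuous accelerations
        std = self.policy_logstd.exp().expand_as(mean)
        value = self.value(out).squeeze(-1)
        return torch.distributions.Normal(mean, std), value, hn
```

Keeping the GRU hidden state across steps is what lets the agent reason over incomplete, noisy observations instead of reacting to the current frame alone.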


2019 ◽  
Author(s):  
Momchil S. Tomov ◽  
Eric Schulz ◽  
Samuel J. Gershman

The ability to transfer knowledge across tasks and generalize to novel ones is an important hallmark of human intelligence. Yet little is known about human multi-task reinforcement learning. We study participants' behavior in a novel two-step decision-making task with multiple features and changing reward functions. We compare their behavior to two state-of-the-art algorithms for multi-task reinforcement learning, one that maps previous policies and encountered features to new reward functions and one that approximates value functions across tasks, as well as to standard model-based and model-free algorithms. Across three exploratory experiments and a large preregistered experiment, our results provide strong evidence for a strategy that maps previously learned policies to novel scenarios. These results enrich our understanding of human reinforcement learning in complex environments with changing task demands.
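One of the compared strategies, mapping previously learned policies onto a new reward function, can be illustrated with a generalized-policy-improvement sketch over successor features, shown below. The feature dimensionality and the stored policies are made up for illustration and do not correspond to the experimental task described above.

```python
# A hedged numpy sketch of reusing old policies on a novel reward function,
# assuming rewards are linear in a feature vector (r = phi . w) and that the
# agent has stored successor features of its earlier policies. Illustrative
# only; not the exact algorithm evaluated in the study.
import numpy as np

def gpi_action(successor_features, new_w):
    """successor_features: dict policy_name -> array (n_actions, d) of
    expected discounted feature occupancies under that old policy.
    new_w: reward weights of the novel task, shape (d,).
    Returns the action whose best old-policy value is highest."""
    best_action, best_value = None, -np.inf
    for _, psi in successor_features.items():
        q = psi @ new_w                    # Q-values of the old policy under the new reward
        a = int(np.argmax(q))
        if q[a] > best_value:
            best_action, best_value = a, q[a]
    return best_action

# Example: two previously learned policies, three actions, two reward features.
psis = {
    "policy_A": np.array([[1.0, 0.2], [0.5, 0.5], [0.1, 0.9]]),
    "policy_B": np.array([[0.3, 0.3], [0.9, 0.1], [0.2, 0.8]]),
}
print(gpi_action(psis, new_w=np.array([0.0, 1.0])))  # new task rewards feature 2 -> action 2
```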


2020 ◽  
Author(s):  
Philipp Weidel ◽  
Renato Duarte ◽  
Abigail Morrison

Reinforcement learning is a learning paradigm that can account for how organisms learn to adapt their behavior in complex environments with sparse rewards. However, implementations in spiking neuronal networks typically rely on input architectures involving place cells or receptive fields. This is problematic, as such approaches either scale badly as the environment grows in size or complexity, or presuppose knowledge of how the environment should be partitioned. Here, we propose a learning architecture that combines unsupervised learning on the input projections with clustered connectivity within the representation layer. This combination allows input features to be mapped to clusters; the network thus self-organizes to produce task-relevant activity patterns that can serve as the basis for reinforcement learning on the output projections. On the basis of the MNIST and Mountain Car tasks, we show that our proposed model performs better than either a comparable unclustered network or a clustered network with static input projections. We conclude that the combination of unsupervised learning and clustered connectivity provides a generic representational substrate suitable for further computation.
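A simplified, non-spiking sketch of the two ingredients, unsupervised learning on the input projections and clustered connectivity within the representation layer, is given below. All sizes, the competitive-Hebbian learning rule, and the rate-based formulation are assumptions made for illustration; the model described above uses spiking neurons.

```python
# A rate-based numpy sketch: plastic input projections adapted by a simple
# competitive Hebbian rule, plus fixed within-cluster excitatory connectivity
# in the representation layer. Sizes and learning rate are assumptions.
import numpy as np

rng = np.random.default_rng(0)
n_in, n_rep, n_clusters = 784, 100, 10                    # e.g. MNIST-sized input
W_in = rng.normal(scale=0.1, size=(n_rep, n_in))          # plastic input projections
cluster_id = np.repeat(np.arange(n_clusters), n_rep // n_clusters)
W_rec = 0.5 * (cluster_id[:, None] == cluster_id[None, :]).astype(float)  # within-cluster excitation

def represent(x):
    """Feedforward drive plus one step of within-cluster amplification."""
    h = np.maximum(W_in @ x, 0.0)
    return np.maximum(h + W_rec @ h, 0.0)

def unsupervised_update(x, lr=0.01):
    """Competitive Hebbian step: the most active unit moves its weights
    toward the current input, so clusters become tuned to input features."""
    h = represent(x)
    winner = np.argmax(h)
    W_in[winner] += lr * (x - W_in[winner])

def cluster_activity(x):
    """Cluster-wise activity pattern; this is what a downstream
    reinforcement-learning readout would be trained on."""
    h = represent(x)
    return np.array([h[cluster_id == c].sum() for c in range(n_clusters)])
```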


2021 ◽  
Vol 2021 ◽  
pp. 1-15
Author(s):  
Xiaogang Ruan ◽  
Peng Li ◽  
Xiaoqing Zhu ◽  
Hejie Yu ◽  
Naigong Yu

Developing artificial intelligence (AI) agents that explore efficiently in visually rich and complex environments is challenging. In this study, we formulate exploration as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. The intrinsic motivation is driven by curiosity and is calculated from episode memory. To generate it, we combine a count-based method with temporal distance, computing both synchronously. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn exploration ability from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on the exploration policy.
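A hedged sketch of such an intrinsic reward, combining a count-based bonus with an episodic temporal-distance term, is given below. The observation hashing, the mixing weight, and the temporal-distance proxy are illustrative choices rather than the exact formulation used in the study.

```python
# A minimal sketch of an intrinsic-motivation signal that mixes a count-based
# novelty bonus with an episodic-memory term based on temporal distance.
# Observations are assumed to be small numpy arrays; all constants are
# illustrative assumptions.
import numpy as np

class EpisodicCuriosity:
    def __init__(self, beta=0.5, novelty_threshold=5):
        self.counts = {}                 # visitation counts over hashed observations (lifetime)
        self.memory = []                 # this episode's memory of (hashed obs, time step)
        self.beta = beta                 # mixing weight between the two bonuses
        self.novelty_threshold = novelty_threshold

    def reset_episode(self):
        self.memory = []

    def intrinsic_reward(self, obs, t):
        key = hash(obs.tobytes())
        # Count-based term: rarely visited observations earn a larger bonus.
        self.counts[key] = self.counts.get(key, 0) + 1
        count_bonus = 1.0 / np.sqrt(self.counts[key])
        # Episodic term (simplified proxy): bonus if the observation is far,
        # in elapsed time steps, from its last occurrence in this episode.
        last_seen = [t_m for k_m, t_m in self.memory if k_m == key]
        far_in_time = not last_seen or (t - max(last_seen)) > self.novelty_threshold
        episodic_bonus = 1.0 if far_in_time else 0.0
        self.memory.append((key, t))
        return self.beta * count_bonus + (1.0 - self.beta) * episodic_bonus
```

The intrinsic reward would simply be added to the (sparse or absent) extrinsic reward at each step before the reinforcement-learning update.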


2021 ◽  
Vol 15 ◽  
Author(s):  
Philipp Weidel ◽  
Renato Duarte ◽  
Abigail Morrison

Reinforcement learning is a paradigm that can account for how organisms learn to adapt their behavior in complex environments with sparse rewards. To partition an environment into discrete states, implementations in spiking neuronal networks typically rely on input architectures involving place cells or receptive fields specified ad hoc by the researcher. This is problematic as a model for how an organism can learn appropriate behavioral sequences in unknown environments, as it fails to account for the unsupervised and self-organized nature of the required representations. Additionally, this approach presupposes knowledge on the part of the researcher on how the environment should be partitioned and represented and scales poorly with the size or complexity of the environment. To address these issues and gain insights into how the brain generates its own task-relevant mappings, we propose a learning architecture that combines unsupervised learning on the input projections with biologically motivated clustered connectivity within the representation layer. This combination allows input features to be mapped to clusters; thus the network self-organizes to produce clearly distinguishable activity patterns that can serve as the basis for reinforcement learning on the output projections. On the basis of the MNIST and Mountain Car tasks, we show that our proposed model performs better than either a comparable unclustered network or a clustered network with static input projections. We conclude that the combination of unsupervised learning and clustered connectivity provides a generic representational substrate suitable for further computation.
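Complementing the representation-layer sketch given earlier, the snippet below illustrates the downstream step: reinforcement learning on the output projections, using the cluster-wise activity pattern as the state (cf. the Mountain Car task). The softmax readout and REINFORCE-style update are illustrative stand-ins for the spiking implementation described above.

```python
# A hedged sketch of a reward-modulated readout trained on cluster activity.
# Action set (Mountain Car: push left / none / right), sizes, and the
# learning rate are assumptions for illustration.
import numpy as np

rng = np.random.default_rng(1)
n_clusters, n_actions = 10, 3
W_out = rng.normal(scale=0.1, size=(n_actions, n_clusters))   # plastic output projections

def act(cluster_activity):
    """Sample an action from a softmax over the readout of cluster activity."""
    logits = W_out @ cluster_activity
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return rng.choice(n_actions, p=probs), probs

def reinforce_update(episode, episode_return, lr=0.01):
    """episode: list of (cluster_activity, action, probs).
    Reward-modulated update: scale the log-probability gradient of the taken
    actions by the episode return (REINFORCE without baseline)."""
    global W_out
    for s, a, probs in episode:
        grad = -np.outer(probs, s)       # d log pi / d W_out for a softmax readout
        grad[a] += s
        W_out += lr * episode_return * grad
```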

