SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning

Author(s):  
Jiaqi Xu ◽  
Bin Li ◽  
Bo Lu ◽  
Yun-Hui Liu ◽  
Qi Dou ◽  
...

AI Magazine ◽  
2013 ◽  
Vol 34 (3) ◽  
pp. 89-92 ◽  
Author(s):  
Julian Togelius ◽  
Noor Shaker ◽  
Sergey Karakovskiy ◽  
Georgios N. Yannakakis

We give a brief overview of the Mario AI Championship, a series of competitions based on an open-source clone of the seminal platform game Super Mario Bros. The competition has four tracks. The Gameplay and Learning tracks resemble traditional reinforcement learning competitions, the Level Generation track focuses on the generation of entertaining game levels, and the Turing Test track focuses on humanlike game-playing behavior. We also outline some lessons learned from the competition and discuss its future. The article is written by the four organizers of the competition.


2012 ◽  
Vol 151 ◽  
pp. 498-502
Author(s):  
Jin Xue Zhang ◽  
Hai Zhu Pan

This paper is concerned with Q-learning, a very popular reinforcement learning algorithm, applied to obstacle avoidance through neural networks. The guiding principle is that the design of a robot must always focus on its ecological niche, its tasks, and its behaviours. Many robot systems have used behavior-based architectures since the 1980s. In this paper, the Khepera robot is trained with the proposed neural-network Q-learning algorithm for the task of obstacle avoidance. Experiments with real and simulated robots show that the neural-network approach enables Q-learning to handle changes in the environment.
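For readers unfamiliar with the combination, the sketch below illustrates the general idea of one-step Q-learning with a small neural-network Q-function on a toy obstacle-avoidance task. It is not the paper's code: the eight-sensor state, the turn-left/straight/turn-right actions, the reward, and the network sizes are illustrative assumptions.

```python
# Minimal sketch (not the paper's implementation): one-step Q-learning with a
# small neural-network Q-function for a toy obstacle-avoidance task.
import numpy as np

rng = np.random.default_rng(0)

N_SENSORS, N_ACTIONS, N_HIDDEN = 8, 3, 16   # 8 range sensors; turn left / go straight / turn right
ALPHA, GAMMA, EPSILON = 0.01, 0.9, 0.1

# Two-layer network: sensors -> hidden (tanh) -> one Q-value per action.
W1 = rng.normal(0, 0.1, (N_HIDDEN, N_SENSORS))
W2 = rng.normal(0, 0.1, (N_ACTIONS, N_HIDDEN))

def q_values(s):
    h = np.tanh(W1 @ s)
    return W2 @ h, h

def step(s, a):
    """Hypothetical environment: turning (a=0 left, a=2 right) rotates the
    obstacle pattern across the sensor ring; readings drift with noise."""
    s_next = np.roll(s, a - 1)
    s_next = np.clip(s_next + rng.normal(0, 0.05, N_SENSORS), 0.0, 1.0)
    reward = 1.0 - s_next[3:5].mean()        # penalise obstacles straight ahead
    return s_next, reward

s = rng.uniform(0, 1, N_SENSORS)
for t in range(10_000):
    q, h = q_values(s)
    a = rng.integers(N_ACTIONS) if rng.random() < EPSILON else int(np.argmax(q))
    s_next, r = step(s, a)
    target = r + GAMMA * np.max(q_values(s_next)[0])
    td_error = target - q[a]
    # Gradients of Q(s, a) w.r.t. the weights (backprop through the tanh layer).
    grad_W2a = h
    grad_W1 = np.outer(W2[a] * (1 - h ** 2), s)
    W2[a] += ALPHA * td_error * grad_W2a
    W1 += ALPHA * td_error * grad_W1
    s = s_next
```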


atp magazin ◽  
2020 ◽  
Vol 62 (11-12) ◽  
pp. 50-57
Author(s):  
Arne Wahrburg ◽  
Kim Listmann ◽  
Nima Enayati ◽  
René Kirsten

The topic of robot learning is currently receiving considerable attention in academia, in particular the application of reinforcement learning in robotics. From an industrial perspective, the rapid progress in robot learning promises shorter commissioning times, simplified programming, higher productivity, and cost reductions. This article examines the technology's potential for industrial applications. Key challenges are highlighted, and possible approaches are discussed for driving results from the field of robot learning toward industrial applicability.


Author(s):  
John Aslanides ◽  
Jan Leike ◽  
Marcus Hutter

Many state-of-the-art reinforcement learning (RL) algorithms assume that the environment is an ergodic Markov Decision Process (MDP). In contrast, the field of universal reinforcement learning (URL) is concerned with algorithms that make as few assumptions as possible about the environment. The universal Bayesian agent AIXI and a family of related URL algorithms have been developed in this setting. While numerous theoretical optimality results have been proven for these agents, there has been no empirical investigation of their behavior to date. We present a short and accessible survey of these URL algorithms under a unified notation and framework, along with experiments that qualitatively illustrate properties of the resulting policies and their relative performance on partially observable gridworld environments. We also present an open-source reference implementation of the algorithms which we hope will facilitate further understanding of, and experimentation with, these ideas.
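As a pocket illustration of the Bayesian-mixture idea underlying AIXI-style URL agents (and not the authors' reference implementation), the sketch below keeps a posterior over a hypothetical class of two-armed bandit models and acts greedily on the mixture; real URL agents plan over multi-step futures, for example with expectimax search.

```python
# Minimal sketch of a Bayesian mixture agent. The model class (three candidate
# two-armed bandits) and the greedy one-step action choice are simplifying
# assumptions for illustration only.
import numpy as np

rng = np.random.default_rng(1)

# Model class M: each model nu says "arm a pays off with probability p[a]".
MODEL_CLASS = [np.array([0.2, 0.8]), np.array([0.8, 0.2]), np.array([0.5, 0.5])]
weights = np.ones(len(MODEL_CLASS)) / len(MODEL_CLASS)   # prior w_nu

TRUE_MODEL = MODEL_CLASS[0]                               # unknown to the agent

for t in range(200):
    # Expected payoff of each arm under the mixture xi = sum_nu w_nu * nu.
    mixture = sum(w * p for w, p in zip(weights, MODEL_CLASS))
    action = int(np.argmax(mixture))

    # Percept (reward) drawn from the true environment.
    reward = float(rng.random() < TRUE_MODEL[action])

    # Bayes update: w_nu <- w_nu * nu(percept | action), then normalise.
    likelihoods = np.array([p[action] if reward else 1 - p[action] for p in MODEL_CLASS])
    weights = weights * likelihoods
    weights /= weights.sum()

print("posterior over model class:", np.round(weights, 3))
```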


2002 ◽  
Vol 1 (1) ◽  
pp. 93-100
Author(s):  
Zhou Changjiu ◽  
Meng Qingchun ◽  
Guo Zhongwen ◽  
Qu Wiefen ◽  
Yin Bo

2020 ◽  
Vol 17 (3) ◽  
pp. 172988142091374
Author(s):  
Alexander Fabisch ◽  
Malte Langosz ◽  
Frank Kirchner

Reinforcement learning and behavior optimization are becoming increasingly popular in robotics because the algorithms are now mature enough to tackle real problems in this domain. Robust implementations of state-of-the-art algorithms are often not publicly available, however, and experiments are hard to reproduce because open-source code is either missing or still at the stage of research prototypes. Consequently, it is often infeasible to deploy these algorithms on robotic systems. BOLeRo closes this gap for policy search and evolutionary algorithms by delivering open-source implementations of behavior learning algorithms for robots. It is easy to integrate into robotic middlewares, and it can be used to compare methods and develop prototypes in simulation.
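The sketch below shows the kind of episodic behavior-learning loop that such a library organizes: an optimizer proposes behavior parameters, the behavior is rolled out in the environment, and the episode return is fed back. The class names, the toy reaching objective, and the (1+1) evolution strategy are hypothetical stand-ins, not BOLeRo's actual API.

```python
# Generic policy-search loop sketch (hypothetical names, not BOLeRo's API).
import numpy as np

rng = np.random.default_rng(2)

class LinearBehavior:
    """Maps a 2-D state to a 2-D action with a 2x2 weight matrix (4 parameters)."""
    def set_params(self, params):
        self.W = params.reshape(2, 2)
    def act(self, state):
        return self.W @ state

def rollout(behavior, goal=np.array([1.0, -0.5]), steps=50):
    """Toy episodic environment: accumulate negative distance to a goal point."""
    state, ret = np.zeros(2), 0.0
    for _ in range(steps):
        state = state + 0.1 * behavior.act(goal - state)
        ret -= np.linalg.norm(goal - state)
    return ret

# Simple (1+1) evolution strategy as a stand-in for the library's optimizers.
behavior, best_params, best_ret = LinearBehavior(), rng.normal(0, 0.1, 4), -np.inf
for episode in range(300):
    candidate = best_params + rng.normal(0, 0.05, 4)
    behavior.set_params(candidate)
    ret = rollout(behavior)
    if ret > best_ret:
        best_params, best_ret = candidate, ret

print("best episode return:", round(best_ret, 2))
```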


2019 ◽  
Vol 150 ◽  
pp. 162-170 ◽  
Author(s):  
Armando Plasencia ◽  
Yulia Shichkina ◽  
Ileana Suárez ◽  
Zoila Ruiz
