Hierarchical Gait Generation for Modular Robots Using Deep Reinforcement Learning

Designing novel robots that can cope with a specific task is a challenging problem because of the enormous design space that involves both morphological structures and control mechanisms. To this end, we present a computational method for automating the design of modular robots. Our method employs a genetic algorithm to evolve robotic structures as an outer optimization, and it applies a reinforcement learning algorithm to each candidate structure to train its behavior and evaluate its potential learning ability as an inner optimization. The size of the design space is reduced significantly by evolving only the robotic structure and by performing behavioral optimization using a separate training algorithm compared to that when both the structure and behavior are evolved simultaneously. Mutual dependence between evolution and learning is achieved by regarding the mean cumulative rewards of a candidate structure in the reinforcement learning as its fitness in the genetic algorithm. Therefore, our method searches for prospective robotic structures that can potentially lead to near-optimal behaviors if trained sufficiently. We demonstrate the usefulness of our method through several effective design results that were automatically generated in the process of experimenting with actual modular robotics kit.

Download Full-text

Automated Gait Generation for Simulated Bodies Using Deep Reinforcement Learning

2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT) ◽

10.1109/icicct.2018.8473310 ◽

2018 ◽

Author(s):

Abhishek Ananthakrishnan ◽

Vatsal Kanakiva ◽

Dipen Ved ◽

Grishma Sharma

Keyword(s):

Reinforcement Learning ◽

Gait Generation

Download Full-text

Autonomous Distributed System for Gait Generation for Single-Legged Modular Robots Connected in Various Configurations

IEEE Transactions on Robotics ◽

10.1109/tro.2020.2992983 ◽

2020 ◽

Vol 36 (5) ◽

pp. 1491-1510 ◽

Cited By ~ 1

Author(s):

Tomohiro Hayakawa ◽

Tomoya Kamimura ◽

Shizuo Kaji ◽

Fumitoshi Matsuno

Keyword(s):

Distributed System ◽

Modular Robots ◽

Gait Generation

Download Full-text

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

The International Journal of Robotics Research ◽

10.1177/0278364907084983 ◽

2008 ◽

Vol 27 (3-4) ◽

pp. 505-526 ◽

Cited By ~ 24

Author(s):

Paulina Varshavskaya ◽

Leslie Pack Kaelbling ◽

Daniela Rus

Keyword(s):

Reinforcement Learning ◽

Automated Design ◽

Modular Robots ◽

Adaptive Controllers

Download Full-text

Reinforcement Learning for Agents with Many Sensors and Actuators Acting in Categorizable Environments

Journal of Artificial Intelligence Research ◽

10.1613/jair.1437 ◽

2005 ◽

Vol 23 ◽

pp. 79-122 ◽

Cited By ~ 6

Author(s):

J. M. Porta ◽

E. Celaya

Keyword(s):

Reinforcement Learning ◽

Autonomous Robots ◽

Learning Algorithms ◽

Learning Systems ◽

Learning Approaches ◽

Legged Robot ◽

Sensors And Actuators ◽

Gait Generation ◽

Learning Time ◽

Motor Commands

In this paper, we confront the problem of applying reinforcement learning to agents that perceive the environment through many sensors and that can perform parallel actions using many actuators as is the case in complex autonomous robots. We argue that reinforcement learning can only be successfully applied to this case if strong assumptions are made on the characteristics of the environment in which the learning is performed, so that the relevant sensor readings and motor commands can be readily identified. The introduction of such assumptions leads to strongly-biased learning systems that can eventually lose the generality of traditional reinforcement-learning algorithms. In this line, we observe that, in realistic situations, the reward received by the robot depends only on a reduced subset of all the executed actions and that only a reduced subset of the sensor inputs (possibly different in each situation and for each action) are relevant to predict the reward. We formalize this property in the so called 'categorizability assumption' and we present an algorithm that takes advantage of the categorizability of the environment, allowing a decrease in the learning time with respect to existing reinforcement-learning algorithms. Results of the application of the algorithm to a couple of simulated realistic-robotic problems (landmark-based navigation and the six-legged robot gait generation) are reported to validate our approach and to compare it to existing flat and generalization-based reinforcement-learning approaches.

Download Full-text

Automatic gait generation in modular robots: “to oscillate or to rotate; that is the question”

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems ◽

10.1109/iros.2010.5649025 ◽

2010 ◽

Cited By ~ 11

Author(s):

S Pouya ◽

J van den Kieboom ◽

Alexander Spröwitz ◽

A J Ijspeert

Keyword(s):

Modular Robots ◽

Gait Generation

Download Full-text

Online Gait Learning for Modular Robots with Arbitrary Shapes and Sizes

Artificial Life ◽

10.1162/artl_a_00223 ◽

2017 ◽

Vol 23 (1) ◽

pp. 80-104 ◽

Cited By ~ 3

Author(s):

Berend Weel ◽

M. D'Angelo ◽

Evert Haasdijk ◽

A. E. Eiben

Keyword(s):

Reinforcement Learning ◽

Evolutionary Robotics ◽

Robotic Systems ◽

Modular Robots ◽

Learning Method ◽

Automated Assembly ◽

Simulation Experiments ◽

Morphological Complexity ◽

Real Hardware ◽

Gait Learning

Evolutionary robotics using real hardware is currently restricted to evolving robot controllers, but the technology for evolvable morphologies is advancing quickly. Rapid prototyping (3D printing) and automated assembly are the main enablers of robotic systems where robot offspring can be produced based on a blueprint that specifies the morphologies and the controllers of the parents. This article addresses the problem of gait learning in newborn robots whose morphology is unknown in advance. We investigate a reinforcement learning method and conduct simulation experiments using robot morphologies with different size and complexity. We establish that reinforcement learning does the job well and that it outperforms two alternative algorithms. The experiments also give insights into the online dynamics of gait learning and into the influence of the size, shape, and morphological complexity of the modular robots. These insights can potentially be used to predict the viability of modular robotic organisms before they are constructed.

Download Full-text

Gait Generation of Four-legged Running Robot Based on Reinforcement Learning to Reach a Goal

The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) ◽

10.1299/jsmermd.2017.2p1-g02 ◽

2017 ◽

Vol 2017 (0) ◽

pp. 2P1-G02

Author(s):

Kiichi TANIGUCHI ◽

Atsuki OMURA ◽

Naoto TANI ◽

Kazuyoshi TSUTSUMI

Keyword(s):

Reinforcement Learning ◽

Gait Generation

Download Full-text

Modular Robot Design Synthesis with Deep Reinforcement Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i06.6611 ◽

2020 ◽

Vol 34 (06) ◽

pp. 10418-10425

Author(s):

Julian Whitman ◽

Raunaq Bhirangi ◽

Matthew Travers ◽

Howie Choset

Keyword(s):

Reinforcement Learning ◽

Optimal Design ◽

State Of The Art ◽

Modular Robots ◽

Robot Design ◽

Computationally Efficient ◽

Design Synthesis ◽

Computational Burden ◽

Current State ◽

Deployment Time

Modular robots hold the promise of versatility in that their components can be re-arranged to adapt the robot design to a task at deployment time. Even for the simplest designs, determining the optimal design is exponentially complex due to the number of permutations of ways the modules can be connected. Further, when selecting the design for a given task, there is an additional computational burden in evaluating the capability of each robot, e.g., whether it can reach certain points in the workspace. This work uses deep reinforcement learning to create a search heuristic that allows us to efficiently search the space of modular serial manipulator designs. We show that our algorithm is more computationally efficient in determining robot designs for given tasks in comparison to the current state-of-the-art.

Download Full-text