A Critical Period for Robust Curriculum‐Based Deep Reinforcement Learning of Sequential Action in a Robot Arm

Topics in Cognitive Science ◽

10.1111/tops.12595 ◽

2022 ◽

Author(s):

Roy de Kleijn ◽

Deniz Sen ◽

George Kachergis

Keyword(s):

Reinforcement Learning ◽

Critical Period ◽

Robot Arm ◽

Sequential Action

Download Full-text

EEG-Induced Autonomous Game-Teaching to a Robot Arm by Human Trainers Using Reinforcement Learning

IEEE Transactions on Games ◽

10.1109/tg.2021.3124340 ◽

2021 ◽

pp. 1-1

Author(s):

Reshma Kar ◽

Lidia Ghosh ◽

Amit Konar ◽

Aruna Chakraborty ◽

Atulya K. Nagar

Keyword(s):

Reinforcement Learning ◽

Download Full-text

Reinforcement Learning Control for Robot Arm Grasping Based on Improved DDPG

10.23919/ccc52363.2021.9550413 ◽

2021 ◽

Author(s):

Guangjun Qi ◽

Yuan Li

Keyword(s):

Reinforcement Learning ◽

Learning Control ◽

Download Full-text

Model-Based Reinforcement Learning in Multiagent Systems with Sequential Action Selection

IEICE Transactions on Information and Systems ◽

10.1587/transinf.e94.d.255 ◽

2011 ◽

Vol E94-D (2) ◽

pp. 255-263 ◽

Author(s):

Ali AKRAMIZADEH ◽

Ahmad AFSHAR ◽

Mohammad Bagher MENHAJ ◽

Samira JAFARI

Keyword(s):

Reinforcement Learning ◽

Multiagent Systems ◽

Action Selection ◽

Sequential Action ◽

Download Full-text

Robot Arm Control Method of Moving Below Object Based on Deep Reinforcement Learning

Communications in Computer and Information Science - Methods and Applications for Modeling and Simulation of Complex Systems ◽

10.1007/978-981-15-1078-6_11 ◽

2019 ◽

pp. 127-136

Author(s):

HeYu Li ◽

LiQin Guo ◽

GuoQiang Shi ◽

YingYing Xiao ◽

Bi Zeng ◽

...

Keyword(s):

Reinforcement Learning ◽

Control Method ◽

Robot Arm ◽

Object Based ◽

Robot Arm Control

Download Full-text

Hand-eye coordination in robot arm reaching task by reinforcement learning using a neural network

IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028) ◽

10.1109/icsmc.1999.815594 ◽

2003 ◽

Author(s):

K. Shibata ◽

K. Ito

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Robot Arm ◽

Reaching Task ◽

Arm Reaching ◽

Hand Eye Coordination

Download Full-text

Biological robot arm motion through reinforcement learning

Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292) ◽

10.1109/robot.2002.1014236 ◽

2003 ◽

Author(s):

J. Izawa ◽

T. Kondo ◽

K. Ito

Keyword(s):

Reinforcement Learning ◽

Robot Arm ◽

Download Full-text

Gamma-Nets: Generalizing Value Estimation over Timescale

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6027 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5717-5725

Author(s):

Craig Sherstan ◽

Shibhansh Dohare ◽

James MacGlashan ◽

Johannes Günther ◽

Patrick M. Pilarski

Keyword(s):

Reinforcement Learning ◽

Value Function ◽

A Priori ◽

Predictive Ability ◽

Representation Learning ◽

Robot Arm ◽

Function Estimation ◽

Temporal Abstraction ◽

Long Time ◽

Function Approximator

Temporal abstraction is a key requirement for agents making decisions over long time horizons—a fundamental challenge in reinforcement learning. There are many reasons why value estimates at multiple timescales might be useful; recent work has shown that value estimates at different time scales can be the basis for creating more advanced discounting functions and for driving representation learning. Further, predictions at many different timescales serve to broaden an agent's model of its environment. One predictive approach of interest within an online learning setting is general value function (GVFs), which represent models of an agent's world as a collection of predictive questions each defined by a policy, a signal to be predicted, and a prediction timescale. In this paper we present Γ-nets, a method for generalizing value function estimation over timescale, allowing a given GVF to be trained and queried for arbitrary timescales so as to greatly increase the predictive ability and scalability of a GVF-based model. The key to our approach is to use timescale as one of the value estimator's inputs. As a result, the prediction target for any timescale is available at every timestep and we are free to train on any number of timescales. We first provide two demonstrations by 1) predicting a square wave and 2) predicting sensorimotor signals on a robot arm using a linear function approximator. Next, we empirically evaluate Γ-nets in the deep reinforcement learning setting using policy evaluation on a set of Atari video games. Our results show that Γ-nets can be effective for predicting arbitrary timescales, with only a small cost in accuracy as compared to learning estimators for fixed timescales. Γ-nets provide a method for accurately and compactly making predictions at many timescales without requiring a priori knowledge of the task, making it a valuable contribution to ongoing work on model-based planning, representation learning, and lifelong learning algorithms.

Download Full-text

Model-Free Reinforcement Learning with Ensemble for a Soft Continuum Robot Arm

2021 IEEE 4th International Conference on Soft Robotics (RoboSoft) ◽

10.1109/robosoft51838.2021.9479340 ◽

2021 ◽

Author(s):

Ryota Morimoto ◽

Satoshi Nishikawa ◽

Ryuma Niiyama ◽

Yasuo Kuniyoshi

Keyword(s):

Reinforcement Learning ◽

Robot Arm ◽

Continuum Robot

Download Full-text

Throwing Motion by Flexible Robot Arm using Deep Reinforcement Learning

The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) ◽

10.1299/jsmermd.2020.2a2-j09 ◽

2020 ◽

Vol 2020 (0) ◽

pp. 2A2-J09

Author(s):

Kenta YOSHIZAWA ◽

Taisuke KOBAYASHI ◽

Kenji SUGIMOTO

Keyword(s):

Reinforcement Learning ◽

Robot Arm ◽

Flexible Robot ◽

Throwing Motion

Download Full-text

Reinforcement learning strategies for sequential action learning

Neuroscience Research ◽

10.1016/j.neures.2009.09.1332 ◽

2009 ◽

Vol 65 ◽

pp. S236

Author(s):

Alan Fermin ◽

Yoshida Takehiko ◽

Saori Tanaka ◽

Makoto Ito ◽

Junichiro Yoshimoto ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Strategies ◽

Action Learning ◽

Sequential Action

Download Full-text