Reinforcement Learning of Manipulation and Grasping Using Dynamical Movement Primitives for a Humanoidlike Mobile Manipulator

2018 ◽  
Vol 23 (1) ◽  
pp. 121-131 ◽  
Author(s):  
Zhijun Li ◽  
Ting Zhao ◽  
Fei Chen ◽  
Yingbai Hu ◽  
Chun-Yi Su ◽  
...  

2019 ◽
Vol 38 (14) ◽  
pp. 1560-1580 ◽  
Author(s):  
Carlos Celemin ◽  
Guilherme Maeda ◽  
Javier Ruiz-del-Solar ◽  
Jan Peters ◽  
Jens Kober

Robot learning problems are limited by physical constraints, which can make learning successful policies for complex motor skills on real systems infeasible. Some reinforcement learning methods, like Policy Search, offer stable convergence toward locally optimal solutions, whereas interactive machine learning and learning-from-demonstration methods allow fast transfer of human knowledge to the agent. However, most such methods require expert demonstrations. In this work, we propose the use of human corrective advice in the action domain for learning motor trajectories. Additionally, we combine this human feedback with reward functions in a Policy Search learning scheme. Using both sources of information speeds up the learning process, since the intuitive knowledge of the human teacher is easily transferred to the agent, while the Policy Search method with its cost/reward function supervises the process and reduces the influence of occasional wrong human corrections. This interactive approach has been validated for learning movement primitives with simulated arms with several degrees of freedom in via-point reaching movements, as well as with real robots in tasks such as "writing characters" and the ball-in-a-cup game. Compared with standard reinforcement learning without human advice, the results show that the proposed method not only converges to higher rewards when learning movement primitives, but also speeds up learning by a factor of 4 to 40, depending on the task.
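To make the combination concrete, the following is a minimal sketch of how corrective advice in the action domain can be interleaved with an episodic Policy Search update on the weights of a movement primitive. All names, gains, and the reward-weighted update rule (a PoWER/PI2-style average) are illustrative assumptions, not the authors' exact algorithm; the placeholder rollout_return and human_correction functions stand in for executing the primitive and querying the human teacher.

import numpy as np

N_BASIS = 10          # number of primitive basis functions (assumed)
N_ROLLOUTS = 8        # rollouts per policy-search update (assumed)
SIGMA = 0.05          # exploration noise on the weights (assumed)
ADVICE_GAIN = 0.1     # step size for human corrections (assumed)

def rollout_return(weights):
    """Placeholder: execute the primitive and return the episode reward."""
    return -np.sum(weights ** 2)  # toy reward for demonstration only

def human_correction(weights):
    """Placeholder: corrective advice in the action domain, e.g. a
    -1/0/+1 signal per dimension from a teacher observing the rollout."""
    return np.zeros(N_BASIS)

weights = np.zeros(N_BASIS)
for update in range(100):
    # 1) Human feedback: shift the mean policy toward the advised direction.
    weights += ADVICE_GAIN * human_correction(weights)

    # 2) Policy Search: reward-weighted averaging of perturbed rollouts,
    #    which dampens the effect of occasional wrong corrections.
    noise = SIGMA * np.random.randn(N_ROLLOUTS, N_BASIS)
    returns = np.array([rollout_return(weights + n) for n in noise])
    probs = np.exp(returns - returns.max())
    probs /= probs.sum()
    weights += probs @ noise  # move toward high-reward perturbations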


2019 ◽  
Vol 9 (2) ◽  
pp. 348 ◽  
Author(s):  
Ander Iriondo ◽  
Elena Lazkano ◽  
Loreto Susperregi ◽  
Julen Urain ◽  
Ane Fernandez ◽  
...  

Programming robots to perform complex tasks is very expensive. Traditional path planning and control can generate point-to-point collision-free trajectories, but when the tasks to be performed are complex, planning and control themselves become complex. This study focuses on robotic operations in logistics, specifically on picking objects in unstructured areas using a mobile manipulator. The mobile manipulator has to position its base correctly so that the arm can plan a trajectory to an object on a table. A deep reinforcement learning (DRL) approach was selected to solve this type of complex control task. Using the arm planner's feedback, a controller for the robot base is learned that guides the platform to a pose from which the arm can plan a trajectory to the object. In addition, the performance of two DRL algorithms, Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimisation (PPO), is compared in the context of a concrete robotic task.
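As an illustration of how planner feedback can drive the reward signal, below is a minimal gym-style sketch of the base-positioning task. The BasePositioningEnv class, the reachability test standing in for an actual arm-planner query, and all thresholds are hypothetical; the paper's real setup queries the arm's motion planner rather than checking a fixed radius.

import numpy as np

class BasePositioningEnv:
    """Toy environment: the action moves the mobile base, and the reward
    is derived from whether the arm could plan a trajectory to the target
    from the current base pose (here approximated by a reach radius)."""

    def __init__(self, target_xy=(1.0, 0.5), reach=0.8):
        self.target = np.asarray(target_xy)
        self.reach = reach          # assumed arm reachability radius
        self.base = np.zeros(2)

    def reset(self):
        self.base = np.random.uniform(-2.0, 2.0, size=2)
        return self._obs()

    def _obs(self):
        # Base pose plus relative target position as the observation.
        return np.concatenate([self.base, self.target - self.base])

    def _arm_can_plan(self):
        """Stand-in for the arm motion planner's feedback: success is
        simply the target lying inside the assumed reach radius."""
        return np.linalg.norm(self.target - self.base) < self.reach

    def step(self, action):
        self.base += np.clip(action, -0.1, 0.1)  # bounded base motion
        done = self._arm_can_plan()
        # Dense shaping toward the target, plus a bonus when the planner
        # reports a feasible arm trajectory.
        reward = -np.linalg.norm(self.target - self.base) + (10.0 if done else 0.0)
        return self._obs(), reward, done, {}

An environment of this shape can then be trained with off-the-shelf DDPG and PPO implementations to reproduce the comparison; only the reward wiring to the real arm planner would change.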


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Zeguo Yang ◽  
Mantian Li ◽  
Fusheng Zha ◽  
Xin Wang ◽  
Pengfei Wang ◽  
...  

Purpose
This paper aims to introduce an imitation learning framework for a wheeled mobile manipulator based on dynamical movement primitives (DMPs). A novel mobile manipulator with the capability to learn from demonstration is introduced, and the whole process by which a wheeled mobile manipulator learns a demonstrated task and generalizes to new situations is explained. Two visual tracking controllers are designed, one for recording human demonstrations and one for monitoring robot operations. The study clarifies how human demonstrations can be learned and generalized to new situations by a wheeled mobile manipulator.

Design/methodology/approach
The kinematic model of the mobile manipulator is analyzed. An RGB-D camera is applied to record the demonstration trajectories and observe robot operations. To keep the human demonstration within the camera's field of view, a visual tracking controller is designed based on the kinematic model of the mobile manipulator. The demonstration trajectories are then represented by DMPs and learned by the mobile manipulator with corresponding models. Another tracking controller, also based on the kinematic model, is designed to monitor and modify the robot operations.

Findings
To verify the effectiveness of the imitation learning framework, several daily tasks are demonstrated and learned by the mobile manipulator. The results indicate that the presented approach performs well in enabling a wheeled mobile manipulator to learn tasks through human demonstrations. The only thing a robot user needs to do is provide demonstrations, which greatly facilitates the application of mobile manipulators.

Originality/value
The research fulfills the need for a wheeled mobile manipulator to learn tasks via demonstration instead of manual planning. Similar approaches can be applied to mobile manipulators with different architectures.
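For readers unfamiliar with DMPs, the following is a minimal sketch of the core mechanism such a framework builds on: fitting a DMP forcing term to a demonstrated trajectory and rolling the primitive out toward a new goal. It follows the standard Ijspeert-style discrete DMP with goal scaling omitted for brevity; all gains, basis counts, and the toy demonstration are assumptions, not the paper's parameters.

import numpy as np

ALPHA, BETA, ALPHA_X = 25.0, 6.25, 3.0   # transformation/canonical gains
N_BASIS, TAU, DT = 20, 1.0, 0.001        # basis count, duration, step size

def fit_dmp(y_demo):
    """Learn forcing-term weights from a 1-D demonstrated trajectory."""
    t = np.linspace(0, TAU, len(y_demo))
    yd = np.gradient(y_demo, t)
    ydd = np.gradient(yd, t)
    g, y0 = y_demo[-1], y_demo[0]
    x = np.exp(-ALPHA_X * t / TAU)                    # canonical system
    f_target = TAU**2 * ydd - ALPHA * (BETA * (g - y_demo) - TAU * yd)
    c = np.exp(-ALPHA_X * np.linspace(0, 1, N_BASIS)) # basis centers
    h = N_BASIS / c                                   # basis widths
    psi = np.exp(-h * (x[:, None] - c) ** 2)
    # Locally weighted regression, one weight per basis function.
    w = np.array([(x * psi[:, i]) @ f_target / ((x * psi[:, i]) @ x + 1e-10)
                  for i in range(N_BASIS)])
    return w, c, h, y0

def rollout(w, c, h, y0, g, steps=1000):
    """Integrate the DMP toward a (possibly new) goal g."""
    y, yd, x, traj = y0, 0.0, 1.0, []
    for _ in range(steps):
        psi = np.exp(-h * (x - c) ** 2)
        f = (psi @ w) * x / (psi.sum() + 1e-10)       # learned forcing term
        ydd = (ALPHA * (BETA * (g - y) - TAU * yd) + f) / TAU**2
        yd += ydd * DT
        y += yd * DT
        x += (-ALPHA_X * x / TAU) * DT                # phase decays to 0
        traj.append(y)
    return np.array(traj)

demo = np.sin(np.linspace(0, np.pi, 1000))   # toy demonstration
w, c, h, y0 = fit_dmp(demo)
new_traj = rollout(w, c, h, y0, g=2.0)       # generalize to a new goal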

