Motion capture and reinforcement learning of dynamically stable humanoid movement primitives

Robot learning problems are limited by physical constraints, which make learning successful policies for complex motor skills on real systems unfeasible. Some reinforcement learning methods, like Policy Search, offer stable convergence toward locally optimal solutions, whereas interactive machine learning or learning-from-demonstration methods allow fast transfer of human knowledge to the agents. However, most methods require expert demonstrations. In this work, we propose the use of human corrective advice in the actions domain for learning motor trajectories. Additionally, we combine this human feedback with reward functions in a Policy Search learning scheme. The use of both sources of information speeds up the learning process, since the intuitive knowledge of the human teacher can be easily transferred to the agent, while the Policy Search method with the cost/reward function take over for supervising the process and reducing the influence of occasional wrong human corrections. This interactive approach has been validated for learning movement primitives with simulated arms with several degrees of freedom in reaching via-point movements, and also using real robots in such tasks as “writing characters” and the ball-in-a-cup game. Compared with standard reinforcement learning without human advice, the results show that the proposed method not only converges to higher rewards when learning movement primitives, but also that the learning is sped up by a factor of 4–40 times, depending on the task.

Download Full-text

AirCapRL: Autonomous Aerial Human Motion Capture Using Deep Reinforcement Learning

IEEE Robotics and Automation Letters ◽

10.1109/lra.2020.3013906 ◽

2020 ◽

Vol 5 (4) ◽

pp. 6678-6685

Author(s):

Rahul Tallamraju ◽

Nitin Saini ◽

Elia Bonetto ◽

Michael Pabst ◽

Yu Tang Liu ◽

...

Keyword(s):

Reinforcement Learning ◽

Motion Capture ◽

Human Motion ◽

Human Motion Capture

Download Full-text

Reinforcement Learning of Manipulation and Grasping Using Dynamical Movement Primitives for a Humanoidlike Mobile Manipulator

IEEE/ASME Transactions on Mechatronics ◽

10.1109/tmech.2017.2717461 ◽

2018 ◽

Vol 23 (1) ◽

pp. 121-131 ◽

Cited By ~ 45

Author(s):

Zhijun Li ◽

Ting Zhao ◽

Fei Chen ◽

Yingbai Hu ◽

Chun-Yi Su ◽

...

Keyword(s):

Reinforcement Learning ◽

Mobile Manipulator ◽

Movement Primitives

Download Full-text

General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles

Sensors ◽

10.3390/s21134560 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4560

Author(s):

Chen-Huan Pi ◽

Yi-Wei Dai ◽

Kai-Chun Hu ◽

Stone Cheng

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Motion Capture ◽

Flight Control ◽

Control Policy ◽

General Purpose ◽

Position Information ◽

Low Level ◽

Model Free ◽

Aerial Vehicles

This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states.

Download Full-text