Smooth Imitation Learning via Smooth Costs and Smooth Policies

2022
Author(s): Sapana Chaudhary, Balaraman Ravindran
2005
Author(s): Frederick L. Crabbe, Rebecca Hwa

2021
Author(s): Markku Suomalainen, Fares J. Abu-dakka, Ville Kyrki

Abstract: We present a novel method for learning, from demonstrations, 6-D tasks that can be modeled as a sequence of linear motions and compliances. The focus of this paper is the learning of a single linear primitive, many of which can be sequenced to perform more complex tasks. The presented method learns from demonstrations how to take advantage of mechanical gradients in in-contact tasks, such as assembly, both for translations and rotations, without any prior information. The method assumes there exists a desired linear direction in 6-D which, if followed by the manipulator, leads the robot's end-effector to the goal area shown in the demonstration, either in free space or by leveraging contact through compliance. First, demonstrations are gathered in which the teacher explicitly shows the robot how the mechanical gradients can be used as guidance towards the goal. From the demonstrations, a set of directions is computed which would result in the observed motion at each timestep during a demonstration of a single primitive. By observing which direction is included in all these sets, we find a single desired direction which can reproduce the demonstrated motion. Finding the number of compliant axes and their directions in both rotation and translation is based on the assumption that, in the presence of a desired direction of motion, all other observed motion is caused by the contact force of the environment, signalling the need for compliance. We evaluate the method on a KUKA LWR4+ robot with test setups imitating typical tasks where a human would use compliance to cope with positional uncertainty. Results show that the method can successfully learn and reproduce compliant motions by taking advantage of the geometry of the task, therefore reducing the need for localization accuracy.
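The direction-intersection step described in the abstract lends itself to a compact illustration. The following sketch is only an assumed simplification (3-D translations, no rotations, no compliance-axis estimation), not the authors' implementation; the function names, the sampling of candidate unit vectors, and the positive-projection consistency test are hypothetical choices made here for illustration.

    # Assumed sketch of the direction-set idea: at every timestep keep only the
    # candidate unit directions consistent with the observed motion; a desired
    # direction is one that survives every timestep of the demonstration.
    import numpy as np

    def candidate_directions(n_samples=2000, seed=0):
        """Sample candidate desired directions uniformly on the unit sphere."""
        rng = np.random.default_rng(seed)
        v = rng.normal(size=(n_samples, 3))
        return v / np.linalg.norm(v, axis=1, keepdims=True)

    def consistent_at_step(candidates, observed_velocity, cos_threshold=0.0):
        """Keep candidates with a positive component along the observed motion;
        the remaining deviation is attributed to contact and compliance."""
        v = observed_velocity / (np.linalg.norm(observed_velocity) + 1e-9)
        return candidates @ v > cos_threshold

    def desired_direction(velocities):
        """Intersect per-timestep candidate sets over a (T, 3) velocity array."""
        cands = candidate_directions()
        keep = np.ones(len(cands), dtype=bool)
        for vel in velocities:
            keep &= consistent_at_step(cands, vel)
        if not keep.any():
            raise ValueError("no single direction explains the whole demonstration")
        d = cands[keep].mean(axis=0)
        return d / np.linalg.norm(d)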


Author(s): Alireza Shamsoshoara, Fatemeh Afghah, Erik Blasch, Jonathan Ashdown, Mehdi Bennis

2021
pp. 102079
Author(s): Kerstin Kläser, Thomas Varsavsky, Pawel Markiewicz, Tom Vercauteren, Alexander Hammers, ...

Author(s): Yuanjie Dang, Chong Huang, Peng Chen, Ronghua Liang, Xin Yang, ...

2021
Vol 35 (2)
Author(s): Nicolas Bougie, Ryutaro Ichise

Abstract: Deep reinforcement learning methods have achieved significant successes in complex decision-making problems. However, they traditionally rely on well-designed extrinsic rewards, which limits their applicability to many real-world tasks where rewards are naturally sparse. While cloning behaviors provided by an expert is a promising approach to the exploration problem, learning from a fixed set of demonstrations may be impracticable due to lack of state coverage or distribution mismatch, i.e., when the learner's goal deviates from the demonstrated behaviors. Moreover, we are interested in learning how to reach a wide range of goals from the same set of demonstrations. In this work we propose a novel goal-conditioned method that leverages very small sets of goal-driven demonstrations to massively accelerate the learning process. Crucially, we introduce the concept of active goal-driven demonstrations to query the demonstrator only in hard-to-learn and uncertain regions of the state space. We further present a strategy for prioritizing the sampling of goals where the disagreement between the expert and the policy is maximized. We evaluate our method on a variety of benchmark environments from the Mujoco domain. Experimental results show that our method outperforms prior imitation learning approaches in most of the tasks in terms of exploration efficiency and average scores.
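The goal-prioritization step can be pictured with a small sketch. What follows is an assumption rather than the authors' code: goals are sampled with probability given by a softmax over the gap between the action proposed by the current policy and the action demonstrated by the expert; the helper names, the L2 disagreement measure, and the temperature parameter are all hypothetical.

    # Assumed sketch of disagreement-prioritized goal sampling: goals where the
    # policy deviates most from the expert's demonstration are drawn more often.
    import numpy as np

    def disagreement(policy_action, expert_action):
        """Disagreement for one goal, here the L2 gap between the two actions."""
        return np.linalg.norm(np.asarray(policy_action) - np.asarray(expert_action))

    def sample_goal(goals, policy, expert_actions, temperature=1.0, rng=None):
        """Draw one goal, softmax-weighted by expert/policy disagreement.

        goals:          list of goal vectors
        policy:         callable mapping a goal to the action the learner proposes
        expert_actions: list of demonstrated actions, aligned with goals
        """
        rng = rng or np.random.default_rng()
        scores = np.array([disagreement(policy(g), a)
                           for g, a in zip(goals, expert_actions)])
        probs = np.exp(scores / temperature)
        probs /= probs.sum()
        return goals[rng.choice(len(goals), p=probs)]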

