Learning from Demonstration
Recently Published Documents

TOTAL DOCUMENTS: 236 (five years: 82)
H-INDEX: 19 (five years: 2)

2021, Vol. 33 (5), pp. 1063-1074
Author(s): Kei Kase, Noboru Matsumoto, Tetsuya Ogata, ...

Deep robotic learning from demonstration allows robots to mimic a given demonstration and generalize their performance to unknown task setups. However, this generalization ability depends heavily on the number of demonstrations, which are costly to generate manually. Without sufficient demonstrations, robots tend to overfit to the available demonstrations and lose the robustness offered by deep learning. Applying the concept of motor babbling – a process similar to that by which human infants move their bodies randomly to obtain proprioception – is also effective for enhancing robots' generalization ability, and babbling data are simpler to generate than task-oriented demonstrations. Previous studies have used motor babbling for pre-training followed by fine-tuning, but the babbling data are then overwritten by the task data. In this work, we propose an RNN-based robot-control framework that leverages targetless babbling data to help the robot acquire proprioception and that increases the generalization ability of the learned task data by learning babbling and task data simultaneously. Through simultaneous learning, our framework can use the dynamics obtained from the babbling data to learn the target task efficiently. In our experiments, we prepare demonstrations of a block-picking task together with aimless babbling data. With our framework, the robot learns tasks faster and shows greater generalization ability when blocks are at unknown positions or move during execution.
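The core idea of simultaneous learning (as opposed to pre-train/fine-tune) can be illustrated with a minimal sketch. All names here are illustrative stand-ins, not the paper's implementation; the point is that each training batch mixes task demonstrations with targetless babbling sequences, so one shared model is updated by both at every step and the babbling dynamics are never overwritten:

```python
import random

def make_mixed_batch(task_seqs, babbling_seqs, batch_size, task_ratio=0.5):
    """Sample one training batch that mixes task demonstrations with
    targetless babbling sequences, so both kinds of data update the
    shared model at every step (no pre-train / fine-tune split)."""
    n_task = int(batch_size * task_ratio)
    n_babble = batch_size - n_task
    batch = (random.sample(task_seqs, min(n_task, len(task_seqs)))
             + random.sample(babbling_seqs, min(n_babble, len(babbling_seqs))))
    random.shuffle(batch)
    return batch

# Toy data: each "sequence" is just a labeled placeholder here.
task = [("task", i) for i in range(10)]
babble = [("babble", i) for i in range(50)]
batch = make_mixed_batch(task, babble, batch_size=8)
```

In a real training loop, each element of `batch` would be a sensorimotor sequence fed to the RNN under a single prediction loss.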


2021, pp. 027836492110462
Author(s): Lin Shao, Toki Migimatsu, Qiang Zhang, Karen Yang, Jeannette Bohg

We aim to endow a robot with the ability to learn manipulation concepts that link natural language instructions to motor skills. Our goal is to learn a single multi-task policy that takes as input a natural language instruction and an image of the initial scene and outputs a robot motion trajectory to achieve the specified task. This policy has to generalize over different instructions and environments. Our insight is that we can approach this problem through learning from demonstration by leveraging large-scale video datasets of humans performing manipulation actions. Thereby, we avoid more time-consuming processes such as teleoperation or kinesthetic teaching. We also avoid having to manually design task-specific rewards. We propose a two-stage learning process where we first learn single-task policies through reinforcement learning. The reward is provided by scoring how well the robot visually appears to perform the task. This score is given by a video-based action classifier trained on a large-scale human activity dataset. In the second stage, we train a multi-task policy through imitation learning to imitate all the single-task policies. In extensive simulation experiments, we show that the multi-task policy learns to perform a large percentage of the 78 different manipulation tasks on which it was trained. The tasks are of greater variety and complexity than previously considered robot manipulation tasks. We show that the policy generalizes over variations of the environment. We also show examples of successful generalization over novel but similar instructions.
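The first-stage reward scheme can be sketched as follows. This is a hypothetical sketch, not the paper's code: `classifier_reward`, `stub_classify`, and the task names are stand-ins, with a trivial stub playing the role of the video-based action classifier trained on human activity data:

```python
def classifier_reward(frames, target_task, classify):
    """Score how well a rollout visually matches the target task.
    `classify` maps a frame sequence to {task_name: probability};
    the probability assigned to the target task is the reward."""
    scores = classify(frames)
    return scores.get(target_task, 0.0)

# Stub standing in for a video-based action classifier.
def stub_classify(frames):
    # Pretend classifier confidence grows with the fraction of
    # frames in which the gripper is in contact with an object.
    contact = sum(1 for f in frames if f.get("contact"))
    p = contact / max(len(frames), 1)
    return {"pick_up": p, "push": 1.0 - p}

rollout = [{"contact": t >= 2} for t in range(5)]  # contact from step 2 on
r = classifier_reward(rollout, "pick_up", stub_classify)  # → 0.6
```

Each single-task RL policy would be trained to maximize this score; the second stage then distills all single-task policies into one multi-task policy via imitation.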


Author(s): Mingfei Sun, Zhenhui Peng, Meng Xia, Xiaojuan Ma

Robot learning from demonstration (RLfD) is a technique for robots to derive policies from instructors' examples. Although the reciprocal effects of student engagement on teacher behavior are widely recognized in the educational community, it is unclear whether the same phenomenon holds for RLfD. To fill this gap, we first design three types of robot engagement behavior (gaze, imitation, and a hybrid of the two) based on the learning literature. We then conduct, in a simulation environment, a within-subject user study to investigate the impact of different robot engagement cues on humans compared with a "without-engagement" condition. Results suggest that engagement communication has a significantly negative influence on humans' estimation of the simulated robots' capability and significantly raises their expectations of the learning outcomes, even though we do not run actual imitation-learning algorithms in the experiments. Moreover, imitation behavior affects humans more than gaze does on all metrics, while their combination has the most profound influence. We also find that communicating engagement via imitation or the combined behavior significantly improves humans' perception of the quality of the simulated demonstrations, even though all demonstrations are of the same quality.


2021
Author(s): Yunlei Shi, Zhaopeng Chen, Yansong Wu, Dimitri Henkel, Sebastian Riedel, ...

2021, Vol. 2021, pp. 1-16
Author(s): Yichuan Zhang, Yixing Lan, Qiang Fang, Xin Xu, Junxiang Li, ...

Reinforcement learning from demonstration (RLfD) is considered a promising approach for improving reinforcement learning (RL) by leveraging expert demonstrations as additional decision-making guidance. However, most existing RLfD methods regard demonstrations only as low-level knowledge instances for a particular task. Demonstrations are generally used either to provide additional rewards or to pretrain the neural network-based RL policy in a supervised manner, usually resulting in poor generalization and weak robustness. Considering that human knowledge is not only interpretable but also well suited to generalization, we propose to exploit the potential of demonstrations by extracting knowledge from them via Bayesian networks, and we develop a novel RLfD method called Reinforcement Learning from demonstration via Bayesian Network-based Knowledge (RLBNK). The proposed RLBNK method uses the node influence with Wasserstein distance (NIW) algorithm to obtain abstract concepts from demonstrations, after which a Bayesian network performs knowledge learning and inference on the abstracted dataset, yielding a coarse policy with a corresponding confidence. When the coarse policy's confidence is low, an RL-based refinement module further optimizes and fine-tunes the policy to form a (near-)optimal hybrid policy. Experimental results show that the proposed RLBNK method improves the learning efficiency of the corresponding baseline RL algorithms under both normal and sparse reward settings. Furthermore, we demonstrate that RLBNK delivers better generalization and robustness than the baseline methods.
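The confidence-gated hand-off between the coarse Bayesian-network policy and the RL refinement module can be sketched in a few lines. The names and toy policies below are illustrative assumptions, not RLBNK's actual components:

```python
def hybrid_policy(state, bn_policy, rl_policy, threshold=0.8):
    """Confidence-gated action selection (illustrative sketch):
    use the Bayesian-network coarse policy when its confidence is
    high enough; otherwise defer to the RL-refined policy."""
    action, confidence = bn_policy(state)
    if confidence >= threshold:
        return action, "bn"
    return rl_policy(state), "rl"

# Toy policies standing in for the learned components.
def toy_bn(state):
    # High confidence on states resembling the demonstrations.
    return ("left", 0.9) if state == "seen" else ("left", 0.3)

def toy_rl(state):
    return "right"

a1 = hybrid_policy("seen", toy_bn, toy_rl)   # → ("left", "bn")
a2 = hybrid_policy("novel", toy_bn, toy_rl)  # → ("right", "rl")
```

The design choice is that interpretable, abstracted knowledge covers familiar situations, while the RL module handles the states the demonstrations explain poorly.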


2021, pp. 027836492110405
Author(s): Emmanuel Pignat, João Silvério, Sylvain Calinon

Probability distributions are key components of many learning from demonstration (LfD) approaches, with the spaces chosen to represent tasks playing a central role. Although the robot configuration is defined by its joint angles, end-effector poses are often best explained within several task spaces. In many approaches, distributions within relevant task spaces are learned independently and only combined at the control level. This simplification implies several problems that are addressed in this work. We show that the fusion of models in different task spaces can be expressed as a product of experts (PoE), where the probabilities of the models are multiplied and renormalized so that the result becomes a proper distribution over joint angles. Multiple experiments show that learning the different models jointly in the PoE framework significantly improves the quality of the final model. The proposed approach particularly stands out when the robot has to learn hierarchical objectives that arise when a task requires the prioritization of several sub-tasks (e.g. in a humanoid robot, keeping balance has a higher priority than reaching for an object). Since training the model jointly usually relies on contrastive divergence, which requires costly approximations that can affect performance, we propose an alternative strategy using variational inference and mixture model approximations. In particular, we show that the proposed approach can be extended to a PoE with a nullspace structure (PoENS), where the model is able to recover secondary tasks that are masked by the resolution of tasks of higher importance.
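The product-of-experts fusion has a simple closed form in the Gaussian case, which gives the flavor of the approach. This is only the 1-D Gaussian identity (precisions add; the fused mean is the precision-weighted average), not the paper's full joint-angle model with renormalization:

```python
def poe_gaussian(experts):
    """Fuse 1-D Gaussian experts (mean, variance) as a product of
    experts: precisions add, and the fused mean is the
    precision-weighted average of the experts' means."""
    precision = sum(1.0 / var for _, var in experts)
    mean = sum(mu / var for mu, var in experts) / precision
    return mean, 1.0 / precision

# Two equally confident task-space "experts" over the same variable:
fused_mean, fused_var = poe_gaussian([(0.0, 1.0), (2.0, 1.0)])
# → fused_mean = 1.0, fused_var = 0.5
```

Note that the fused variance is smaller than either expert's: agreement between task-space models sharpens the joint-angle distribution, which is exactly why fusing before, rather than after, learning pays off.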

