Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention

Author(s):  
Abhishek Gupta ◽  
Justin Yu ◽  
Tony Z. Zhao ◽  
Vikash Kumar ◽  
Aaron Rovinsky ◽  
...  
Author(s):  
Abdelghafour Harraz ◽  
Mostapha Zbakh

Artificial intelligence makes it possible to build engines that explore and learn their environments and thereby derive policies for controlling them in real time with no human intervention. Through reinforcement learning techniques, using frameworks such as temporal-difference learning, State-Action-Reward-State-Action (SARSA), and Q-learning, to name a few, it can be applied to any system that can be modeled as a Markov decision process. This opens the door to applying reinforcement learning to cloud load balancing, so that load can be dispatched dynamically to a given cloud system. The authors describe different techniques that can be used to implement a reinforcement-learning-based engine in a cloud system.
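To make the connection concrete, the Q-learning framework mentioned above can be sketched for a load-dispatching setting. Everything below (the discretized load-level states, the decay model, the negative-load reward) is a toy illustration chosen for this sketch, not the authors' actual formulation:

```python
import random

# Toy tabular Q-learning for dispatching work to one of several servers.
# States are coarse total-load levels; the reward favors sending work to
# lightly loaded servers. All modeling choices here are illustrative.

N_SERVERS = 3          # actions: which server receives the next request
N_LEVELS = 4           # states: discretized total load of the system
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1

Q = [[0.0] * N_SERVERS for _ in range(N_LEVELS)]
loads = [0.0] * N_SERVERS

def step(action):
    """Assign one unit of work to server `action`; loads decay over time."""
    loads[action] += 1.0
    for i in range(N_SERVERS):
        loads[i] *= 0.9                       # work completes over time
    reward = -loads[action]                   # penalize loading a busy server
    state = min(int(sum(loads)), N_LEVELS - 1)
    return state, reward

state = 0
for _ in range(5000):
    # epsilon-greedy action selection
    if random.random() < EPS:
        action = random.randrange(N_SERVERS)
    else:
        action = max(range(N_SERVERS), key=lambda a: Q[state][a])
    next_state, reward = step(action)
    # Q-learning temporal-difference update
    Q[state][action] += ALPHA * (
        reward + GAMMA * max(Q[next_state]) - Q[state][action])
    state = next_state
```

The same loop becomes SARSA by replacing `max(Q[next_state])` with the value of the action actually selected in the next state; a real cloud dispatcher would replace the toy `step` function with observed system metrics.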


2020 ◽  
Vol 34 (03) ◽  
pp. 2561-2568
Author(s):  
Morgane Ayle ◽  
Jimmy Tekli ◽  
Julia El-Zini ◽  
Boulos El-Asmar ◽  
Mariette Awad

Research has shown that deep neural networks are able to help and assist human workers throughout the industrial sector via different computer vision applications. However, such data-driven learning approaches require a very large number of labeled training images in order to generalize well and achieve accuracies that meet industry standards. Gathering and labeling large amounts of images is both expensive and time consuming, especially for industrial use-cases. In this work, we introduce BAR (Bounding-box Automated Refinement), a reinforcement learning agent that learns to correct inaccurate bounding-boxes that are weakly generated by certain detection methods, or wrongly annotated by a human, using either an offline training method with Deep Reinforcement Learning (BAR-DRL) or an online one using Contextual Bandits (BAR-CB). Our agent limits human intervention to correcting or verifying a subset of bounding-boxes instead of re-drawing new ones. Results on a car industry-related dataset and on the PASCAL VOC dataset show a consistent increase of up to 0.28 in the Intersection-over-Union of bounding-boxes with their desired ground-truths, while saving 30%-82% of human intervention time in either correcting or re-drawing inaccurate proposals.
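The Intersection-over-Union metric that BAR's improvement is reported in can be computed as follows; the `(x1, y1, x2, y2)` corner convention is an assumption for this sketch, not something specified in the abstract:

```python
def iou(box_a, box_b):
    """Intersection-over-Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

A refinement agent in this setting would be rewarded for actions (nudging box edges) that increase `iou(proposal, ground_truth)`; identical boxes score 1.0 and disjoint boxes score 0.0.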


Author(s):  
Naaima Suroor ◽  
Imran Hussain ◽  
Aqeel Khalique ◽  
Tabrej Ahamad Khan

Reinforcement learning is a flourishing machine learning concept that has greatly influenced how robots are designed and taught to solve problems without human intervention. Robotics is not an alien discipline anymore, and we have several great innovations in this field that promise to impact lives for the better. However, humanoid robots are still a baffling concept for scientists, although we have managed to develop a few great inventions which look, talk, work, and behave very similarly to humans. But can these machines actually exhibit the cognitive abilities of judgment, problem-solving, and perception as well as humans do? In this article, the authors analyze the probable impact and aspects of robots and their potential to behave like humans in every possible way through reinforcement learning techniques. The paper also discusses the gap between 'natural' and 'artificial' knowledge.


Author(s):  
Aravind Rajeswaran ◽  
Vikash Kumar ◽  
Abhishek Gupta ◽  
Giulia Vezzani ◽  
John Schulman ◽  
...  

10.29007/g7bg ◽  
2019 ◽  
Author(s):  
João Ribeiro ◽  
Francisco Melo ◽  
João Dias

In this paper we investigate two hypotheses regarding the use of deep reinforcement learning in multiple tasks. The first hypothesis is driven by the question of whether a deep reinforcement learning algorithm, trained on two similar tasks, is able to outperform two single-task, individually trained algorithms by more efficiently learning a new, similar task that none of the three algorithms has encountered before. The second hypothesis is driven by the question of whether the same multi-task deep RL algorithm, trained on two similar tasks and augmented with elastic weight consolidation (EWC), is able to retain performance on the new task similar to that of an algorithm without EWC, whilst being able to overcome catastrophic forgetting in the two previous tasks. We show that a multi-task Asynchronous Advantage Actor-Critic (GA3C) algorithm, trained on Space Invaders and Demon Attack, is in fact able to outperform two single-task GA3C versions, each trained individually on one task, when evaluated on a new, third task—namely, Phoenix. We also show that, when training two trained multi-task GA3C algorithms on the third task, if one is augmented with EWC, it is not only able to achieve similar performance on the new task, but also capable of overcoming a substantial amount of catastrophic forgetting on the two previous tasks.
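The EWC regularizer that counters catastrophic forgetting in the abstract above can be sketched in a few lines. After training on a previous task, the parameters and a Fisher-information estimate are stored, and new-task training is anchored toward weights that mattered for the old task. The penalty strength `lam` below is an illustrative hyperparameter, not a value from the paper:

```python
import numpy as np

def ewc_penalty(theta, theta_old, fisher, lam=0.4):
    """L_EWC = (lam / 2) * sum_i F_i * (theta_i - theta_old_i)^2

    theta:     current parameter vector (being trained on the new task)
    theta_old: parameters saved after the previous task
    fisher:    per-parameter Fisher-information estimate (importance weights)
    """
    return 0.5 * lam * np.sum(fisher * (theta - theta_old) ** 2)

def total_loss(new_task_loss, theta, theta_old, fisher, lam=0.4):
    # New-task objective plus the quadratic anchor toward the old weights:
    # parameters with high Fisher information are expensive to move, which
    # is what mitigates catastrophic forgetting.
    return new_task_loss + ewc_penalty(theta, theta_old, fisher, lam)
```

The penalty vanishes when the parameters have not moved, and grows quadratically with displacement weighted by each parameter's estimated importance to the old task.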

