Multi-task Learning and Catastrophic Forgetting in Continual Reinforcement Learning

Mapping Intimacies ◽

10.29007/g7bg ◽

2019 ◽

Author(s):

João Ribeiro ◽

Francisco Melo ◽

João Dias

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Single Task ◽

Similar Performance ◽

The Third ◽

Task Learning ◽

Multiple Tasks ◽

Similar Task ◽

Reinforcement Learning Algorithm

In this paper we investigate two hypothesis regarding the use of deep reinforcement learning in multiple tasks. The first hypothesis is driven by the question of whether a deep reinforcement learning algorithm, trained on two similar tasks, is able to outperform two single-task, individually trained algorithms, by more efficiently learning a new, similar task, that none of the three algorithms has encountered before. The second hypothesis is driven by the question of whether the same multi-task deep RL algorithm, trained on two similar tasks and augmented with elastic weight consolidation (EWC), is able to retain similar performance on the new task, as a similar algorithm without EWC, whilst being able to overcome catastrophic forgetting in the two previous tasks. We show that a multi-task Asynchronous Advantage Actor-Critic (GA3C) algorithm, trained on Space Invaders and Demon Attack, is in fact able to outperform two single-tasks GA3C versions, trained individually for each single-task, when evaluated on a new, third task—namely, Phoenix. We also show that, when training two trained multi-task GA3C algorithms on the third task, if one is augmented with EWC, it is not only able to achieve similar performance on the new task, but also capable of overcoming a substantial amount of catastrophic forgetting on the two previous tasks.

Download Full-text

Multi-Task Deep Reinforcement Learning for Continuous Action Control

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/461 ◽

2017 ◽

Cited By ~ 9

Author(s):

Zhaoyang Yang ◽

Kathryn Merrick ◽

Hussein Abbass ◽

Lianwen Jin

Keyword(s):

Reinforcement Learning ◽

Network Architecture ◽

Learning Algorithm ◽

Learning Algorithms ◽

Action Control ◽

Learning Performance ◽

Sensor Data ◽

Continuous Action ◽

Single Task ◽

Multiple Tasks

In this paper, we propose a deep reinforcement learning algorithm to learn multiple tasks concurrently. A new network architecture is proposed in the algorithm which reduces the number of parameters needed by more than 75% per task compared to typical single-task deep reinforcement learning algorithms. The proposed algorithm and network fuse images with sensor data and were tested with up to 12 movement-based control tasks on a simulated Pioneer 3AT robot equipped with a camera and range sensors. Results show that the proposed algorithm and network can learn skills that are as good as the skills learned by a comparable single-task learning algorithm. Results also show that learning performance is consistent even when the number of tasks and the number of constraints on the tasks increased.

Download Full-text

Model dependent reinforcement learning algorithm for reservoir operation stochastic optimization

International Journal of Hydrology ◽

10.15406/ijh.2018.02.00129 ◽

2018 ◽

Vol 2 (5) ◽

Author(s):

Li Wenwu

Keyword(s):

Reinforcement Learning ◽

Stochastic Optimization ◽

Reservoir Operation ◽

Learning Algorithm ◽

Reinforcement Learning Algorithm

Download Full-text

Reinforcement learning algorithm for one-warehouse multi-retailer inventory problem

Automation, Mechanical and Electrical Engineering ◽

10.2495/amee140161 ◽

2014 ◽

Author(s):

C.Y. Li ◽

X.T. Wang ◽

T.W. Zhang

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Inventory Problem ◽

Reinforcement Learning Algorithm

Download Full-text

Intelligent Energy Management Strategy Based on an Improved Reinforcement Learning Algorithm With Exploration Factor for a Plug-in PHEV

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2021.3085710 ◽

2021 ◽

pp. 1-11

Author(s):

Xinyou Lin ◽

Kuncheng Zhou ◽

Liping Mo ◽

Hailin Li

Keyword(s):

Reinforcement Learning ◽

Energy Management ◽

Management Strategy ◽

Learning Algorithm ◽

Energy Management Strategy ◽

Reinforcement Learning Algorithm

Download Full-text

A multi-objective reinforcement learning algorithm for deadline constrained scientific workflow scheduling in clouds

Frontiers of Computer Science ◽

10.1007/s11704-020-9273-z ◽

2021 ◽

Vol 15 (5) ◽

Author(s):

Yao Qin ◽

Hua Wang ◽

Shanwen Yi ◽

Xiaole Li ◽

Linbo Zhai

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Scientific Workflow ◽

Workflow Scheduling ◽

Multi Objective ◽

Reinforcement Learning Algorithm

Download Full-text

Optimization of PV Energy Conversion System Using Reinforcement Learning Algorithm

2020 20th International Conference on Sciences and Techniques of Automatic Control and Computer Engineering (STA) ◽

10.1109/sta50679.2020.9329331 ◽

2020 ◽

Author(s):

Mohamed Ali Zeddini ◽

Mourad Turki ◽

Mohamed Faouzi Mimoun

Keyword(s):

Reinforcement Learning ◽

Energy Conversion ◽

Learning Algorithm ◽

Conversion System ◽

Energy Conversion System ◽

Reinforcement Learning Algorithm

Download Full-text

Enhancing Energy Trading Between Different Islanded Microgrids A Reinforcement Learning Algorithm Case Study in Northern Kordofan State

2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE) ◽

10.1109/iccceee49695.2021.9429584 ◽

2021 ◽

Author(s):

Moayad ELamin ◽

Fay Elhassan ◽

Mahmoud A. Manzoul

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Energy Trading ◽

Reinforcement Learning Algorithm

Download Full-text

Solving flow-shop scheduling problem with a reinforcement learning algorithm that generalizes the value function with neural network

Alexandria Engineering Journal ◽

10.1016/j.aej.2021.01.030 ◽

2021 ◽

Vol 60 (3) ◽

pp. 2787-2800

Author(s):

Jianfeng Ren ◽

Chunming Ye ◽

Feng Yang

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Value Function ◽

Flow Shop ◽

Learning Algorithm ◽

Flow Shop Scheduling ◽

Scheduling Problem ◽

Shop Scheduling ◽

The Value Function ◽

Reinforcement Learning Algorithm

Download Full-text

A real-time HIL control system on rotary inverted pendulum hardware platform based on double deep Q-network

Measurement and Control ◽

10.1177/00202940211000380 ◽

2021 ◽

Vol 54 (3-4) ◽

pp. 417-428

Author(s):

Yanyan Dai ◽

KiDong Lee ◽

SukGyu Lee

Keyword(s):

Control System ◽

Reinforcement Learning ◽

Inverted Pendulum ◽

Learning Algorithm ◽

Deep Understanding ◽

Control Engineering ◽

Experience Replay ◽

Real Hardware ◽

Rotary Inverted Pendulum ◽

Reinforcement Learning Algorithm

For real applications, rotary inverted pendulum systems have been known as the basic model in nonlinear control systems. If researchers have no deep understanding of control, it is difficult to control a rotary inverted pendulum platform using classic control engineering models, as shown in section 2.1. Therefore, without classic control theory, this paper controls the platform by training and testing reinforcement learning algorithm. Many recent achievements in reinforcement learning (RL) have become possible, but there is a lack of research to quickly test high-frequency RL algorithms using real hardware environment. In this paper, we propose a real-time Hardware-in-the-loop (HIL) control system to train and test the deep reinforcement learning algorithm from simulation to real hardware implementation. The Double Deep Q-Network (DDQN) with prioritized experience replay reinforcement learning algorithm, without a deep understanding of classical control engineering, is used to implement the agent. For the real experiment, to swing up the rotary inverted pendulum and make the pendulum smoothly move, we define 21 actions to swing up and balance the pendulum. Comparing Deep Q-Network (DQN), the DDQN with prioritized experience replay algorithm removes the overestimate of Q value and decreases the training time. Finally, this paper shows the experiment results with comparisons of classic control theory and different reinforcement learning algorithms.

Download Full-text

A multi-agent reinforcement learning algorithm with fuzzy approximation for Distributed Stochastic Unit Commitment

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-182879 ◽

2019 ◽

Vol 37 (5) ◽

pp. 6613-6628

Author(s):

Ghorbani Farzaneh ◽

Afsharchi Mohsen ◽

Derhami Vali

Keyword(s):

Reinforcement Learning ◽

Unit Commitment ◽

Learning Algorithm ◽

Fuzzy Approximation ◽

Multi Agent ◽

Stochastic Unit Commitment ◽

Reinforcement Learning Algorithm

Download Full-text