Batch mode reinforcement learning based on the synthesis of artificial trajectories

Annals of Operations Research ◽

10.1007/s10479-012-1248-5 ◽

2012 ◽

Vol 208 (1) ◽

pp. 383-416 ◽

Author(s):

Raphael Fonteneau ◽

Susan A. Murphy ◽

Louis Wehenkel ◽

Damien Ernst

Keyword(s):

Reinforcement Learning ◽


On periodic reference tracking using batch-mode reinforcement learning with application to gene regulatory network control

52nd IEEE Conference on Decision and Control ◽

10.1109/cdc.2013.6760515 ◽

2013 ◽

Author(s):

Aivar Sootla ◽

Natalja Strelkowa ◽

Damien Ernst ◽

Mauricio Barahona ◽

Guy-Bart Stan

Keyword(s):

Reinforcement Learning ◽

Gene Regulatory Network ◽

Regulatory Network ◽

Network Control ◽

Reference Tracking ◽

Gene Regulatory


Reinforcement Learning for Electric Vehicle Charging using Dueling Neural Networks

10.20944/preprints202103.0592.v1 ◽

2021 ◽

Author(s):

Gargya Gokhale ◽

Bert Claessens ◽

Chris Develder

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Cost Minimization ◽

Renewable Energy Sources ◽

Regression Technique ◽

Main Research ◽

Smart Charging ◽

We consider the problem of coordinating the charging of an entire fleet of electric vehicles (EVs) using a model-free approach, i.e., purely data-driven reinforcement learning (RL). The objective of the RL-based control is to optimize charging actions while fulfilling all EV charging constraints (e.g., timely completion of the charging). In particular, we focus on batch-mode learning and adopt fitted Q-iteration (FQI). A core component of FQI is approximating the Q-function with a regression technique, from which the policy is derived. Recently, a dueling neural network architecture was proposed and shown to lead to better policy evaluation in the presence of many similar-valued actions, as applied in a computer game context. The main research contributions of the current paper are that (i) we develop a dueling neural networks approach for the setting of joint coordination of an entire EV fleet, and (ii) we evaluate its performance and compare it to an all-knowing benchmark and to an FQI approach using the EXTRA trees regression technique, a popular approach currently discussed in EV-related works. We present a case study where RL agents are trained with an epsilon-greedy approach for two different objectives: (a) cost minimization and (b) maximization of self-consumption of local renewable energy sources. Our results indicate that RL agents achieve significant cost reductions (70-80%) compared to a business-as-usual scenario without smart charging. Comparing the dueling neural networks regression to EXTRA trees indicates that, for our case study's EV fleet parameters and training scenario, the EXTRA trees-based agents achieve higher performance in terms of both lower costs (or higher self-consumption) and stronger robustness, i.e., less variation among trained agents. This suggests that adopting dueling neural networks in this EV setting is not particularly beneficial, in contrast to the Atari game context where the idea originated.
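
The fitted Q-iteration loop named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`fit_table`, `fitted_q_iteration`, `greedy_policy`), the two-state toy batch, and the discount factor are all hypothetical, and the regression step is replaced by a per-(state, action) average so that the example needs only the standard library. In the setting the abstract describes, that step would instead be a tree ensemble or a dueling neural network.

```python
from collections import defaultdict


def fit_table(samples):
    """Toy stand-in for the regressor: average the targets of each (state, action) key."""
    sums, counts = defaultdict(float), defaultdict(int)
    for key, target in samples:
        sums[key] += target
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def fitted_q_iteration(batch, actions, gamma=0.9, n_iterations=50):
    """Batch-mode FQI over a fixed list of (s, a, r, s_next, done) transitions."""
    q = {}  # Q_0 is zero everywhere (missing keys read as 0.0)
    for _ in range(n_iterations):
        training_set = []
        for s, a, r, s_next, done in batch:
            # Bootstrapped target: reward plus discounted best value of the next state.
            best_next = 0.0 if done else max(q.get((s_next, b), 0.0) for b in actions)
            training_set.append(((s, a), r + gamma * best_next))
        # Regression step: refit the Q-function approximation on the new targets.
        q = fit_table(training_set)
    return q


def greedy_policy(q, state, actions):
    """Derive the policy from the fitted Q-function by picking the best-valued action."""
    return max(actions, key=lambda a: q.get((state, a), 0.0))


# Hypothetical toy batch: moving right from state 1 ends the episode with reward 1.
ACTIONS = ["left", "right"]
BATCH = [
    (0, "right", 0.0, 1, False),
    (1, "right", 1.0, 2, True),
    (0, "left", 0.0, 0, False),
    (1, "left", 0.0, 0, False),
]

q_values = fitted_q_iteration(BATCH, ACTIONS)
print(greedy_policy(q_values, 0, ACTIONS))
```

The batch is collected once and never extended, which is what makes the procedure batch-mode: each iteration only recomputes the targets from the previous Q-function estimate and refits the regressor on them.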


Efficient Batch-Mode Reinforcement Learning Using Extreme Learning Machines

IEEE Transactions on Systems Man and Cybernetics Systems ◽

10.1109/tsmc.2019.2926806 ◽

2019 ◽

pp. 1-14

Author(s):

Jiahang Liu ◽

Lei Zuo ◽

Xin Xu ◽

Xinglong Zhang ◽

Junkai Ren ◽

...

Keyword(s):

Reinforcement Learning ◽

Extreme Learning Machines ◽

Learning Machines


Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

SIAM Journal on Control and Optimization ◽

10.1137/120867263 ◽

2013 ◽

Vol 51 (5) ◽

pp. 3355-3385 ◽

Author(s):

R. Fonteneau ◽

D. Ernst ◽

B. Boigelot ◽

Q. Louveaux

Keyword(s):

Reinforcement Learning ◽

Relaxation Schemes


Optimal Sample Selection for Batch-Mode Reinforcement Learning

Proceedings of the 3rd International Conference on Agents and Artificial Intelligence ◽

10.5220/0003133500410050 ◽

2011 ◽

Keyword(s):

Reinforcement Learning ◽

Sample Selection ◽

Optimal Sample ◽


Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets

Lecture Notes in Computer Science - Recent Advances in Reinforcement Learning ◽

10.1007/978-3-540-89722-4_7 ◽

2008 ◽

pp. 82-95 ◽

Author(s):

Thomas Gabel ◽

Martin Riedmiller

Keyword(s):

Reinforcement Learning ◽

Learning Methods ◽


Supplemental Material for Reconciling Reinforcement Learning Models With Behavioral Extinction and Renewal: Implications for Addiction, Relapse, and Problem Gambling

Psychological Review ◽

10.1037/0033-295x.114.3.784.supp ◽

2007 ◽

Keyword(s):

Reinforcement Learning ◽

Problem Gambling ◽

Learning Models ◽

Behavioral Extinction ◽

Reinforcement Learning Models


Bayes factors for reinforcement-learning models of the Iowa gambling task

Decision ◽

10.1037/dec0000040 ◽

2016 ◽

Vol 3 (2) ◽

pp. 115-131 ◽

Author(s):

Helen Steingroever ◽

Ruud Wetzels ◽

Eric-Jan Wagenmakers

Keyword(s):

Reinforcement Learning ◽

Iowa Gambling Task ◽

Bayes Factors ◽

Gambling Task ◽

Learning Models ◽

Reinforcement Learning Models


Analogical Reinforcement Learning With Two-Stage Memory Retrieval

PsycEXTRA Dataset ◽

10.1037/e528942014-705 ◽

2014 ◽

Author(s):

James Foster ◽

Matt Jones

Keyword(s):

Reinforcement Learning ◽

Memory Retrieval ◽


Effects of Working Memory Capacity on the Speed and Accuracy of Learning in Reinforcement Learning Models

PsycEXTRA Dataset ◽

10.1037/e528942014-552 ◽

2014 ◽

Author(s):

Adnane Ez-Zizi ◽

Simon Farrell ◽

David Leslie

Keyword(s):

Working Memory ◽

Reinforcement Learning ◽

Working Memory Capacity ◽

Memory Capacity ◽

Learning Models ◽

Reinforcement Learning Models ◽

Speed And Accuracy
