Reinforcement Learning for Control Using Value Function Approximation

Encyclopedia of Systems and Control ◽

10.1007/978-3-030-44184-5_100067 ◽

2021 ◽

pp. 1868-1873

Author(s):

Konstantinos Gatsis ◽

George J. Pappas

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation

Download Full-text

Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence ◽

10.1016/j.artint.2007.08.001 ◽

2008 ◽

Vol 172 (4-5) ◽

pp. 454-482 ◽

Author(s):

André da Motta Salles Barreto ◽

Charles W. Anderson

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Gradient Descent ◽

Value Function ◽

Value Function Approximation ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text

Reinforcement Learning for Control Using Value Function Approximation

Encyclopedia of Systems and Control ◽

10.1007/978-1-4471-5102-9_100067-1 ◽

2020 ◽

pp. 1-6

Author(s):

Konstantinos Gatsis ◽

George J. Pappas

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation

Download Full-text

CBR for State Value Function Approximation in Reinforcement Learning

Case-Based Reasoning Research and Development - Lecture Notes in Computer Science ◽

10.1007/11536406_18 ◽

2005 ◽

pp. 206-221 ◽

Author(s):

Thomas Gabel ◽

Martin Riedmiller

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation

Download Full-text

On Convergence Rate of Adaptive Multiscale Value Function Approximation for Reinforcement Learning

2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP) ◽

10.1109/mlsp.2019.8918816 ◽

2019 ◽

Author(s):

Tao Li ◽

Quanyan Zhu

Keyword(s):

Reinforcement Learning ◽

Convergence Rate ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation

Download Full-text

A Clustering-Based Graph Laplacian Framework for Value Function Approximation in Reinforcement Learning

IEEE Transactions on Cybernetics ◽

10.1109/tcyb.2014.2311578 ◽

2014 ◽

Vol 44 (12) ◽

pp. 2613-2625 ◽

Author(s):

Xin Xu ◽

Zhenhua Huang ◽

Daniel Graves ◽

Witold Pedrycz

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Graph Laplacian ◽

Value Function Approximation

Download Full-text

Adaptive importance sampling for value function approximation in off-policy reinforcement learning

Neural Networks ◽

10.1016/j.neunet.2009.01.002 ◽

2009 ◽

Vol 22 (10) ◽

pp. 1399-1410 ◽

Author(s):

Hirotaka Hachiya ◽

Takayuki Akiyama ◽

Masashi Sugiayma ◽

Jan Peters

Keyword(s):

Reinforcement Learning ◽

Importance Sampling ◽

Function Approximation ◽

Value Function ◽

Adaptive Importance Sampling ◽

Value Function Approximation

Download Full-text

Kernelized value function approximation for reinforcement learning

Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09 ◽

10.1145/1553374.1553504 ◽

2009 ◽

Author(s):

Gavin Taylor ◽

Ronald Parr

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation

Download Full-text

Efficient Value Function Approximation with Unsupervised Hierarchical Categorization for a Reinforcement Learning Agent

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology ◽

10.1109/wi-iat.2010.16 ◽

2010 ◽

Author(s):

Yongjia Wang ◽

John E. Laird

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation ◽

Learning Agent ◽

Hierarchical Categorization

Download Full-text

Online Support Vector Regression based value function approximation for Reinforcement Learning

2009 IEEE International Symposium on Industrial Electronics ◽

10.1109/isie.2009.5222726 ◽

2009 ◽

Author(s):

Dong-Hyun Lee ◽

Vo Van Quang ◽

Sungho Jo ◽

Ju-Jang Lee

Keyword(s):

Reinforcement Learning ◽

Support Vector Regression ◽

Function Approximation ◽

Value Function ◽

Support Vector ◽

Online Support ◽

Value Function Approximation ◽

Online Support Vector Regression

Download Full-text

Ensemble Network Architecture for Deep Reinforcement Learning

Mathematical Problems in Engineering ◽

10.1155/2018/2129393 ◽

2018 ◽

Vol 2018 ◽

pp. 1-6 ◽

Author(s):

Xi-liang Chen ◽

Lei Cao ◽

Chen-xi Li ◽

Zhi-xiong Xu ◽

Jun Lai

Keyword(s):

Reinforcement Learning ◽

Network Architecture ◽

Function Approximation ◽

Value Function ◽

Learning Algorithm ◽

Approximation Error ◽

Value Function Approximation ◽

Value Evaluation ◽

Target Values ◽

Classical Control

The popular deepQlearning algorithm is known to be instability because of theQ-value’s shake and overestimation action values under certain conditions. These issues tend to adversely affect their performance. In this paper, we develop the ensemble network architecture for deep reinforcement learning which is based on value function approximation. The temporal ensemble stabilizes the training process by reducing the variance of target approximation error and the ensemble of target values reduces the overestimate and makes better performance by estimating more accurateQ-value. Our results show that this architecture leads to statistically significant better value evaluation and more stable and better performance on several classical control tasks at OpenAI Gym environment.

Download Full-text