Direct Policy Search Reinforcement Learning Based on Variational Bayesian Inference

Author(s):  
Nobuhiko Yamaguchi

Direct policy search is a promising reinforcement learning framework, particularly for controlling continuous, high-dimensional systems. Peters et al. proposed reward-weighted regression (RWR) as a direct policy search method. The RWR algorithm estimates the policy parameters via the expectation-maximization (EM) algorithm and is therefore prone to overfitting. In this study, we focus on variational Bayesian inference to avoid overfitting and propose direct policy search reinforcement learning based on variational Bayesian inference (VBRL). The performance of the proposed VBRL is assessed in several experiments involving a mountain car and a ball-batting task. These experiments demonstrate that VBRL yields a higher average return and outperforms RWR.
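The core of the RWR update described above can be sketched as a reward-weighted least-squares fit of a linear-Gaussian policy. This is illustrative only: the linear policy model, the exponential reward transformation, and the parameter `beta` are assumptions for the sketch, not details taken from the papers.

```python
import numpy as np

def rwr_update(states, actions, returns, beta=5.0):
    """One reward-weighted regression (RWR) update (illustrative sketch).

    Each sample is weighted by an exponential transformation of its return
    (a common choice; the exact transformation is an assumption here), and
    the new policy mean is the weighted least-squares fit of actions on
    states -- the M-step of the EM interpretation of RWR.
    """
    # Subtracting the max keeps the exponent numerically stable.
    w = np.exp(beta * (returns - returns.max()))
    sw = np.sqrt(w)[:, None]
    # Weighted least squares: theta = argmin sum_i w_i ||a_i - s_i theta||^2
    theta, *_ = np.linalg.lstsq(sw * states, sw * actions, rcond=None)
    return theta

# Toy usage: 1-D state and action, reward peaks when a = 2*s,
# so the fitted policy slope should move toward 2.
rng = np.random.default_rng(0)
s = rng.normal(size=(200, 1))
a = rng.normal(size=(200, 1))
r = -(a - 2.0 * s).ravel() ** 2
theta = rwr_update(s, a, r)
```

VBRL, as described in the abstract, replaces this point estimate of the policy parameters with a variational posterior, which regularizes the fit and mitigates the overfitting of plain EM.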

2011
Vol 23 (11)
pp. 2798-2832
Author(s):  
Hirotaka Hachiya
Jan Peters
Masashi Sugiyama

Direct policy search is a promising reinforcement learning framework, in particular for controlling continuous, high-dimensional systems. Policy search often requires a large number of samples for obtaining a stable policy update estimator, which is prohibitive when sampling is expensive. In this letter, we extend an expectation-maximization-based policy search method so that previously collected samples can be efficiently reused. The usefulness of the proposed method, reward-weighted regression with sample reuse (R³), is demonstrated through robot learning experiments. (This letter is an extended version of our earlier conference paper: Hachiya, Peters, & Sugiyama, 2009.)
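The sample-reuse idea above is commonly realized with importance weighting: samples collected under an earlier policy are reweighted by the ratio of the current policy density to the old one before the reward-weighted fit. The sketch below is a minimal illustration under assumed details (a 1-D linear-Gaussian policy with fixed noise `sigma`, an exponential reward transformation); it is not the paper's exact R³ estimator.

```python
import numpy as np

def gauss_pdf(a, mean, sigma):
    """Density of N(mean, sigma^2) evaluated at a."""
    return np.exp(-0.5 * ((a - mean) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def rwr_update_with_reuse(states, actions, returns,
                          theta_old, theta_cur, sigma=0.5, beta=5.0):
    """Reward-weighted regression with importance-weighted sample reuse (sketch).

    `actions` were drawn under the old policy N(s @ theta_old, sigma^2);
    the importance weights correct the reward-weighted fit so it targets
    the current policy N(s @ theta_cur, sigma^2).
    """
    iw = (gauss_pdf(actions, states @ theta_cur, sigma)
          / gauss_pdf(actions, states @ theta_old, sigma))
    # Combine importance weights with the exponential reward weights.
    w = iw.ravel() * np.exp(beta * (returns - returns.max()))
    sw = np.sqrt(w)[:, None]
    theta, *_ = np.linalg.lstsq(sw * states, sw * actions, rcond=None)
    return theta

# Toy usage: samples from an old policy with slope 0.5, reward peaks at a = 2*s.
rng = np.random.default_rng(1)
s = rng.normal(size=(300, 1))
theta_old = np.array([[0.5]])
a = s @ theta_old + 0.5 * rng.normal(size=(300, 1))
r = -(a - 2.0 * s).ravel() ** 2
theta_new = rwr_update_with_reuse(s, a, r, theta_old, theta_old)
```

With `theta_cur = theta_old` the importance weights are all one and the update reduces to plain RWR on the reused batch; in an iterative scheme the weights correct for the growing mismatch between the sampling policy and the current one.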


2021
Vol 150 (4)
pp. A154-A154
Author(s):  
Yongsung Park ◽  
Florian Meyer ◽  
Peter Gerstoft
