scholarly journals Survivable Robotic Control through Guided Bayesian Policy Search with Deep Reinforcement Learning

Author(s):  
Sayyed Jaffar Ali Raza ◽  
Apan Dastider ◽  
Mingjie Lin
2021 ◽  
Author(s):  
N. Snehal ◽  
W. Pooja ◽  
K. Sonam ◽  
S. R. Wagh ◽  
N. M. Singh

Author(s):  
Abhinav Verma

We study the problem of generating interpretable and verifiable policies for Reinforcement Learning (RL). Unlike the popular Deep Reinforcement Learning (DRL) paradigm, in which the policy is represented by a neural network, the aim of this work is to find policies that can be represented in highlevel programming languages. Such programmatic policies have several benefits, including being more easily interpreted than neural networks, and being amenable to verification by scalable symbolic methods. The generation methods for programmatic policies also provide a mechanism for systematically using domain knowledge for guiding the policy search. The interpretability and verifiability of these policies provides the opportunity to deploy RL based solutions in safety critical environments. This thesis draws on, and extends, work from both the machine learning and formal methods communities.


2011 ◽  
Vol 23 (11) ◽  
pp. 2798-2832 ◽  
Author(s):  
Hirotaka Hachiya ◽  
Jan Peters ◽  
Masashi Sugiyama

Direct policy search is a promising reinforcement learning framework, in particular for controlling continuous, high-dimensional systems. Policy search often requires a large number of samples for obtaining a stable policy update estimator, and this is prohibitive when the sampling cost is expensive. In this letter, we extend an expectation-maximization-based policy search method so that previously collected samples can be efficiently reused. The usefulness of the proposed method, reward-weighted regression with sample reuse (R[Formula: see text]), is demonstrated through robot learning experiments. (This letter is an extended version of our earlier conference paper: Hachiya, Peters, & Sugiyama, 2009 .)


2014 ◽  
Vol 97 (3) ◽  
pp. 327-351 ◽  
Author(s):  
Róbert Busa-Fekete ◽  
Balázs Szörényi ◽  
Paul Weng ◽  
Weiwei Cheng ◽  
Eyke Hüllermeier

Sign in / Sign up

Export Citation Format

Share Document