Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques

International Journal of General Systems ◽

10.1080/03081079.2014.883387 ◽

2014 ◽

Vol 43 (6) ◽

pp. 649-669 ◽

Author(s):

Abhijit Gosavi

Keyword(s):

Dynamic Programming ◽

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Techniques ◽

Markov Decision

Download Full-text

Model-Free Reinforcement Learning for Branching Markov Decision Processes

Computer Aided Verification - Lecture Notes in Computer Science ◽

10.1007/978-3-030-81688-9_30 ◽

2021 ◽

pp. 651-673

Author(s):

Ernst Moritz Hahn ◽

Mateo Perez ◽

Sven Schewe ◽

Fabio Somenzi ◽

Ashutosh Trivedi ◽

...

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Markov Decision Processes ◽

Control Strategy ◽

Natural Extension ◽

Decision Processes ◽

Optimal Control Strategy ◽

Learning Techniques ◽

Markov Decision

AbstractWe study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs). The state of a (discrete-time) BMCs is a collection of entities of various types that, while spawning other entities, generate a payoff. In comparison with BMCs, where the evolution of a each entity of the same type follows the same probabilistic pattern, BMDPs allow an external controller to pick from a range of options. This permits us to study the best/worst behaviour of the system. We generalise model-free reinforcement learning techniques to compute an optimal control strategy of an unknown BMDP in the limit. We present results of an implementation that demonstrate the practicality of the approach.

Download Full-text

Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning

2020 59th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc42340.2020.9303982 ◽

2020 ◽

Author(s):

Yu Wang ◽

Nima Roohi ◽

Matthew West ◽

Mahesh Viswanathan ◽

Geir E. Dullerud

Keyword(s):

Reinforcement Learning ◽

Model Checking ◽

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision

Download Full-text

Average Reward Reinforcement Learning for Semi-Markov Decision Processes

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-70087-8_79 ◽

2017 ◽

pp. 768-777

Author(s):

Jiayuan Yang ◽

Yanjie Li ◽

Haoyao Chen ◽

Jiangang Li

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Average Reward ◽

Markov Decision

Download Full-text

A dynamic programming algorithm for decentralized Markov decision processes with a broadcast structure

49th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc.2010.5718187 ◽

2010 ◽

Author(s):

Jeff Wu ◽

Sanjay Lall

Keyword(s):

Dynamic Programming ◽

Markov Decision Processes ◽

Dynamic Programming Algorithm ◽

Decision Processes ◽

Programming Algorithm ◽

Markov Decision

Download Full-text

RVI reinforcement learning for semi-Markov decision processes with average reward

2010 8th World Congress on Intelligent Control and Automation ◽

10.1109/wcica.2010.5554785 ◽

2010 ◽

Author(s):

Yanjie Li ◽

Fang Cao

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Average Reward ◽

Markov Decision

Download Full-text

A pulse neural network reinforcement learning algorithm for partially observable Markov decision processes

Systems and Computers in Japan ◽

10.1002/scj.10645 ◽

2005 ◽

Vol 36 (3) ◽

pp. 42-52 ◽

Author(s):

Koichiro Takita ◽

Masafumi Hagiwara

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Markov Decision Processes ◽

Learning Algorithm ◽

Decision Processes ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Reinforcement Learning Algorithm

Download Full-text

Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes

Discrete Event Dynamic Systems ◽

10.1007/s10626-006-0003-y ◽

2007 ◽

Vol 17 (1) ◽

pp. 23-52 ◽

Author(s):

Mohammed Shahid Abdulla ◽

Shalabh Bhatnagar

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Average Cost ◽

Decision Processes ◽

Markov Decision

Download Full-text

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Technometrics ◽

10.1080/00401706.1995.10484354 ◽

1995 ◽

Vol 37 (3) ◽

pp. 353-353 ◽

Author(s):

Laurence A. Baxter

Keyword(s):

Dynamic Programming ◽

Markov Decision Processes ◽

Stochastic Dynamic Programming ◽

Decision Processes ◽

Stochastic Dynamic ◽

Markov Decision

Download Full-text

Learning and Optimal Control of Imprecise Markov Decision Processes by Dynamic Programming Using the Imprecise Dirichlet Model

Soft Methodology and Random Information Systems ◽

10.1007/978-3-540-44465-7_16 ◽

2004 ◽

pp. 141-148 ◽

Author(s):

Matthias C. M. Troffaes

Keyword(s):

Optimal Control ◽

Dynamic Programming ◽

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision ◽

Imprecise Dirichlet Model ◽

Dirichlet Model

Download Full-text

Adaptive Honeypot Engagement Through Reinforcement Learning of Semi-Markov Decision Processes

Lecture Notes in Computer Science - Decision and Game Theory for Security ◽

10.1007/978-3-030-32430-8_13 ◽

2019 ◽

pp. 196-216 ◽

Author(s):

Linan Huang ◽

Quanyan Zhu

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision

Download Full-text