First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors

Journal of Applied Probability ◽

10.1017/s0021900200012560 ◽

2015 ◽

Vol 52 (2) ◽

pp. 441-456 ◽

Author(s):

Xiao Wu ◽

Xianping Guo

Keyword(s):

Markov Decision Processes ◽

Discrete Time ◽

Infinite Horizon ◽

Decision Processes ◽

Previous Literature ◽

First Passage ◽

Discount Factors ◽

Markov Decision ◽

The Difference ◽

Minimisation Problem

This paper deals with the first passage optimality and variance minimisation problems of discrete-time Markov decision processes (MDPs) with varying discount factors and unbounded rewards/costs. First, under suitable conditions slightly weaker than those in the previous literature on the standard (infinite horizon) discounted MDPs, we establish the existence and characterisation of the first passage expected-optimal stationary policies. Second, to further distinguish the expected-optimal stationary policies, we introduce the variance minimisation problem, prove that it is equivalent to a new first passage optimality problem of MDPs, and, thus, show the existence of a variance-optimal policy that minimises the variance over the set of all first passage expected-optimal stationary policies. Finally, we use a computable example to illustrate our main results and also to show the difference between the first passage optimality here and the standard discount optimality of MDPs in the previous literature.

First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors

Journal of Applied Probability ◽

10.1239/jap/1437658608 ◽

2015 ◽

Vol 52 (2) ◽

pp. 441-456 ◽

Author(s):

Xiao Wu ◽

Xianping Guo

Keyword(s):

Markov Decision Processes ◽

Discrete Time ◽

Infinite Horizon ◽

Decision Processes ◽

Previous Literature ◽

First Passage ◽

Discount Factors ◽

Markov Decision ◽

The Difference ◽

Minimisation Problem

Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors

Discrete Event Dynamic Systems ◽

10.1007/s10626-014-0209-3 ◽

2015 ◽

Vol 26 (4) ◽

pp. 669-683 ◽

Author(s):

Xiao Wu ◽

Junyu Zhang

Keyword(s):

Markov Decision Processes ◽

Discrete Time ◽

Decision Processes ◽

First Passage ◽

Discount Factors ◽

Markov Decision ◽

Finite Approximation

An application to the finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors

Proceeding of the 11th World Congress on Intelligent Control and Automation ◽

10.1109/wcica.2014.7052984 ◽

2014 ◽

Author(s):

Xiao Wu ◽

Junyu Zhang

Keyword(s):

Markov Decision Processes ◽

Discrete Time ◽

Decision Processes ◽

First Passage ◽

Discount Factors ◽

Markov Decision ◽

Finite Approximation

First Passage Optimality for Continuous-Time Markov Decision Processes With Varying Discount Factors and History-Dependent Policies

IEEE Transactions on Automatic Control ◽

10.1109/tac.2013.2281475 ◽

2014 ◽

Vol 59 (1) ◽

pp. 163-174 ◽

Author(s):

Xianping Guo ◽

Xinyuan Song ◽

Yi Zhang

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

First Passage ◽

Discount Factors ◽

Markov Decision

Infinite horizon Markov decision processes with unknown or variable discount factors

European Journal of Operational Research ◽

10.1016/0377-2217(87)90174-3 ◽

1987 ◽

Vol 28 (1) ◽

pp. 96-100 ◽

Author(s):

D.J. White

Keyword(s):

Markov Decision Processes ◽

Infinite Horizon ◽

Decision Processes ◽

Discount Factors ◽

Markov Decision

First passage Markov decision processes with constraints and varying discount factors

Frontiers of Mathematics in China ◽

10.1007/s11464-015-0479-6 ◽

2015 ◽

Vol 10 (4) ◽

pp. 1005-1023 ◽

Author(s):

Xiao Wu ◽

Xiaolong Zou ◽

Xianping Guo

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

First Passage ◽

Discount Factors ◽

Markov Decision

Approximate Optimal Cost and Policies of First Passage Markov Decision Processes with Countable-State Space and Discount Factors

Proceedings of IncoME-V & CEPE Net-2020 - Mechanisms and Machine Science ◽

10.1007/978-3-030-75793-9_5 ◽

2021 ◽

pp. 39-49

Author(s):

Xiao Wu ◽

Yanqiu Tang

Keyword(s):

State Space ◽

Markov Decision Processes ◽

Decision Processes ◽

First Passage ◽

Countable State Space ◽

Optimal Cost ◽

Discount Factors ◽

Countable State ◽

Markov Decision

A Vector Minimum Superharmonic Approach to Solving Infinite-Horizon Discounted Markov Decision Processes

Journal of the Operational Research Society ◽

10.1038/sj/jors/0431109 ◽

1992 ◽

Vol 43 (11) ◽

pp. 1095-1102

Author(s):

D.J. White

Keyword(s):

Markov Decision Processes ◽

Infinite Horizon ◽

Decision Processes ◽

Markov Decision

A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion

SIAM Journal on Control and Optimization ◽

10.1137/19m1255811 ◽

2020 ◽

Vol 58 (4) ◽

pp. 2535-2566

Author(s):

François Dufour ◽

Alexandre Genadot

Keyword(s):

Convex Programming ◽

Markov Decision Processes ◽

Discrete Time ◽

Decision Processes ◽

Programming Approach ◽

Total Reward ◽

Markov Decision ◽

Reward Criterion

First passage risk probability optimality for continuous time Markov decision processes

Kybernetika ◽

10.14736/kyb-2019-1-0114 ◽

2019 ◽

pp. 114-133

Author(s):

Haifeng Huo ◽

Xian Wen

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

First Passage ◽

Risk Probability ◽

Markov Decision
