Markov decision processes with continuous time parameter

European Journal of Operational Research ◽

10.1016/0377-2217(84)90298-4 ◽

1984 ◽

Vol 16 (3) ◽

pp. 392-393

Author(s):

M. Schäl

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Time Parameter ◽

Markov Decision

Download Full-text

Average optimal policies in Markov decision drift processes with applications to a queueing and a replacement model

Advances in Applied Probability ◽

10.2307/1426437 ◽

1983 ◽

Vol 15 (2) ◽

pp. 274-303 ◽

Author(s):

Arie Hordijk ◽

Frank A. Van Der Duyn Schouten

Keyword(s):

Markov Decision Processes ◽

Optimal Policy ◽

Continuous Time ◽

Sufficient Conditions ◽

Decision Processes ◽

Time Parameter ◽

Queueing Model ◽

Replacement Model ◽

Optimal Policies ◽

Markov Decision

Recently the authors introduced the concept of Markov decision drift processes. A Markov decision drift process can be seen as a straightforward generalization of a Markov decision process with continuous time parameter. In this paper we investigate the existence of stationary average optimal policies for Markov decision drift processes. Using a well-known Abelian theorem we derive sufficient conditions, which guarantee that a ‘limit point' of a sequence of discounted optimal policies with the discounting factor approaching 1 is an average optimal policy. An alternative set of sufficient conditions is obtained for the case in which the discounted optimal policies generate regenerative stochastic processes. The latter set of conditions is easier to verify in several applications. The results of this paper are also applicable to Markov decision processes with discrete or continuous time parameter and to semi-Markov decision processes. In this sense they generalize some well-known results for Markov decision processes with finite or compact action space. Applications to an M/M/1 queueing model and a maintenance replacement model are given. It is shown that under certain conditions on the model parameters the average optimal policy for the M/M/1 queueing model is monotone non-decreasing (as a function of the number of waiting customers) with respect to the service intensity and monotone non-increasing with respect to the arrival intensity. For the maintenance replacement model we prove the average optimality of a bang-bang type policy. Special attention is paid to the computation of the optimal control parameters.

Download Full-text

Markov Decision Processes with Continuous Time Parameter

Journal of the Operational Research Society ◽

10.2307/2581180 ◽

1984 ◽

Vol 35 (4) ◽

pp. 366

Author(s):

Sean Collins ◽

F. A. van der Duyn Schouten

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Time Parameter ◽

Markov Decision

Download Full-text

Markov Decision Processes with Continuous Time Parameter

Journal of the Operational Research Society ◽

10.1057/jors.1984.74 ◽

1984 ◽

Vol 35 (4) ◽

pp. 366-367

Author(s):

Sean Collins

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Time Parameter ◽

Markov Decision

Download Full-text

Average optimal policies in Markov decision drift processes with applications to a queueing and a replacement model

Advances in Applied Probability ◽

10.1017/s0001867800021182 ◽

1983 ◽

Vol 15 (02) ◽

pp. 274-303 ◽

Author(s):

Arie Hordijk ◽

Frank A. Van Der Duyn Schouten

Keyword(s):

Markov Decision Processes ◽

Optimal Policy ◽

Continuous Time ◽

Sufficient Conditions ◽

Decision Processes ◽

Time Parameter ◽

Queueing Model ◽

Replacement Model ◽

Optimal Policies ◽

Markov Decision

Recently the authors introduced the concept of Markov decision drift processes. A Markov decision drift process can be seen as a straightforward generalization of a Markov decision process with continuous time parameter. In this paper we investigate the existence of stationary average optimal policies for Markov decision drift processes. Using a well-known Abelian theorem we derive sufficient conditions, which guarantee that a ‘limit point' of a sequence of discounted optimal policies with the discounting factor approaching 1 is an average optimal policy. An alternative set of sufficient conditions is obtained for the case in which the discounted optimal policies generate regenerative stochastic processes. The latter set of conditions is easier to verify in several applications. The results of this paper are also applicable to Markov decision processes with discrete or continuous time parameter and to semi-Markov decision processes. In this sense they generalize some well-known results for Markov decision processes with finite or compact action space. Applications to an M/M/1 queueing model and a maintenance replacement model are given. It is shown that under certain conditions on the model parameters the average optimal policy for the M/M/1 queueing model is monotone non-decreasing (as a function of the number of waiting customers) with respect to the service intensity and monotone non-increasing with respect to the arrival intensity. For the maintenance replacement model we prove the average optimality of a bang-bang type policy. Special attention is paid to the computation of the optimal control parameters.

Download Full-text

Markov Decision Processes With Continuous Time Parameter.

Journal of the American Statistical Association ◽

10.2307/2287942 ◽

1985 ◽

Vol 80 (390) ◽

pp. 491

Author(s):

Martin L. Puterman ◽

F. A. Van der Duyn Schouten

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Time Parameter ◽

Markov Decision

Download Full-text

Denumerable state continuous time Markov decision processes with unbounded cost and transition rates under average criterion

The ANZIAM Journal ◽

10.1017/s144618110001213x ◽

2002 ◽

Vol 43 (4) ◽

pp. 541-557 ◽

Author(s):

Xianping Guo ◽

Weiping Zhu

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Transition Rates ◽

Birth And Death Processes ◽

Optimality Equation ◽

Average Criterion ◽

Markov Decision ◽

Unbounded Cost ◽

AbstractIn this paper, we consider denumerable state continuous time Markov decision processes with (possibly unbounded) transition and cost rates under average criterion. We present a set of conditions and prove the existence of both average cost optimal stationary policies and a solution of the average optimality equation under the conditions. The results in this paper are applied to an admission control queue model and controlled birth and death processes.

Download Full-text

Continuous-Time Markov Decision Processes

10.1007/978-3-030-54987-9 ◽

2020 ◽

Author(s):

Alexey Piunovskiy ◽

Yi Zhang

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Markov Decision

Download Full-text

Numerical Approximations for Discounted Continuous Time Markov Decision Processes

Modeling, Stochastic Control, Optimization, and Applications - The IMA Volumes in Mathematics and its Applications ◽

10.1007/978-3-030-25498-8_7 ◽

2019 ◽

pp. 147-171

Author(s):

François Dufour ◽

Tomás Prieto-Rumeau

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Numerical Approximations ◽

Markov Decision

Download Full-text

Preferred Rules in Continuous Time Markov Decision Processes

Management Science ◽

10.1287/mnsc.21.3.348 ◽

1974 ◽

Vol 21 (3) ◽

pp. 348-357 ◽

Author(s):

Mark R. Lembersky

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Markov Decision

Download Full-text

First passage risk probability optimality for continuous time Markov decision processes

Kybernetika ◽

10.14736/kyb-2019-1-0114 ◽

2019 ◽

pp. 114-133

Author(s):

Haifeng Huo ◽

Xian Wen

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

First Passage ◽

Risk Probability ◽

Markov Decision

Download Full-text