Geometric convergence of value-iteration in multichain Markov decision problems

P. J. Schweitzer; A. Federgruen

doi:10.1017/s000186780003175x

Geometric convergence of value-iteration in multichain Markov decision problems

Advances in Applied Probability ◽

10.1017/s000186780003175x ◽

1979 ◽

Vol 11 (01) ◽

pp. 188-217 ◽

Cited By ~ 3

Author(s):

P. J. Schweitzer ◽

A. Federgruen

Keyword(s):

Convergence Rate ◽

Iteration Method ◽

Chain Structure ◽

Decision Problems ◽

Value Iteration ◽

Convergence Factor ◽

Markov Decision Problems ◽

Geometric Convergence ◽

Markov Decision ◽

Maximal Gain

This paper considers undiscounted Markov decision problems. With no restriction (on either the periodicity or chain structure of the problem) we show that the value iteration method for finding maximal gain policies exhibits a geometric rate of convergence, whenever convergence occurs. In addition, we study the behaviour of the value-iteration operator; we give bounds for the number of steps needed for contraction, describe the ultimate behaviour of the convergence factor and give conditions for the existence of a uniform convergence rate.

Geometric convergence of value-iteration in multichain Markov decision problems

Advances in Applied Probability ◽

10.2307/1426774 ◽

1979 ◽

Vol 11 (1) ◽

pp. 188-217 ◽

Cited By ~ 23

Author(s):

P. J. Schweitzer ◽

A. Federgruen

Keyword(s):

Convergence Rate ◽

Iteration Method ◽

Chain Structure ◽

Decision Problems ◽

Value Iteration ◽

Convergence Factor ◽

Markov Decision Problems ◽

Geometric Convergence ◽

Markov Decision ◽

Maximal Gain

DISCOUNTED AND UNDISCOUNTED VALUE-ITERATION IN MARKOV DECISION PROBLEMS: A SURVEY

Dynamic Programming and its Applications ◽

10.1016/b978-0-12-568150-6.50008-8 ◽

1978 ◽

pp. 23-52 ◽

Cited By ~ 11

Author(s):

A. Federgruen ◽

P.J. Schweitzer

Keyword(s):

Decision Problems ◽

Value Iteration ◽

Markov Decision Problems ◽

Markov Decision

The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

Mathematics of Operations Research ◽

10.1287/moor.2.4.360 ◽

1977 ◽

Vol 2 (4) ◽

pp. 360-381 ◽

Cited By ~ 30

Author(s):

P. J. Schweitzer ◽

A. Federgruen

Keyword(s):

Asymptotic Behavior ◽

Decision Problems ◽

Value Iteration ◽

Markov Decision Problems ◽

Markov Decision

Model Acquisition for Markov Decision Problems

10.21236/ada373795 ◽

1998 ◽

Author(s):

Thomas L. Dean

Keyword(s):

Decision Problems ◽

Model Acquisition ◽

Markov Decision Problems ◽

Markov Decision

Model Acquisition for Markov Decision Problems

10.21236/ada380049 ◽

1998 ◽

Author(s):

Thomas Dean

Keyword(s):

Decision Problems ◽

Model Acquisition ◽

Markov Decision Problems ◽

Markov Decision

Solving Uncertain Markov Decision Problems: An Interval-Based Method

Lecture Notes in Computer Science - Advances in Natural Computation ◽

10.1007/11881223_120 ◽

2006 ◽

pp. 948-957 ◽

Cited By ~ 2

Author(s):

Shulin Cui ◽

Jigui Sun ◽

Minghao Yin ◽

Shuai Lu

Keyword(s):

Decision Problems ◽

Markov Decision Problems ◽

Markov Decision

Optimal control and optimal sensor activation for Markov decision problems with costly observations

2015 IEEE Conference on Control Applications (CCA) ◽

10.1109/cca.2015.7320814 ◽

2015 ◽

Cited By ~ 1

Author(s):

Rene K. Boel ◽

Jan H. van Schuppen

Keyword(s):

Optimal Control ◽

Decision Problems ◽

Markov Decision Problems ◽

Markov Decision ◽

Sensor Activation

A simulation-based learning automata framework for solving semi-Markov decision problems under long-run average reward

IIE Transactions ◽

10.1080/07408170490438672 ◽

2004 ◽

Vol 36 (6) ◽

pp. 557-567 ◽

Cited By ~ 14

Author(s):

ABHIJIT GOSAVI ◽

TAPAS K. DAS ◽

SUDEEP SARKAR

Keyword(s):

Learning Automata ◽

Decision Problems ◽

Average Reward ◽

Markov Decision Problems ◽

Long Run ◽

Simulation Based ◽

Markov Decision ◽

Long Run Average Reward

A reinforcement learning algorithm with fuzzy approximation for semi Markov decision problems

Journal of Intelligent & Fuzzy Systems ◽

10.3233/ifs-141460 ◽

2015 ◽

Vol 28 (4) ◽

pp. 1733-1744 ◽

Cited By ~ 1

Author(s):

Ufuk Kula ◽

Beyazıt Ocaktan

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Decision Problems ◽

Fuzzy Approximation ◽

Markov Decision Problems ◽

Markov Decision ◽

Reinforcement Learning Algorithm

Infinite Horizon Markov Decision Problems

Optimized Response-Adaptive Clinical Trials ◽

10.1007/978-3-658-08344-1_3 ◽

2014 ◽

pp. 39-65

Author(s):

Thomas Ondra

Keyword(s):

Infinite Horizon ◽

Decision Problems ◽

Markov Decision Problems ◽

Markov Decision