Approximating general Markovian decision-problems by clustering their state- and action-spaces



Bounds and good policies in stationary finite-stage Markovian decision problems

Advances in Applied Probability ◽

10.1017/s0001867800033437 ◽

1980 ◽

Vol 12 (1) ◽

pp. 154-173

Author(s):

Gerhard Hübner

Keyword(s):

Decision Model ◽

Transition Probabilities ◽

Planning Horizon ◽

Decision Problems ◽

Optimal Decisions ◽

Optimal Value ◽

Action Spaces ◽

Stationary Problems ◽

Markovian Decision Problems

A stationary Markovian decision model is considered with general state and action spaces where the transition probabilities are weakened to be bounded transition measures (this is useful for many applications). New and improved bounds are given for the optimal value of stationary problems with a large planning horizon if either only a few steps of iteration are carried out or, in addition, a solution of the infinite-stage problem is known. Similar estimates are obtained for the quality of policies which are composed of nearly optimal decisions from the first few steps or from the infinite-stage solution.


A modified Gauss-Seidel-algorithm with exclusion of suboptimal actions for a class of semi-Markovian decision problems

Optimization ◽

10.1080/02331930008844514 ◽

2000 ◽

Vol 48 (4) ◽

pp. 429-451 ◽

Cited By ~ 1

Author(s):

V Nollau ◽

D Hudak

Keyword(s):

Decision Problems ◽

Markovian Decision Problems


A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations

SSRN Electronic Journal ◽

10.2139/ssrn.37768 ◽

1997 ◽

Cited By ~ 10

Author(s):

John P. Rust

Keyword(s):

Infinite Horizon ◽

Policy Iteration ◽

Decision Problems ◽

Continuous State ◽

Iteration Methods ◽

Markovian Decision Problems


A method of clustering for discounted Markovian decision problems

Mathematische Operationsforschung und Statistik Series Optimization ◽

10.1080/02331938108842713 ◽

1981 ◽

Vol 12 (1) ◽

pp. 137-147 ◽

Cited By ~ 3

Author(s):

A. Hahnewald-Busch ◽

V. Nollau

Keyword(s):

Decision Problems ◽

Markovian Decision Problems


On the equivalence of mixed and behavior strategies in finitely additive decision problems

Journal of Applied Probability ◽

10.1017/jpr.2019.47 ◽

2019 ◽

Vol 56 (3) ◽

pp. 810-829

Author(s):

János Flesch ◽

Dries Vermeulen ◽

Anna Zseleva

Keyword(s):

Mixed Strategy ◽

Action Space ◽

Decision Problems ◽

Probability Measures ◽

Infinite Time ◽

Behavior Strategy ◽

Behavior Strategies ◽


Arbitrary Action ◽

Action Spaces

We consider decision problems with arbitrary action spaces, deterministic transitions, and infinite time horizon. In the usual setup when probability measures are countably additive, a general version of Kuhn’s theorem implies under fairly general conditions that for every mixed strategy of the decision maker there exists an equivalent behavior strategy. We examine to what extent this remains valid when probability measures are only assumed to be finitely additive. Under the classical approach of Dubins and Savage (2014), we prove the following statements: (1) If the action space is finite, every mixed strategy has an equivalent behavior strategy. (2) Even if the action space is infinite, at least one optimal mixed strategy has an equivalent behavior strategy. The approach by Dubins and Savage turns out to be essentially maximal: these two statements are no longer valid if we take any extension of their approach that considers all singleton plays.
