Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Abstract: This work concerns Markov decision chains on a finite state space. The decision-maker has a constant and nonnull risk sensitivity coefficient, and the performance of a control policy is measured by two different indices, namely, the discounted and average criteria. Motivated by well-known results for the risk-neutral case, the problem of approximating the optimal risk-sensitive average cost in terms of the optimal risk-sensitive discounted value functions is addressed. Under suitable communication assumptions, it is shown that, as the discount factor increases to 1, appropriate normalizations of the optimal discounted value functions converge to the optimal average cost, and to the functional part of the solution of the risk-sensitive average cost optimality equation.
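
For orientation, the risk-sensitive average cost optimality equation named in the abstract is commonly written in the form sketched below. The notation is an assumption introduced here for illustration and is not taken from the abstract: S is the finite state space, A(x) the set of actions available at state x, C(x, a) the one-step cost, p_{xy}(a) the transition law, \lambda \neq 0 the risk sensitivity coefficient, g the optimal average cost, and h the "functional part" of the solution.

```latex
% Sketch of the risk-sensitive average cost optimality equation
% (notation assumed for illustration, not quoted from the paper).
g + h(x) \;=\; \min_{a \in A(x)} \Big[\, C(x,a)
      + \frac{1}{\lambda} \log \sum_{y \in S} p_{xy}(a)\, e^{\lambda h(y)} \Big],
\qquad x \in S .
```

As \lambda tends to 0, the term (1/\lambda) \log \sum_y p_{xy}(a) e^{\lambda h(y)} tends to \sum_y p_{xy}(a) h(y), which recovers the familiar risk-neutral equation. The specific normalizations of the discounted value functions used in the paper are not stated in the abstract and are not reproduced here.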

Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

Mathematics of Operations Research ◽ 10.1287/moor.2017.0893 ◽ 2018 ◽ Vol 43 (3) ◽ pp. 1025-1050 ◽ Cited By ~ 3

Author(s): Rolando Cavazos-Cadena

Keyword(s): Average Cost ◽ Risk Sensitive ◽ Markov Decision

THE OPTIMALITY EQUATIONS IN MULTICHAIN DENUMERABLE STATE MARKOV DECISION PROCESSES WITH THE AVERAGE COST CRITERION: THE BOUNDED COST CASE; MULTISTAGE BAYESIAN ACCEPTANCE SAMPLING: OPTIMALITY OF A (z,c",c'^)-SAMPLING PLAN IN CASE OF A POLYA PRIOR DISTRIBUTION

Statistics & Risk Modeling ◽ 10.1524/strm.1985.3.12.143 ◽ 1985 ◽ Vol 3 (1-2)

Author(s): Henk Zijm

Keyword(s): Markov Decision Processes ◽ Average Cost ◽ Prior Distribution ◽ Decision Processes ◽ Sampling Plan ◽ Acceptance Sampling ◽ Average Cost Criterion ◽ Cost Criterion ◽ Markov Decision ◽ Optimality Equations

Solutions of the average cost optimality equation for finite Markov decision chains: risk-sensitive and risk-neutral criteria

Mathematical Methods of Operations Research ◽ 10.1007/s00186-008-0277-y ◽ 2008 ◽ Vol 70 (3) ◽ pp. 541-566 ◽ Cited By ~ 7

Author(s): Rolando Cavazos-Cadena

Keyword(s): Average Cost ◽ Optimality Equation ◽ Risk Sensitive ◽ Risk Neutral ◽ Markov Decision ◽ Average Cost Optimality Equation ◽ Cost Optimality

Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes

Mathematical Methods of Operations Research ◽ 10.1007/s00186-020-00732-8 ◽ 2021

Author(s): O. L. V. Costa ◽ F. Dufour

Keyword(s): Markov Processes ◽ Risk Sensitive ◽ Piecewise Deterministic Markov Processes ◽ Risk Sensitive Control ◽ Optimality Equations

Explicit Solution of the Average-Cost Optimality Equation for a Pest-Control Problem

Advances in Decision Sciences ◽ 10.1155/2011/617812 ◽ 2011 ◽ Vol 2011 ◽ pp. 1-11

Author(s): Epaminondas G. Kyriakidis

Keyword(s): Optimal Control ◽ Markov Decision Process ◽ Pest Control ◽ Continuous Time ◽ Decision Process ◽ Average Cost ◽ Optimality Equation ◽ Markov Decision ◽ Average Cost Optimality Equation ◽ Cost Optimality

We introduce a Markov decision process in continuous time for the optimal control of a simple symmetrical immigration-emigration process by the introduction of total catastrophes. It is proved that a particular control-limit policy is average cost optimal within the class of all stationary policies by verifying that the relative values of this policy are the solution of the corresponding optimality equation.

The convergence of value iteration in average cost Markov decision chains

Operations Research Letters ◽ 10.1016/0167-6377(96)00018-1 ◽ 1996 ◽ Vol 19 (1) ◽ pp. 11-16 ◽ Cited By ~ 12

Author(s): Linn I. Sennott

Keyword(s): Average Cost ◽ Value Iteration ◽ Markov Decision

Optimality conditions for a Markov decision chain with unbounded costs

Journal of Applied Probability ◽ 10.1017/s002190020009728x ◽ 1980 ◽ Vol 17 (04) ◽ pp. 996-1003

Author(s): D. R. Robinson

Keyword(s): Dynamic Programming ◽ Optimality Conditions ◽ Average Cost ◽ Optimality Equation ◽ Dynamic Programming Equation ◽ Potential Condition ◽ Markov Decision ◽ General Conditions

It is known that, when costs are unbounded, satisfaction of the appropriate dynamic programming ‘optimality’ equation by a policy is not sufficient to guarantee its average optimality. A ‘lowest-order potential’ condition is introduced which, along with the dynamic programming equation, is sufficient to establish the optimality of the policy. Also, it is shown that, under fairly general conditions, if the lowest-order potential condition is not satisfied, there exists a non-memoryless policy with smaller average cost than the policy satisfying the dynamic programming equation.
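
As a reading aid, the dynamic programming equation referred to in this abstract is usually stated in the risk-neutral average cost setting as sketched below. The symbols are assumptions introduced here and are not taken from the paper: f is a stationary policy, c(x, a) the one-step cost, p_{xy}(a) the transition law, g a constant, and h a real-valued function on the state space S.

```latex
% Sketch of the average cost dynamic programming equation satisfied by a
% stationary policy f (notation assumed for illustration only).
g + h(x) \;=\; c\big(x, f(x)\big) + \sum_{y \in S} p_{xy}\big(f(x)\big)\, h(y)
        \;=\; \min_{a \in A(x)} \Big[\, c(x,a) + \sum_{y \in S} p_{xy}(a)\, h(y) \Big],
\qquad x \in S .
```

The usual verification argument iterates this identity n times and divides by n, so it needs the term E[h(X_n)]/n to vanish as n grows. That step is automatic when h is bounded but can fail when costs, and hence h, are unbounded, which is consistent with the abstract's point that the equation alone does not guarantee optimality. The precise form of the ‘lowest-order potential’ condition is not given in the abstract and is not reconstructed here.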
