compact action sets Latest Research Papers

Uniform value for recursive games with compact action sets

Operations Research Letters ◽

10.1016/j.orl.2016.06.005 ◽

2016 ◽

Vol 44 (5) ◽

pp. 575-577 ◽

Cited By ~ 1

Author(s):

Xiaoxi Li ◽

Sylvain Sorin

Keyword(s):

Uniform Value ◽

Recursive Games ◽

Compact Action Sets

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

Journal of Applied Probability ◽

10.1239/jap/1437658607 ◽

2015 ◽

Vol 52 (2) ◽

pp. 419-440

Author(s):

Rolando Cavazos-Cadena ◽

Raúl Montes-De-Oca ◽

Karel Sladký

Keyword(s):

Sample Path ◽

Point Of View ◽

Average Reward ◽

Stationary Policy ◽

Optimality Equation ◽

Markov Decision ◽

Average Reward Criterion ◽

Compact Action Sets ◽

Path Point ◽

Reward Criterion

This paper concerns discrete-time Markov decision chains with denumerable state and compact action sets. Besides standard continuity requirements, the main assumption on the model is that it admits a Lyapunov function ℓ. In this context the average reward criterion is analyzed from the sample-path point of view. The main conclusion is that if the expected average reward associated to ℓ2 is finite under any policy then a stationary policy obtained from the optimality equation in the standard way is sample-path average optimal in a strong sense.

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

Journal of Applied Probability ◽

10.1017/s0021900200012559 ◽

2015 ◽

Vol 52 (02) ◽

pp. 419-440 ◽

Cited By ~ 1

Author(s):

Rolando Cavazos-Cadena ◽

Raúl Montes-De-Oca ◽

Karel Sladký

Keyword(s):

Sample Path ◽

Point Of View ◽

Average Reward ◽

Stationary Policy ◽

Optimality Equation ◽

Markov Decision ◽

Average Reward Criterion ◽

Compact Action Sets ◽

Path Point ◽

Reward Criterion

This paper concerns discrete-time Markov decision chains with denumerable state and compact action sets. Besides standard continuity requirements, the main assumption on the model is that it admits a Lyapunov function ℓ. In this context the average reward criterion is analyzed from the sample-path point of view. The main conclusion is that if the expected average reward associated to ℓ2is finite under any policy then a stationary policy obtained from the optimality equation in the standard way is sample-path average optimal in a strong sense.

A Zero-Sum Stochastic Game with Compact Action Sets and no Asymptotic Value

Dynamic Games and Applications ◽

10.1007/s13235-013-0073-z ◽

2013 ◽

Vol 3 (2) ◽

pp. 172-186 ◽

Cited By ~ 32

Author(s):

Guillaume Vigeral

Keyword(s):

Stochastic Game ◽

Asymptotic Value ◽

Compact Action Sets ◽

Zero Sum

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

Journal of Applied Probability ◽

10.1239/jap/1134587805 ◽

2005 ◽

Vol 42 (4) ◽

pp. 905-918 ◽

Cited By ~ 1

Author(s):

Rolando Cavazos-Cadena ◽

Raúl Montes-De-Oca

Keyword(s):

Iteration Algorithm ◽

Value Iteration ◽

Stationary Policy ◽

Long Run ◽

Risk Sensitive ◽

Finite State ◽

Markov Decision ◽

Optimal Stationary Policy ◽

Compact Action Sets ◽

Nonstationary Value Iteration

This work concerns Markov decision chains with finite state spaces and compact action sets. The performance index is the long-run risk-sensitive average cost criterion, and it is assumed that, under each stationary policy, the state space is a communicating class and that the cost function and the transition law depend continuously on the action. These latter data are not directly available to the decision-maker, but convergent approximations are known or are more easily computed. In this context, the nonstationary value iteration algorithm is used to approximate the solution of the optimality equation, and to obtain a nearly optimal stationary policy.

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

Journal of Applied Probability ◽

10.1017/s0021900200000991 ◽

2005 ◽

Vol 42 (04) ◽

pp. 905-918

Author(s):

Rolando Cavazos-Cadena ◽

Raúl Montes-De-Oca

Keyword(s):

Iteration Algorithm ◽

Value Iteration ◽

Stationary Policy ◽

Long Run ◽

Risk Sensitive ◽

Finite State ◽

Markov Decision ◽

Optimal Stationary Policy ◽

Compact Action Sets ◽

Nonstationary Value Iteration

This work concerns Markov decision chains with finite state spaces and compact action sets. The performance index is the long-run risk-sensitive average cost criterion, and it is assumed that, under each stationary policy, the state space is a communicating class and that the cost function and the transition law depend continuously on the action. These latter data are not directly available to the decision-maker, but convergent approximations are known or are more easily computed. In this context, the nonstationary value iteration algorithm is used to approximate the solution of the optimality equation, and to obtain a nearly optimal stationary policy.

A Note on the Existence of Optimal Policies in Total Reward Dynamic Programs with Compact Action Sets

Mathematics of Operations Research ◽

10.1287/moor.25.4.657.12112 ◽

2000 ◽

Vol 25 (4) ◽

pp. 657-666 ◽

Cited By ~ 1

Author(s):

Rolando Cavazos-Cadena ◽

Eugene A. Feinberg ◽

Raúl Montes-de-Oca

Keyword(s):

Total Reward ◽

Optimal Policies ◽

Dynamic Programs ◽

Compact Action Sets

compact action sets
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Uniform value for recursive games with compact action sets

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

A Zero-Sum Stochastic Game with Compact Action Sets and no Asymptotic Value

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

A Note on the Existence of Optimal Policies in Total Reward Dynamic Programs with Compact Action Sets

Export Citation Format

compact action setsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Uniform value for recursive games with compact action sets

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

A Zero-Sum Stochastic Game with Compact Action Sets and no Asymptotic Value

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

A Note on the Existence of Optimal Policies in Total Reward Dynamic Programs with Compact Action Sets

compact action sets
Recently Published Documents