Centralized Optimization for Dec-POMDPs Under the Expected Average Reward Criterion

2017 ◽ Vol 62 (11) ◽ pp. 6032-6038
Author(s): Xiaofeng Jiang, Xiaodong Wang, Hongsheng Xi, Falin Liu

2015 ◽ Vol 52 (2) ◽ pp. 419-440
Author(s): Rolando Cavazos-Cadena, Raúl Montes-De-Oca, Karel Sladký

This paper concerns discrete-time Markov decision chains with a denumerable state space and compact action sets. Besides standard continuity requirements, the main assumption on the model is that it admits a Lyapunov function ℓ. In this context the average reward criterion is analyzed from the sample-path point of view. The main conclusion is that if the expected average reward associated with ℓ² is finite under every policy, then a stationary policy obtained from the optimality equation in the standard way is sample-path average optimal in a strong sense.
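For reference, the optimality equation mentioned in the abstract has the following standard average-reward form (a generic sketch in common notation; the symbols g, h, r, p, S, and A(x) are not taken from the paper itself):

    g + h(x) = \max_{a \in A(x)} \Big[ r(x,a) + \sum_{y \in S} p(y \mid x,a)\, h(y) \Big], \qquad x \in S,

where g is the optimal expected average reward, h is a relative value (bias) function, r(x,a) is the one-step reward, and p(y | x,a) are the transition probabilities on the state space S. A stationary policy that selects, at each state x, an action attaining the maximum on the right-hand side is what is meant by a policy "obtained from the optimality equation in the standard way".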


1999 ◽ Vol 30 (7-8) ◽ pp. 7-20
Author(s): M. Kurano, M. Yasuda, J.-I. Nakagami, Y. Yoshida
