Optimal strategies for adaptive zero-sum average Markov games

Abstract Zero-sum two-person finite undiscounted (limiting ratio average) semi-Markov games (SMGs) are considered with a general multichain structure. We derive the strategy evaluation equations for stationary strategies of the players. A relation between the payoff in the multichain SMG and that in the associated stochastic game (SG) obtained by a data-transformation is established. We prove that the multichain optimality equations (OEs) for an SMG have a solution if and only if the associated SG has optimal stationary strategies. Though the solution of the OEs may not be optimal for an SMG, we establish the significance of studying the OEs for a multichain SMG. We provide a nice example of SMGs in which one player has no optimal strategy in the stationary class but has an optimal semistationary strategy (that depends only on the initial and current state of the game). For an SMG with absorbing states, we prove that solutions in the game where all players are restricted to semistationary strategies are solutions for the unrestricted game. Finally, we prove the existence of stationary optimal strategies for unichain SMGs and conclude that the unichain condition is equivalent to require that the game satisfies some recurrence/ergodicity/weakly communicating conditions.

Download Full-text

Average optimal strategies for zero-sum Markov games with poorly known payoff function on one side

Journal of Dynamics & Games ◽

10.3934/jdg.2014.1.105 ◽

2014 ◽

Vol 1 (1) ◽

pp. 105-119

Author(s):

Fernando Luque-Vásquez ◽

◽

J. Adolfo Minjárez-Sosa ◽

Keyword(s):

Optimal Strategies ◽

Payoff Function ◽

Markov Games ◽

Zero Sum ◽

Average Optimal Strategies

Download Full-text

Asymptotically Optimal Strategies for Adaptive Zero-Sum Discounted Markov Games

SIAM Journal on Control and Optimization ◽

10.1137/060651458 ◽

2009 ◽

Vol 48 (3) ◽

pp. 1405-1421 ◽

Cited By ~ 6

Author(s):

J. Adolfo Minjárez-Sosa ◽

Oscar Vega-Amaya

Keyword(s):

Optimal Strategies ◽

Asymptotically Optimal ◽

Markov Games ◽

Zero Sum

Download Full-text

Zero-sum two-person semi-Markov games

Journal of Applied Probability ◽

10.1017/s002190020010662x ◽

1992 ◽

Vol 29 (01) ◽

pp. 56-72 ◽

Cited By ~ 6

Author(s):

Arbind K. Lal ◽

Sagnik Sinha

Keyword(s):

Stochastic Games ◽

Optimal Strategies ◽

Existence Of A Solution ◽

Average Case ◽

Optimality Equation ◽

Markov Games ◽

Special Cases ◽

Ergodic Condition ◽

Markov Decision ◽

Zero Sum

Semi-Markov games are investigated under discounted and limiting average payoff criteria. The issue of the existence of the value and a pair of stationary optimal strategies are settled; the optimality equation is studied and under a natural ergodic condition the existence of a solution to the optimality equation is proved for the limiting average case. Semi-Markov games provide useful flexibility in constructing recursive game models. All the work on Markov/semi-Markov decision processes and Markov (stochastic) games can be viewed as special cases of the developments in this paper.

Download Full-text

Zero-sum two-person semi-Markov games

Journal of Applied Probability ◽

10.2307/3214791 ◽

1992 ◽

Vol 29 (1) ◽

pp. 56-72 ◽

Cited By ~ 13

Author(s):

Arbind K. Lal ◽

Sagnik Sinha

Keyword(s):

Stochastic Games ◽

Optimal Strategies ◽

Existence Of A Solution ◽

Average Case ◽

Optimality Equation ◽

Markov Games ◽

Special Cases ◽

Ergodic Condition ◽

Markov Decision ◽

Zero Sum

Semi-Markov games are investigated under discounted and limiting average payoff criteria. The issue of the existence of the value and a pair of stationary optimal strategies are settled; the optimality equation is studied and under a natural ergodic condition the existence of a solution to the optimality equation is proved for the limiting average case. Semi-Markov games provide useful flexibility in constructing recursive game models. All the work on Markov/semi-Markov decision processes and Markov (stochastic) games can be viewed as special cases of the developments in this paper.

Download Full-text

Zero-Sum Markov Games with Random State-Actions-Dependent Discount Factors: Existence of Optimal Strategies

Dynamic Games and Applications ◽

10.1007/s13235-018-0248-8 ◽

2018 ◽

Vol 9 (1) ◽

pp. 103-121 ◽

Cited By ~ 1

Author(s):

David González-Sánchez ◽

Fernando Luque-Vásquez ◽

J. Adolfo Minjárez-Sosa

Keyword(s):

Optimal Strategies ◽

Markov Games ◽

Discount Factors ◽

Zero Sum ◽

Random State

Download Full-text

Zero-Sum Ergodic Semi-Markov Games with Weakly Continuous Transition Probabilities

Journal of Optimization Theory and Applications ◽

10.1007/s10957-008-9491-2 ◽

2009 ◽

Vol 141 (2) ◽

pp. 321-347 ◽

Cited By ~ 15

Author(s):

A. Jaśkiewicz

Keyword(s):

Transition Probabilities ◽

Continuous Transition ◽

Markov Games ◽

Weakly Continuous ◽

Zero Sum

Download Full-text

Zero-Sum Markov Games

Zero-Sum Discrete-Time Markov Games with Unknown Disturbance Distribution - SpringerBriefs in Probability and Mathematical Statistics ◽

10.1007/978-3-030-35720-7_1 ◽

2020 ◽

pp. 1-8

Author(s):

J. Adolfo Minjárez-Sosa

Keyword(s):

Markov Games ◽

Zero Sum

Download Full-text

PIVOTING ALGORITHMS FOR SOME CLASSES OF STOCHASTIC GAMES: A SURVEY

International Game Theory Review ◽

10.1142/s0219198901000385 ◽

2001 ◽

Vol 03 (02n03) ◽

pp. 253-281 ◽

Cited By ~ 21

Author(s):

S. R. MOHAN ◽

S. K. NEOGY ◽

T. PARTHASARATHY

Keyword(s):

Linear Complementarity Problem ◽

Complementarity Problem ◽

Stochastic Games ◽

Optimal Strategies ◽

Linear Complementarity ◽

Dual Simplex Method ◽

Special Cases ◽

Dual Simplex ◽

Finite Step ◽

Zero Sum

In this paper, we survey the recent literature on computing the value vector and the associated optimal strategies of the players for special cases of zero-sum stochastic games, or in computing a Nash equilibrium point and the corresponding stationary strategies of the players for special cases of nonzero-sum stochastic games, using finite-step algorithms based on pivoting. Examples of finite-step pivoting algorithms are the various simplex-type algorithms, such as the primal simplex or dual simplex method for solving the linear programming problem or Lemke's or Lemke-Howson's algorithm for solving the linear complementarity problem. Also included are Lemke-type algorithms for solving various generalisations of the linear complementarity problem. The survey also includes a few new results and observations.

Download Full-text

Dual-Issue Final-Offer Arbitration: Invariance of Pure Optimal Strategies Under Lp Metrics

International Game Theory Review ◽

10.1142/s0219198919500117 ◽

2019 ◽

Vol 21 (04) ◽

pp. 1950011

Author(s):

Brian R. Powers

Keyword(s):

Normal Distribution ◽

Optimal Strategies ◽

Bivariate Normal Distribution ◽

Bivariate Normal ◽

Final Offer ◽

Pure Strategies ◽

Zero Sum

We consider a final-offer arbitration problem between two players with two quantitative issues in dispute. We model the problem as a zero-sum game where the arbiter’s opinion is drawn from a bivariate normal distribution and derive the only possible pure strategies regardless of the choice of [Formula: see text] metric used by the arbiter.

Download Full-text