Planning Algorithms for Zero-Sum Games with Exponential Action Spaces: A Unifying Perspective

Author(s):  
Levi H. S. Lelis

In this paper we review several planning algorithms developed for zero-sum games with exponential action spaces, i.e., spaces that grow exponentially with the number of game components that can act simultaneously at a given game state. For example, real-time strategy games have exponential action spaces because the number of available actions grows exponentially with the number of units the player controls. We also present a unifying perspective in which several existing algorithms can be described as instantiations of a variant of NaiveMCTS. In addition to describing several existing planning algorithms for exponential action spaces, we show that other instantiations of this variant of NaiveMCTS represent novel and promising algorithms to be studied in future work.
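
As context for the review (and not taken from the paper itself), the naive-sampling idea behind NaiveMCTS can be sketched in a few lines: instead of treating every joint action as a separate bandit arm, each unit keeps its own local bandit, so the statistics maintained grow linearly rather than exponentially with the number of units. The class below is a hypothetical illustration; the ε-greedy scheme and the names are assumptions, not the paper's implementation.

```python
import random
from collections import defaultdict

class NaiveSampler:
    """Illustrative sketch of naive sampling over a factored action space.

    unit_actions is a list with one list of legal actions per unit.
    Each unit's choice is treated as an independent local bandit, and a
    joint action's reward updates every local arm that contributed to it.
    """

    def __init__(self, unit_actions, epsilon=0.3):
        self.unit_actions = unit_actions
        self.epsilon = epsilon
        self.totals = [defaultdict(float) for _ in unit_actions]
        self.counts = [defaultdict(int) for _ in unit_actions]

    def sample_joint_action(self):
        joint = []
        for i, actions in enumerate(self.unit_actions):
            if random.random() < self.epsilon or not self.counts[i]:
                joint.append(random.choice(actions))  # explore this unit's actions
            else:
                # exploit: pick the action with the best average reward so far
                joint.append(max(actions,
                                 key=lambda a: self.totals[i][a] / max(self.counts[i][a], 1)))
        return tuple(joint)

    def update(self, joint_action, reward):
        for i, a in enumerate(joint_action):
            self.totals[i][a] += reward
            self.counts[i][a] += 1
```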

Author(s):  
João P. Hespanha

This chapter explores the concept of mixed policies and how the notions developed for pure policies can be adapted to this more general type of policy. A pure policy consists of choices of particular actions (perhaps based on some observation), whereas a mixed policy involves choosing a probability distribution from which to select actions (perhaps as a function of observations). The idea behind mixed policies is that the players select their actions randomly according to a previously selected probability distribution. The chapter first considers the rock-paper-scissors game as an example of a mixed policy before discussing mixed action spaces, mixed security policies and saddle-point equilibria, mixed saddle-point equilibria versus average security levels, and general zero-sum games. It concludes with practice exercises with corresponding solutions and an additional exercise.
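
As a small worked example in the spirit of the chapter (the code itself is not from the book), the sketch below computes the security level of a mixed policy in rock-paper-scissors under the convention that player P1 minimizes the outcome; the uniform mixed policy achieves the mixed saddle-point value of 0.

```python
# Outcome matrix from P1's (minimizing) perspective:
# rows = P1's action, columns = P2's action, order: rock, paper, scissors.
# A win for P1 is -1, a loss is +1, a draw is 0.
A = [[ 0,  1, -1],
     [-1,  0,  1],
     [ 1, -1,  0]]

def mixed_security_level(y):
    """Worst-case expected outcome when P1 plays the mixed policy y."""
    expected = [sum(y[i] * A[i][j] for i in range(3)) for j in range(3)]
    return max(expected)  # P2 responds with the column worst for P1

uniform = [1/3, 1/3, 1/3]
print(mixed_security_level(uniform))  # 0.0, the mixed saddle-point value
```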


Author(s):  
Anderson Rocha Tavares ◽  
Sivasubramanian Anbalagan ◽  
Leandro Soriano Marcolino ◽  
Luiz Chaimowicz

Large state and action spaces pose a significant challenge for reinforcement learning. However, in many domains there is a set of algorithms available, each of which estimates the best action given a state. Hence, agents can either learn a performance-maximizing mapping directly from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform learning over actions within a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.
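
A minimal illustration of the states-to-algorithms alternative (a tabular simplification, not the authors' function-approximation approach; the env interface with reset() and step() is an assumption for the sketch):

```python
import random

def learn_over_algorithms(env, algorithms, episodes=1000,
                          alpha=0.1, gamma=0.95, epsilon=0.1):
    """ε-greedy Q-learning whose decision space is the set of algorithms.

    Each algorithm maps a state to a primitive action, so the learner
    chooses among len(algorithms) options instead of the full (possibly
    exponential) primitive action space.
    """
    Q = {}  # (state, algorithm index) -> value estimate
    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            if random.random() < epsilon:
                k = random.randrange(len(algorithms))
            else:
                k = max(range(len(algorithms)),
                        key=lambda i: Q.get((state, i), 0.0))
            action = algorithms[k](state)  # delegate to the chosen algorithm
            next_state, reward, done = env.step(action)
            best_next = max(Q.get((next_state, i), 0.0)
                            for i in range(len(algorithms)))
            q = Q.get((state, k), 0.0)
            Q[(state, k)] = q + alpha * (reward + gamma * best_next * (not done) - q)
            state = next_state
    return Q
```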


Author(s):  
João P. Hespanha

This chapter defines a number of key concepts for non-zero-sum games involving two players. It begins by considering a two-player game G in which two players P₁ and P₂ are allowed to select policies within action spaces Γ₁ and Γ₂, respectively. Each player wants to minimize their own outcome, and does not care about the outcome of the other player. The chapter proceeds by discussing the security policy and Nash equilibrium for two-player non-zero-sum games, bimatrix games, admissible Nash equilibrium, and mixed policy. It also explores the order interchangeability property for Nash equilibria in best-response equivalent games before concluding with practice exercises and their corresponding solutions, along with additional exercises.
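
To make the bimatrix setting concrete (a hypothetical 2×2 example, not taken from the chapter), the sketch below brute-forces the pure Nash equilibria of a bimatrix game in which, following the chapter's convention, both players minimize their own outcome; mixed policies, which the chapter also covers, are outside this sketch.

```python
def pure_nash_equilibria(A, B):
    """Enumerate pure Nash equilibria of a bimatrix game.

    A[i][j] and B[i][j] are the outcomes of P1 and P2 (both minimizing)
    when P1 plays row i and P2 plays column j. A pair (i, j) is an
    equilibrium when neither player can lower their own outcome by
    deviating unilaterally.
    """
    n, m = len(A), len(A[0])
    return [(i, j)
            for i in range(n) for j in range(m)
            if A[i][j] <= min(A[k][j] for k in range(n))    # P1 cannot improve
            and B[i][j] <= min(B[i][l] for l in range(m))]  # P2 cannot improve

# Hypothetical 2x2 game with two pure equilibria, (0, 0) and (1, 1).
A = [[1, 3], [4, 2]]
B = [[1, 4], [3, 2]]
print(pure_nash_equilibria(A, B))  # [(0, 0), (1, 1)]
```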


1976 ◽  
Vol 39 (1) ◽  
pp. 55-61
Author(s):  
Shaul Fox

According to Messick and McClintock (1968), differences in choice behavior in non-zero-sum strategy games may be explained mainly by three motives: the individualistic, the competitive, and the cooperative. The researchers' operational definitions of these motives are based on the payoffs in the game matrices. This article critically examines Messick and McClintock's exposition and demonstrates that payoff considerations cannot be the sole criterion for identifying motivational goals: disregarding the opponent's choice may lead to mistaken conclusions about the participant's motive as inferred from his decision. To remedy this oversight, the article's proposal for measuring the three motives is based on the following principles: (1) a pre-programmed plan for one participant in the game, in order to standardize the situation the subjects face; (2) a large number of trials, in order to ensure the subject's awareness of the opponent's fixed strategy; (3) the combination of (1) and (2) with appropriate payoff values, which enables the construction of the conflict situation confronting the subject.


2020 ◽  
pp. 1087724X2098158
Author(s):  
Camilo Benitez-Avila ◽  
Andreas Hartmann ◽  
Geert Dewulf

The process management literature is skeptical about creating legitimacy and a sense of partnership when implementing concessional Public-Private Partnerships. Within such organizational arrangements, managerial interaction often resembles a zero-sum game. To explore the possibility of (re)creating a sense of partnership in concessional PPPs, we developed the "3P challenge" serious game. Two gaming sessions, with a mixed group of practitioners and a team of public project managers, showed that the game cycle recreates adversarial situations in which players can enact contractual obligations with higher or lower levels of subjectivity. Reflecting on the gaming experience, practitioners point out that PPP contracts can be creatively enacted by managers who act as brokers of diverse interests. As they become aware of each other's stakes, they can blend contractual dispositions or place brackets around some contractual clauses to reach agreement. By doing so, they can (re)create a sense of partnership, clarity, and fairness in the PPP contract.

