scholarly journals Computing Maxmin Strategies in Extensive-form Zero-sum Games with Imperfect Recall

Author(s):  
Branislav Bosansky ◽  
Jiri Cermak ◽  
Karel Horak ◽  
Michal Pechoucek
Author(s):  
Jiri Cermak ◽  
Branislav Bošanský ◽  
Viliam Lisý

We solve large two-player zero-sum extensive-form games with perfect recall. We propose a new algorithm based on fictitious play that significantly reduces memory requirements for storing average strategies. The key feature is exploiting imperfect recall abstractions while preserving the convergence rate and guarantees of fictitious play applied directly to the perfect recall game. The algorithm creates a coarse imperfect recall abstraction of the perfect recall game and automatically refines its information set structure only where the imperfect recall might cause problems. Experimental evaluation shows that our novel algorithm is able to solve a simplified poker game with 7.10^5 information sets using an abstracted game with only 1.8% of information sets of the original game. Additional experiments on poker and randomly generated games suggest that the relative size of the abstraction decreases as the size of the solved games increases.


Author(s):  
Chun Kai Ling ◽  
Fei Fang ◽  
J. Zico Kolter

With the recent advances in solving large, zero-sum extensive form games, there is a growing interest in the inverse problem of inferring underlying game parameters given only access to agent actions. Although a recent work provides a powerful differentiable end-to-end learning frameworks which embed a game solver within a deep-learning framework, allowing unknown game parameters to be learned via backpropagation, this framework faces significant limitations when applied to boundedly rational human agents and large scale problems, leading to poor practicality. In this paper, we address these limitations and propose a framework that is applicable for more practical settings. First, seeking to learn the rationality of human agents in complex two-player zero-sum games, we draw upon well-known ideas in decision theory to obtain a concise and interpretable agent behavior model, and derive solvers and gradients for end-to-end learning. Second, to scale up to large, real-world scenarios, we propose an efficient first-order primal-dual method which exploits the structure of extensive-form games, yielding significantly faster computation for both game solving and gradient computation. When tested on randomly generated games, we report speedups of orders of magnitude over previous approaches. We also demonstrate the effectiveness of our model on both real-world one-player settings and synthetic data.


1992 ◽  
Vol 4 (4) ◽  
pp. 528-552 ◽  
Author(s):  
Daphne Koller ◽  
Nimrod Megiddo

2020 ◽  
pp. 1087724X2098158
Author(s):  
Camilo Benitez-Avila ◽  
Andreas Hartmann ◽  
Geert Dewulf

Process management literature is skeptical about creating legitimacy and a sense of partnership when implementing concessional Public-Private Partnerships. Within such organizational arrangements, managerial interaction often resembles zero-sum games. To explore the possibility to (re)create a sense of partnership in concessional PPPs, we developed the “3P challenge” serious game. Two gaming sessions with a mixed group of practitioners and a team of public project managers showed that the game cycle recreates adversarial situations where players can enact contractual obligations with higher or lower levels of subjectivity. When reflecting on the gaming experience, practitioners point out that PPP contracts can be creatively enacted by managers who act as brokers of diverse interests. While becoming aware of each other stakes they can blend contractual dispositions or place brackets around some contractual clauses for reaching agreement. By doing so, they can (re)create a sense of partnership, clarity, and fairness of the PPP contract.


Sign in / Sign up

Export Citation Format

Share Document