Decentralized No-regret Learning Algorithms for Extensive-form Correlated Equilibria (Extended Abstract)

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/645 ◽

2021 ◽

Author(s):

Andrea Celli ◽

Alberto Marchesi ◽

Gabriele Farina ◽

Nicola Gatti

Keyword(s):

Normal Form ◽

Private Information ◽

Imperfect Information ◽

Research Question ◽

Correlated Equilibrium ◽

Extensive Form ◽

Learning Dynamics ◽

Extensive Form Games ◽

Normal Form Games ◽

Correlated Equilibria

The existence of uncoupled no-regret learning dynamics converging to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form games generalize normal-form games by modeling both sequential and simultaneous moves, as well as imperfect information. Because of the sequential nature and the presence of private information, correlation in extensive-form games possesses significantly different properties than in normal-form games. The extensive-form correlated equilibrium (EFCE) is the natural extensive-form counterpart to the classical notion of correlated equilibrium in normal-form games. Compared to the latter, the constraints that define the set of EFCEs are significantly more complex, as the correlation device ({\em a.k.a.} mediator) must take into account the evolution of beliefs of each player as they make observations throughout the game. Due to this additional complexity, the existence of uncoupled learning dynamics leading to an EFCE has remained a challenging open research question for a long time. In this article, we settle that question by giving the first uncoupled no-regret dynamics which provably converge to the set of EFCEs in n-player general-sum extensive-form games with perfect recall. We show that each iterate can be computed in time polynomial in the size of the game tree, and that, when all players play repeatedly according to our learning dynamics, the empirical frequency of play after T game repetitions is guaranteed to be a O(T^-1/2)-approximate EFCE with high probability, and an EFCE almost surely in the limit.

Get full-text (via PubEx)

Coarse Correlation in Extensive-Form Games

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i02.5563 ◽

2020 ◽

Vol 34 (02) ◽

pp. 1934-1941

Author(s):

Gabriele Farina ◽

Tommaso Bianchi ◽

Tuomas Sandholm

Keyword(s):

Normal Form ◽

Saddle Points ◽

Solution Concept ◽

Correlated Equilibrium ◽

Extensive Form ◽

Extensive Form Games ◽

Rational Agents ◽

Correlation Models ◽

Correlated Equilibria ◽

Special Case

Coarse correlation models strategic interactions of rational agents complemented by a correlation device which is a mediator that can recommend behavior but not enforce it. Despite being a classical concept in the theory of normal-form games since 1978, not much is known about the merits of coarse correlation in extensive-form settings. In this paper, we consider two instantiations of the idea of coarse correlation in extensive-form games: normal-form coarse-correlated equilibrium (NFCCE), already defined in the literature, and extensive-form coarse-correlated equilibrium (EFCCE), a new solution concept that we introduce. We show that EFCCEs are a subset of NFCCEs and a superset of the related extensive-form correlated equilibria. We also show that, in n-player extensive-form games, social-welfare-maximizing EFCCEs and NFCCEs are bilinear saddle points, and give new efficient algorithms for the special case of two-player games with no chance moves. Experimentally, our proposed algorithm for NFCCE is two to four orders of magnitude faster than the prior state of the art.

Get full-text (via PubEx)

Solving Large Extensive-Form Games with Strategy Constraints

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33011861 ◽

2019 ◽

Vol 33 ◽

pp. 1861-1868

Author(s):

Trevor Davis ◽

Kevin Waugh ◽

Michael Bowling

Keyword(s):

Private Information ◽

Imperfect Information ◽

Risk Mitigation ◽

Solution Concept ◽

Optimal Strategies ◽

Linear Constraints ◽

Convex Constraints ◽

Extensive Form ◽

Extensive Form Games ◽

Large Extensive Form Games

Extensive-form games are a common model for multiagent interactions with imperfect information. In two-player zerosum games, the typical solution concept is a Nash equilibrium over the unconstrained strategy set for each player. In many situations, however, we would like to constrain the set of possible strategies. For example, constraints are a natural way to model limited resources, risk mitigation, safety, consistency with past observations of behavior, or other secondary objectives for an agent. In small games, optimal strategies under linear constraints can be found by solving a linear program; however, state-of-the-art algorithms for solving large games cannot handle general constraints. In this work we introduce a generalized form of Counterfactual Regret Minimization that provably finds optimal strategies under any feasible set of convex constraints. We demonstrate the effectiveness of our algorithm for finding strategies that mitigate risk in security games, and for opponent modeling in poker games when given only partial observations of private information.

Get full-text (via PubEx)

Non-altruistic Equilibria

The Indian Economic Journal ◽

10.1177/0019466220953124 ◽

2019 ◽

Vol 67 (3-4) ◽

pp. 185-195

Author(s):

Kazuhiro Ohnishi

Keyword(s):

Normal Form ◽

Cooperative Games ◽

The Other ◽

Extensive Form ◽

Extensive Form Games ◽

Normal Form Games ◽

Subgame Perfect Equilibria ◽

Perfect Equilibria ◽

Non Cooperative Games

Which choice will a player make if he can make one of two choices in which his own payoffs are equal, but his rival’s payoffs are not equal, that is, one with a large payoff for his rival and the other with a small payoff for his rival? This paper introduces non-altruistic equilibria for normal-form games and extensive-form non-altruistic equilibria for extensive-form games as equilibrium concepts of non-cooperative games by discussing such a problem and examines the connections between their equilibrium concepts and Nash and subgame perfect equilibria that are important and frequently encountered equilibrium concepts.

Get full-text (via PubEx)

A relation between perfect equilibria in extensive form games and proper equilibria in normal form games

International Journal of Game Theory ◽

10.1007/bf01769861 ◽

1984 ◽

Vol 13 (1) ◽

pp. 1-13 ◽

Cited By ~ 69

Author(s):

E. van Damme

Keyword(s):

Normal Form ◽

Extensive Form ◽

Extensive Form Games ◽

Normal Form Games ◽

Perfect Equilibria

Get full-text (via PubEx)

Beyond the symmetric normal form: extensive form games, asymmetric games and games with continuous strategy spaces

Proceedings of Symposia in Applied Mathematics - Evolutionary Game Dynamics ◽

10.1090/psapm/069/2882633 ◽

2011 ◽

pp. 27-59 ◽

Cited By ~ 3

Author(s):

Ross Cressman

Keyword(s):

Normal Form ◽

Extensive Form ◽

Extensive Form Games

Get full-text (via PubEx)

An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information

Journal of Artificial Intelligence Research ◽

10.1613/jair.4477 ◽

2014 ◽

Vol 51 ◽

pp. 829-866 ◽

Cited By ~ 14

Author(s):

B. Bosansky ◽

C. Kiekintveld ◽

V. Lisy ◽

M. Pechoucek

Keyword(s):

Nash Equilibrium ◽

Imperfect Information ◽

Search Algorithm ◽

Main Idea ◽

Substantial Improvement ◽

Extensive Form ◽

Extensive Form Games ◽

Solution Algorithms ◽

Restricted Game ◽

Zero Sum

Developing scalable solution algorithms is one of the central problems in computational game theory. We present an iterative algorithm for computing an exact Nash equilibrium for two-player zero-sum extensive-form games with imperfect information. Our approach combines two key elements: (1) the compact sequence-form representation of extensive-form games and (2) the algorithmic framework of double-oracle methods. The main idea of our algorithm is to restrict the game by allowing the players to play only selected sequences of available actions. After solving the restricted game, new sequences are added by finding best responses to the current solution using fast algorithms. We experimentally evaluate our algorithm on a set of games inspired by patrolling scenarios, board, and card games. The results show significant runtime improvements in games admitting an equilibrium with small support, and substantial improvement in memory use even on games with large support. The improvement in memory use is particularly important because it allows our algorithm to solve much larger game instances than existing linear programming methods. Our main contributions include (1) a generic sequence-form double-oracle algorithm for solving zero-sum extensive-form games; (2) fast methods for maintaining a valid restricted game model when adding new sequences; (3) a search algorithm and pruning methods for computing best-response sequences; (4) theoretical guarantees about the convergence of the algorithm to a Nash equilibrium; (5) experimental analysis of our algorithm on several games, including an approximate version of the algorithm.

Get full-text (via PubEx)

Equilibrium in behavior strategies in infinite extensive form games with imperfect information

Economic Theory ◽

10.1007/bf01212472 ◽

1992 ◽

Vol 2 (4) ◽

pp. 481-494

Author(s):

Subir K. Chakrabarti

Keyword(s):

Imperfect Information ◽

Extensive Form ◽

Extensive Form Games ◽

Behavior Strategies

Get full-text (via PubEx)

Can quantum entanglement implement classical correlated equilibria?

Quantum Information and Computation ◽

10.26421/qic14.5-6-7 ◽

2014 ◽

Vol 14 (5&6) ◽

pp. 493-516

Author(s):

Alan Deckelbaum

Keyword(s):

Quantum State ◽

Complete Information ◽

Impossibility Result ◽

Correlated Equilibrium ◽

Extensive Form ◽

Initial State ◽

Equilibrium Distributions ◽

Pure Quantum State ◽

Correlated Equilibria ◽

Classical Game

We ask whether players of a classical game can partition a pure quantum state to implement classical correlated equilibrium distributions. The main contribution of this work is an impossibility result: we provide an example of a classical correlated equilibrium that cannot be securely implemented without useful information leaking outside the system. We study the model where players of a classical complete information game initially share an entangled pure quantum state. Players may perform arbitrary local operations on their subsystems, but no direct communication (either quantum or classical) is allowed. We explain why, for the purpose of implementing classical correlated equilibria, it is desirable to restrict the initial state to be pure and to restrict communication. In this framework, we define the concept of pure quantum correlated equilibrium (PQCE) and show that in a normal form game, any outcome distribution implementable by a PQCE can also be implemented by a classical correlated equilibrium (CE), but that the converse is false. We extend our analysis to extensive form games, and compare the power of PQCE to extensive form classical correlated equilibria (EFCE) and immediate-revelation extensive form correlated equilibria (IR-EFCE).

Get full-text (via PubEx)

Partial Order Games

Games ◽

10.3390/g13010002 ◽

2021 ◽

Vol 13 (1) ◽

pp. 2

Author(s):

Valeria Zahoransky ◽

Julian Gutierrez ◽

Paul Harrenstein ◽

Michael Wooldridge

Keyword(s):

Partial Order ◽

Imperfect Information ◽

Compact Representation ◽

Game Model ◽

Extensive Form ◽

Dependence Relation ◽

Extensive Form Games ◽

Extensive Form Game ◽

The Cost ◽

The Relationship

We introduce a non-cooperative game model in which players’ decision nodes are partially ordered by a dependence relation, which directly captures informational dependencies in the game. In saying that a decision node v is dependent on decision nodes v1,…,vk, we mean that the information available to a strategy making a choice at v is precisely the choices that were made at v1,…,vk. Although partial order games are no more expressive than extensive form games of imperfect information (we show that any partial order game can be reduced to a strategically equivalent extensive form game of imperfect information, though possibly at the cost of an exponential blowup in the size of the game), they provide a more natural and compact representation for many strategic settings of interest. After introducing the game model, we investigate the relationship to extensive form games of imperfect information, the problem of computing Nash equilibria, and conditions that enable backwards induction in this new model.

Get full-text (via PubEx)

Bayesian learning leads to correlated equilibria in normal form games

Economic Theory ◽

10.1007/bf01213814 ◽

1994 ◽

Vol 4 (6) ◽

pp. 821-841 ◽

Cited By ~ 28

Author(s):

Yaw Nyarko

Keyword(s):

Normal Form ◽

Bayesian Learning ◽

Normal Form Games ◽

Correlated Equilibria

Get full-text (via PubEx)