Quantal Response Equilibrium in Extensive-Form Games

Author(s):  
Jacob K. Goeree ◽  
Charles A. Holt ◽  
Thomas R. Palfrey

This chapter lays out the general theory of quantal response equilibrium (QRE) for extensive-form games. The formulation of the model is necessarily more complicated because timing and information now play a direct role in the decision maker's choice. This can have interesting and unanticipated consequences. It first describes four possible ways to define QRE in extensive-form games, depending on how the games are represented. It then turns to the structural agent quantal response equilibrium (AQRE) extensive-form games. This is followed by a discussion of the logit AQRE model, which implies a unique selection from the set of Nash equilibria. This selection is defined by the connected component of the logit AQRE correspondence. The final section presents an AQRE analysis of the centipede game.

Author(s):  
Jacob K. Goeree ◽  
Charles A. Holt ◽  
Thomas R. Palfrey

This chapter explores questions related to learning and dynamics. The first part explores dynamic quantal response equilibrium models where strategies are conditioned on observed histories of past decisions and outcomes of stage games. The second part considers models in which players are learning about others' behavior via a process in which they may update and respond to current beliefs in a noisy (quantal) manner. The final section explores learning models that involve quantal responses to beliefs formed by processing information from finite (but possibly long) histories of prior or observed action profiles. The formulation permits consideration of a wide variety of exogenous or even endogenous (e.g., least squares) learning rules.


2015 ◽  
Vol 18 (4) ◽  
pp. 762-763 ◽  
Author(s):  
Richard D. McKelvey ◽  
Thomas R. Palfrey

Author(s):  
Gabriele Farina ◽  
Christian Kroer ◽  
Tuomas Sandholm

Regret minimization is a powerful tool for solving large-scale extensive-form games. State-of-the-art methods rely on minimizing regret locally at each decision point. In this work we derive a new framework for regret minimization on sequential decision problems and extensive-form games with general compact convex sets at each decision point and general convex losses, as opposed to prior work which has been for simplex decision points and linear losses. We call our framework laminar regret decomposition. It generalizes the CFR algorithm to this more general setting. Furthermore, our framework enables a new proof of CFR even in the known setting, which is derived from a perspective of decomposing polytope regret, thereby leading to an arguably simpler interpretation of the algorithm. Our generalization to convex compact sets and convex losses allows us to develop new algorithms for several problems: regularized sequential decision making, regularized Nash equilibria in zero-sum extensive-form games, and computing approximate extensive-form perfect equilibria. Our generalization also leads to the first regret-minimization algorithm for computing reduced-normal-form quantal response equilibria based on minimizing local regrets. Experiments show that our framework leads to algorithms that scale at a rate comparable to the fastest variants of counterfactual regret minimization for computing Nash equilibrium, and therefore our approach leads to the first algorithm for computing quantal response equilibria in extremely large games. Our algorithms for (quadratically) regularized equilibrium finding are orders of magnitude faster than the fastest algorithms for Nash equilibrium finding; this suggests regret-minimization algorithms based on decreasing regularization for Nash equilibrium finding as future work. Finally we show that our framework enables a new kind of scalable opponent exploitation approach.


2021 ◽  
Vol 288 (1962) ◽  
Author(s):  
Cecilia Lindig-León ◽  
Gerrit Schmid ◽  
Daniel A. Braun

The Nash equilibrium is one of the most central solution concepts to study strategic interactions between multiple players and has recently also been shown to capture sensorimotor interactions between players that are haptically coupled. While previous studies in behavioural economics have shown that systematic deviations from Nash equilibria in economic decision-making can be explained by the more general quantal response equilibria, such deviations have not been reported for the sensorimotor domain. Here we investigate haptically coupled dyads across three different sensorimotor games corresponding to the classic symmetric and asymmetric Prisoner's Dilemma, where the quantal response equilibrium predicts characteristic shifts across the three games, although the Nash equilibrium stays the same. We find that subjects exhibit the predicted deviations from the Nash solution. Furthermore, we show that taking into account subjects' priors for the games, we arrive at a more accurate description of bounded rational response equilibria that can be regarded as a quantal response equilibrium with non-uniform prior. Our results suggest that bounded rational response equilibria provide a general tool to explain sensorimotor interactions that include the Nash equilibrium as a special case in the absence of information processing limitations.


Author(s):  
Herbert Gintis

The extensive form of a game is informationally richer than the normal form since players gather information that allows them to update their subjective priors as the game progresses. For this reason, the study of rationalizability in extensive form games is more complex than the corresponding study in normal form games. There are two ways to use the added information to eliminate strategies that would not be chosen by a rational agent: backward induction and forward induction. The latter is relatively exotic (although more defensible). Backward induction, by far the most popular technique, employs the iterated elimination of weakly dominated strategies, arriving at the subgame perfect Nash equilibria—the equilibria that remain Nash equilibria in all subgames. An extensive form game is considered generic if it has a unique subgame perfect Nash equilibrium. This chapter develops the tools of modal logic and presents Robert Aumann's famous proof that common knowledge of rationality (CKR) implies backward induction. It concludes that Aumann is perfectly correct, and the real culprit is CKR itself. CKR is in fact self-contradictory when applied to extensive form games.


2009 ◽  
Vol 11 (04) ◽  
pp. 437-451 ◽  
Author(s):  
CHARLES AUDET ◽  
SLIM BELHAIZA ◽  
PIERRE HANSEN

This paper presents two new results on the enumeration of all extreme equilibria of the sequence form of a two person extensive game. The sequence form of an extensive game is expressed, for the first time to our knowledge, as a parametric linear 0 - 1 program. Considering Ext(P) as the set of all of the sequence form extreme Nash equilibria and Ext(Q) as the set of all the parametric linear 0 - 1 program extreme points, we show that Ext(P) ⊆ Ext(Q). Using exact arithmetics classes, the algorithm EχMIP Belhaiza (2002); Audet et al. (2006) is extended to enumerate all elements of Ext(Q). A small procedure is then applied in order to obtain all elements of Ext(P).


Sign in / Sign up

Export Citation Format

Share Document