Multiagent Learning in Large Anonymous Games

2011 ◽  
Vol 40 ◽  
pp. 571-598 ◽  
Author(s):  
I. A. Kash ◽  
E. J. Friedman ◽  
J. Y. Halpern

In large systems, it is important for agents to learn to act effectively, but sophisticated multi-agent learning algorithms generally do not scale. An alternative approach is to identify restricted classes of games in which simple, efficient algorithms converge. It is shown that stage learning efficiently converges to Nash equilibria in large anonymous games if best-reply dynamics converge. Two features are identified that improve convergence. First, rather than making learning more difficult, having more agents is actually beneficial in many settings. Second, providing agents with statistical information about the behavior of others can significantly reduce the number of observations needed.
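A minimal sketch may make the stage-learning idea concrete. It is our illustration, not the authors' code: in a toy anonymous congestion game, payoffs depend only on an agent's own action and the fraction of agents choosing each action; agents explore within a stage, and at the stage boundary each agent adopts its empirically best reply. All names and constants (N_AGENTS, STAGES, payoff, ...) are invented for illustration.

```python
import random
from collections import Counter

# Toy anonymous congestion game: an agent's payoff depends only on its own
# action and the fraction of agents choosing each action. All constants
# below are illustrative assumptions, not values from the paper.
N_AGENTS, ACTIONS, STAGES, ROUNDS, EPS = 100, [0, 1, 2], 30, 50, 0.05

def payoff(action, fractions):
    # Congestion: an action is worth less the more agents pick it.
    return 1.0 - fractions[action]

choices = [random.choice(ACTIONS) for _ in range(N_AGENTS)]
for stage in range(STAGES):
    stats = [{a: [0.0, 0] for a in ACTIONS} for _ in range(N_AGENTS)]
    for _ in range(ROUNDS):
        # Within a stage, each agent mostly plays its current choice,
        # exploring uniformly with probability EPS.
        played = [c if random.random() > EPS else random.choice(ACTIONS)
                  for c in choices]
        counts = Counter(played)
        fractions = {a: counts[a] / N_AGENTS for a in ACTIONS}
        for i, a in enumerate(played):
            stats[i][a][0] += payoff(a, fractions)
            stats[i][a][1] += 1
    # Stage boundary: each agent switches to its empirically best action,
    # i.e. a best reply to the observed behavior of the crowd.
    choices = [max(ACTIONS,
                   key=lambda a: stats[i][a][0] / max(stats[i][a][1], 1))
               for i in range(N_AGENTS)]

print(Counter(choices))  # roughly even split near the equilibrium
```

With congestion payoffs like these, best-reply dynamics converge, which is exactly the regime in which the convergence result above applies.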

IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Giuseppe Caso ◽  
Ozgu Alay ◽  
Guido Carlo Ferrante ◽  
Luca De Nardis ◽  
Maria-Gabriella Di Benedetto ◽  
...  

2013 ◽  
Vol 47 ◽  
pp. 441-473 ◽  
Author(s):  
L. Cigler ◽  
B. Faltings

To achieve an optimal outcome in many situations, agents need to choose distinct actions from one another. This is notably the case in many resource allocation problems, where a single resource can be used by only one agent at a time. How should the designer of a multi-agent system program its identical agents so that each behaves differently? From a game-theoretic perspective, such situations lead to undesirable Nash equilibria. Consider, for example, a resource allocation game in which two players compete for exclusive access to a single resource. It has three Nash equilibria: the two pure-strategy equilibria are efficient but not fair, while the one mixed-strategy equilibrium is fair but not efficient. Aumann's notion of correlated equilibrium fixes this problem: it assumes a correlation device that suggests an action to each agent. However, such a "smart" coordination device might not be available. We propose using a randomly chosen, "stupid" integer coordination signal; "smart" agents learn which action to take for each value of the coordination signal. We present a multi-agent learning algorithm that converges in a polynomial number of steps to a correlated equilibrium of a channel allocation game, a variant of the resource allocation game. We show that, for each coordination signal value, the agents learn to play a randomly chosen pure-strategy Nash equilibrium of the game. The outcome is therefore an efficient correlated equilibrium, and it becomes fairer as the number of available coordination signal values increases.
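As a concrete toy version of the coordination-signal idea, the sketch below uses a simple win-stay / lose-randomize backoff rule rather than the paper's actual algorithm; the agent count, the number of signal values K, and the backoff probability are all illustrative assumptions.

```python
import random

# Two agents, one shared channel, K values of a public random signal.
# The backoff rule is a simplified win-stay / lose-randomize heuristic,
# not the paper's exact algorithm; all constants are illustrative.
N_AGENTS, K, ROUNDS = 2, 4, 5000
ACCESS, YIELD = 1, 0

# Each agent keeps one action per value of the coordination signal.
policy = [[random.choice([ACCESS, YIELD]) for _ in range(K)]
          for _ in range(N_AGENTS)]

for _ in range(ROUNDS):
    k = random.randrange(K)                 # the "stupid" public signal
    acts = [policy[i][k] for i in range(N_AGENTS)]
    accessors = [i for i, a in enumerate(acts) if a == ACCESS]
    for i in range(N_AGENTS):
        if acts[i] == ACCESS and len(accessors) > 1:
            # Collision: back off with some probability.
            if random.random() < 0.5:
                policy[i][k] = YIELD
        elif acts[i] == YIELD and not accessors:
            # Idle channel: try accessing next time this signal appears.
            policy[i][k] = ACCESS

for k in range(K):                          # one pure NE per signal value
    print(k, [policy[i][k] for i in range(N_AGENTS)])
```

Once this process settles, each signal value indexes a pure-strategy Nash equilibrium (exactly one agent accesses), so the induced joint play is a correlated equilibrium whose fairness grows with K.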


2015 ◽  
Vol 53 ◽  
pp. 659-697 ◽  
Author(s):  
Daan Bloembergen ◽  
Karl Tuyls ◽  
Daniel Hennes ◽  
Michael Kaisers

The interaction of multiple autonomous agents gives rise to highly dynamic and nondeterministic environments, contributing to the complexity of applications such as automated financial markets, smart grids, or robotics. Due to the sheer number of situations that may arise, it is not possible to foresee and program the optimal behaviour for all agents beforehand. Consequently, it becomes essential for the success of the system that the agents can learn their optimal behaviour and adapt to new situations or circumstances. The past two decades have seen the emergence of reinforcement learning, in both single-agent and multi-agent settings, as a strong, robust and adaptive learning paradigm. Progress has been substantial, and a wide range of algorithms are now available. An important challenge in the domain of multi-agent learning is to gain qualitative insights into the resulting system dynamics. In the past decade, tools and methods from evolutionary game theory have been successfully employed to formally study multi-agent learning dynamics in strategic interactions. This article surveys the dynamical models that have been derived for various multi-agent reinforcement learning algorithms, making it possible to study and compare them qualitatively. Furthermore, new learning algorithms that have been introduced using these evolutionary game-theoretic tools are reviewed. The evolutionary models can be used to study complex strategic interactions; examples of such analysis are given for the domains of automated trading in stock markets and collision avoidance in multi-robot systems. The paper provides a roadmap of the progress achieved in analysing the evolutionary dynamics of multi-agent learning, highlighting the main results and accomplishments.
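A classical instance of this link between learning and evolutionary dynamics is that Cross learning converges, in expectation, to the replicator dynamics. The sketch below (ours, for illustration only) numerically integrates the two-population replicator equations for Matching Pennies; the payoff matrices, step size, and horizon are arbitrary choices rather than anything prescribed by the survey.

```python
import numpy as np

# Two-population replicator dynamics for Matching Pennies. Cross learning
# is known to converge, in expectation, to these dynamics; matrices, step
# size, and horizon here are illustrative choices.
A = np.array([[1., -1.], [-1., 1.]])   # row player's payoff matrix
B = -A                                 # zero-sum: column player's payoffs

def replicator_step(x, y, dt=0.01):
    # x, y are the two populations' mixed strategies (probability vectors).
    fx = A @ y                         # fitness of each row action vs. y
    fy = B.T @ x                       # fitness of each column action vs. x
    x = x + dt * x * (fx - x @ fx)     # grow actions doing better than average
    y = y + dt * y * (fy - y @ fy)
    return x / x.sum(), y / y.sum()

x, y = np.array([0.9, 0.1]), np.array([0.2, 0.8])
for _ in range(2000):
    x, y = replicator_step(x, y)
print(x, y)  # trajectories cycle around the mixed equilibrium (0.5, 0.5)
```

Plotting x and y over time reproduces the cyclic orbits around the mixed equilibrium that such dynamical analyses use to explain why naive independent learners fail to converge in zero-sum interactions.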


2021 ◽  
Vol 22 (2) ◽  
pp. 1-38
Author(s):  
Julian Gutierrez ◽  
Paul Harrenstein ◽  
Giuseppe Perelli ◽  
Michael Wooldridge

We define and investigate a novel notion of expressiveness for temporal logics that is based on game-theoretic equilibria of multi-agent systems. We use iterated Boolean games as our abstract model of multi-agent systems [Gutierrez et al. 2013, 2015a]. In such a game, each agent i has a goal γi, represented using (a fragment of) Linear Temporal Logic (LTL). The goal γi captures agent i's preferences, in the sense that the models of γi represent system behaviours that would satisfy i. Each player i controls a subset Φi of the Boolean variables, and at each round in the game, player i is at liberty to choose values for the variables in Φi in any way that she sees fit. Play continues for an infinite sequence of rounds, and so as players act they collectively trace out a model for LTL formulae, which for every player will either satisfy or fail to satisfy her goal. Players are assumed to act strategically, taking into account the goals of other players, in an attempt to bring about computations satisfying their goal. In this setting, we apply the standard game-theoretic concept of (pure) Nash equilibrium. The (possibly empty) set of Nash equilibria of an iterated Boolean game can be understood as inducing a set of computations, each computation representing one way the system could evolve if players chose strategies that together constitute a Nash equilibrium. Such a set of equilibrium computations expresses a temporal property, which may or may not be expressible within a particular fragment of LTL. The new notion of expressiveness that we formally define and investigate is then as follows: which temporal properties are characterised by the Nash equilibria of games in which agent goals are expressed in specific fragments of LTL? We formally define and investigate this notion of expressiveness for a range of LTL fragments. For example, a very natural question is the following: suppose we have an iterated Boolean game in which every goal is represented using a particular fragment L of LTL; is it then always the case that the equilibria of the game can be characterised within L? We show that this is not true in general.
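The following toy sketch, entirely our own and heavily simplified, may make the setup concrete: two players each control one Boolean variable, strategies are restricted to be memoryless so that every play is eventually periodic, and "infinitely often" goals can be checked on the periodic suffix. The variable names and goals are invented for illustration.

```python
# Toy iterated Boolean game: player 0 controls variable p, player 1
# controls q. Strategies are memoryless (the next value depends only on
# the current valuation), so every play is eventually periodic and
# "infinitely often" goals can be checked on the periodic suffix.

def play(strat0, strat1, init=(False, False), rounds=32):
    state, trace = init, []
    for _ in range(rounds):
        trace.append(state)
        state = (strat0(state), strat1(state))  # each sets her own variable
    return trace

# With 4 possible valuations, the loop has period at most 4, so the last
# 8 states are guaranteed to contain at least one full cycle.
def goal0(trace): return any(p == q for p, q in trace[-8:])  # ~ GF(p == q)
def goal1(trace): return any(q for _, q in trace[-8:])       # ~ GF q

s0 = lambda st: st[1]        # player 0 copies q
s1 = lambda st: not st[1]    # player 1 flips q every round
trace = play(s0, s1)
print(goal0(trace), goal1(trace))  # False True: only player 1 is satisfied
```

In the full setting, strategies may depend on the entire history and goals are arbitrary LTL formulae; the question above is which temporal properties the equilibrium computations of such games can express.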


2021 ◽  
Vol 11 (11) ◽  
pp. 4948
Author(s):  
Lorenzo Canese ◽  
Gian Carlo Cardarilli ◽  
Luca Di Nunzio ◽  
Rocco Fazzolari ◽  
Daniele Giardino ◽  
...  

In this review, we present an analysis of the most widely used multi-agent reinforcement learning algorithms. Starting from single-agent reinforcement learning, we focus on the most critical issues that must be taken into account when extending these algorithms to multi-agent scenarios. The analyzed algorithms are grouped according to their features. We present a detailed taxonomy of the main multi-agent approaches proposed in the literature, focusing on their underlying mathematical models. For each algorithm, we describe the possible application fields and point out its pros and cons. The described multi-agent algorithms are compared in terms of the most important characteristics for multi-agent reinforcement learning applications, namely nonstationarity, scalability, and observability. We also describe the most common benchmark environments used to evaluate the performance of the considered methods.
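As an illustration of the simplest single-agent-to-multi-agent extension such a taxonomy covers, here is a sketch of independent Q-learning on a two-agent coordination game: each agent runs ordinary Q-learning and treats the other agent as part of the environment, which is precisely the source of the nonstationarity discussed above. The payoff table and hyperparameters are invented for illustration.

```python
import random

# Independent Q-learning on a stateless two-agent coordination game.
# Payoffs and hyperparameters are invented for illustration.
ACTIONS, EPISODES, ALPHA, EPS = [0, 1], 5000, 0.1, 0.1
PAYOFF = {(0, 0): 1.0, (1, 1): 1.0, (0, 1): 0.0, (1, 0): 0.0}

Q = [[0.0, 0.0], [0.0, 0.0]]            # Q[agent][action]

def act(i):
    if random.random() < EPS:           # epsilon-greedy exploration
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[i][a])

for _ in range(EPISODES):
    a0, a1 = act(0), act(1)
    r = PAYOFF[(a0, a1)]                # shared team reward
    Q[0][a0] += ALPHA * (r - Q[0][a0])  # each agent updates independently,
    Q[1][a1] += ALPHA * (r - Q[1][a1])  # treating the other as environment

print(Q)  # both agents typically lock onto the same coordinated action
```

From each agent's perspective the reward for a fixed action drifts as the other agent learns, which is the nonstationarity that the surveyed multi-agent algorithms are designed to handle.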


2021 ◽  
Vol 54 (5) ◽  
pp. 1-35
Author(s):  
Shubham Pateria ◽  
Budhitama Subagdja ◽  
Ah-hwee Tan ◽  
Chai Quek

Hierarchical Reinforcement Learning (HRL) enables the autonomous decomposition of challenging long-horizon decision-making tasks into simpler subtasks. Over the past years, the landscape of HRL research has grown profoundly, resulting in a copious body of approaches. A comprehensive overview of this vast landscape is necessary to study HRL in an organized manner. We provide a survey of the diverse HRL approaches concerning the challenges of learning hierarchical policies, subtask discovery, transfer learning, and multi-agent learning using HRL. The survey is presented according to a novel taxonomy of the approaches. Based on the survey, a set of important open problems is proposed to motivate future research in HRL. Furthermore, we outline a few suitable task domains for evaluating HRL approaches and a few interesting examples of practical applications of HRL in the Supplementary Material.
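To make the notion of a hierarchical policy concrete, here is a toy options-style sketch of our own: a high-level policy selects subgoals, and a low-level policy executes primitive actions until the chosen subtask terminates. The corridor environment, subgoals, and the greedy top-level rule are all invented for illustration.

```python
# Toy options-style hierarchy on a 1-D corridor: a high-level policy
# picks a subgoal (an option), and a low-level policy issues primitive
# steps until that subtask terminates.
SUBGOALS = [5, 10]

def low_level(pos, subgoal):
    # Primitive policy for one option: step toward the subgoal.
    while pos != subgoal:
        pos += 1 if pos < subgoal else -1
        yield pos

def high_level(pos, goal=10):
    # Naive top layer: repeatedly pick the nearest subgoal that is
    # strictly closer to the task goal (a learned policy would choose
    # options by value instead), then run the option to termination.
    while pos != goal:
        option = min((k for k in range(len(SUBGOALS))
                      if abs(SUBGOALS[k] - goal) < abs(pos - goal)),
                     key=lambda k: abs(SUBGOALS[k] - pos))
        for pos in low_level(pos, SUBGOALS[option]):
            pass
    return pos

print(high_level(0))  # reaches 10 via the intermediate subgoal at 5
```

The surveyed approaches replace both the hand-written subgoal list (subtask discovery) and the greedy top-level rule (learning hierarchical policies) with learned components.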

