LEARNING TO PLAY K-ARMED BANDIT PROBLEMS

Proceedings of the 4th International Conference on Agents and Artificial Intelligence ◽

10.5220/0003733500740081 ◽

2012 ◽

Keyword(s):

Bandit Problems

Download Full-text

A Decentralized Communication Policy for Multi Agent Multi Armed Bandit Problems

2020 European Control Conference (ECC) ◽

10.23919/ecc51009.2020.9143811 ◽

2020 ◽

Author(s):

P. Pankayaraj ◽

D. H. S. Maithripala

Keyword(s):

Communication Policy ◽

Bandit Problems ◽

Download Full-text

On transforming an index for generalised bandit problems

Journal of Applied Probability ◽

10.2307/3214927 ◽

1995 ◽

Vol 32 (1) ◽

pp. 168-182 ◽

Author(s):

K. D. Glazebrook ◽

S. Greatrix

Keyword(s):

Dynamic Programming ◽

Policy Evaluation ◽

Gittins Index ◽

Bandit Problem ◽

Bandit Problems ◽

Nash (1980) demonstrated that index policies are optimal for a class of generalised bandit problem. A transform of the index concerned has many of the attributes of the Gittins index. The transformed index is positive-valued, with maximal values yielding optimal actions. It may be characterised as the value of a restart problem and is hence computable via dynamic programming methodologies. The transformed index can also be used in procedures for policy evaluation.

Download Full-text

Gaussian multi-armed bandit problems with multiple objectives

2016 American Control Conference (ACC) ◽

10.1109/acc.2016.7526494 ◽

2016 ◽

Author(s):

Paul Reverdy

Keyword(s):

Multiple Objectives ◽

Bandit Problems

Download Full-text

Bandit problems with arbitrary side observations

42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475) ◽

10.1109/cdc.2003.1273074 ◽

2004 ◽

Author(s):

Chih-Chun Wang ◽

S.R. Kulkami ◽

H.V. Poor

Keyword(s):

Bandit Problems

Download Full-text

Social Learning in One-Arm Bandit Problems

Econometrica ◽

10.1111/j.1468-0262.2007.00807.x ◽

2007 ◽

Vol 75 (6) ◽

pp. 1591-1611 ◽

Author(s):

Dinah Rosenberg ◽

Eilon Solan ◽

Nicolas Vieille

Keyword(s):

Social Learning ◽

Bandit Problems

Download Full-text

Foraging decisions as multi-armed bandit problems: Applying reinforcement learning algorithms to foraging data

Journal of Theoretical Biology ◽

10.1016/j.jtbi.2019.02.002 ◽

2019 ◽

Vol 467 ◽

pp. 48-56 ◽

Author(s):

Juliano Morimoto

Keyword(s):

Reinforcement Learning ◽

Learning Algorithms ◽

Bandit Problems ◽

Foraging Decisions

Download Full-text

Multi-armed bandit problems with heavy-tailed reward distributions

2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton) ◽

10.1109/allerton.2011.6120206 ◽

2011 ◽

Author(s):

Keqin Liu ◽

Qing Zhao

Keyword(s):

Bandit Problems ◽

Download Full-text

Approximate Indexability and Bandit Problems with Concave Rewards and Delayed Feedback

Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques - Lecture Notes in Computer Science ◽

10.1007/978-3-642-40328-6_14 ◽

2013 ◽

pp. 189-204

Author(s):

Sudipto Guha ◽

Kamesh Munagala

Keyword(s):

Delayed Feedback ◽

Bandit Problems

Download Full-text

One- and Two-Armed Bandit Problems

Encyclopedia of Statistical Sciences ◽

10.1002/0471667196.ess1852.pub2 ◽

2006 ◽

Author(s):

Donald A. Berry

Keyword(s):

Bandit Problems

Download Full-text

Corrections to “Satisficing in Multiarmed Bandit Problems”

IEEE Transactions on Automatic Control ◽

10.1109/tac.2020.2981433 ◽

2021 ◽

Vol 66 (1) ◽

pp. 476-478

Author(s):

Paul Reverdy ◽

Vaibhav Srivastava ◽

Naomi Ehrich Leonard

Keyword(s):

Bandit Problems ◽

Multiarmed Bandit

Download Full-text