Multi-Armed Bandits, Gittins Index, and its Calculation

On transforming an index for generalised bandit problems

Journal of Applied Probability ◽

10.2307/3214927 ◽

1995 ◽

Vol 32 (1) ◽

pp. 168-182 ◽

Cited By ~ 4

Author(s):

K. D. Glazebrook ◽

S. Greatrix

Keyword(s):

Dynamic Programming ◽

Policy Evaluation ◽

Gittins Index ◽

Bandit Problem ◽

Bandit Problems ◽

Index Policies

Nash (1980) demonstrated that index policies are optimal for a class of generalised bandit problem. A transform of the index concerned has many of the attributes of the Gittins index. The transformed index is positive-valued, with maximal values yielding optimal actions. It may be characterised as the value of a restart problem and is hence computable via dynamic programming methodologies. The transformed index can also be used in procedures for policy evaluation.

Download Full-text

Switching Costs and the Gittins Index

Econometrica ◽

10.2307/2951664 ◽

1994 ◽

Vol 62 (3) ◽

pp. 687 ◽

Cited By ~ 68

Author(s):

Jeffrey S. Banks ◽

Rangarajan K. Sundaram

Keyword(s):

Switching Costs ◽

Gittins Index

Download Full-text

Sequential Project Selection (Multi-Armed Bandits) and the Gittins Index

Deterministic and Stochastic Scheduling ◽

10.1007/978-94-009-7801-0_19 ◽

1982 ◽

pp. 333-341 ◽

Cited By ~ 3

Author(s):

P. Whittle

Keyword(s):

Project Selection ◽

Gittins Index

Download Full-text

Gittins Index

Encyclopedia of Operations Research and Management Science ◽

10.1007/978-1-4419-1153-7_200264 ◽

2013 ◽

pp. 644-644

Keyword(s):

Gittins Index

Download Full-text

Monotonic Approximation of the Gittins Index

Markov Processes and Controlled Markov Chains ◽

10.1007/978-1-4613-0265-0_22 ◽

2002 ◽

pp. 363-367 ◽

Cited By ~ 1

Author(s):

Xikui Wang

Keyword(s):

Gittins Index

Download Full-text

A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements

Mathematics of Operations Research ◽

10.1287/moor.1080.0342 ◽

2009 ◽

Vol 34 (1) ◽

pp. 26-44 ◽

Cited By ~ 10

Author(s):

K. D. Glazebrook ◽

R. Minty

Keyword(s):

Gittins Index ◽

Resource Requirements

Download Full-text

Some indexable families of restless bandit problems

Advances in Applied Probability ◽

10.1239/aap/1158684996 ◽

2006 ◽

Vol 38 (3) ◽

pp. 643-672 ◽

Cited By ~ 32

Author(s):

K. D. Glazebrook ◽

D. Ruiz-Hernandez ◽

C. Kirkbride

Keyword(s):

Index Theory ◽

Stochastic Scheduling ◽

Gittins Index ◽

Scheduling Problems ◽

Bandit Problems ◽

Index Policy ◽

Restless Bandit ◽

Machine Maintenance ◽

State Evolution ◽

Strong Performance

In 1988 Whittle introduced an important but intractable class of restless bandit problems which generalise the multiarmed bandit problems of Gittins by allowing state evolution for passive projects. Whittle's account deployed a Lagrangian relaxation of the optimisation problem to develop an index heuristic. Despite a developing body of evidence (both theoretical and empirical) which underscores the strong performance of Whittle's index policy, a continuing challenge to implementation is the need to establish that the competing projects all pass an indexability test. In this paper we employ Gittins' index theory to establish the indexability of (inter alia) general families of restless bandits which arise in problems of machine maintenance and stochastic scheduling problems with switching penalties. We also give formulae for the resulting Whittle indices. Numerical investigations testify to the outstandingly strong performance of the index heuristics concerned.

Download Full-text

Optimal decision indices for R&D project evaluation in the pharmaceutical industry: Pearson index versus Gittins index

European Journal of Operational Research ◽

10.1016/j.ejor.2006.01.011 ◽

2007 ◽

Vol 177 (2) ◽

pp. 1105-1112 ◽

Cited By ~ 16

Author(s):

Michael A. Talias

Keyword(s):

Pharmaceutical Industry ◽

Project Evaluation ◽

Optimal Decision ◽

Gittins Index

Download Full-text

Restless bandits: activity allocation in a changing world

Journal of Applied Probability ◽

10.2307/3214163 ◽

1988 ◽

Vol 25 (A) ◽

pp. 287-298 ◽

Cited By ~ 355

Author(s):

P. Whittle

Keyword(s):

Lagrange Multiplier ◽

Gittins Index ◽

Constant Ratio ◽

Expected Number ◽

Restless Bandits ◽

Changing World

We consider a population of n projects which in general continue to evolve whether in operation or not (although by different rules). It is desired to choose the projects in operation at each instant of time so as to maximise the expected rate of reward, under a constraint upon the expected number of projects in operation. The Lagrange multiplier associated with this constraint defines an index which reduces to the Gittins index when projects not being operated are static. If one is constrained to operate m projects exactly then arguments are advanced to support the conjecture that, for m and n large in constant ratio, the policy of operating the m projects of largest current index is nearly optimal. The index is evaluated for some particular projects.

Download Full-text

A Short Proof of the Gittins Index Theorem

The Annals of Applied Probability ◽

10.1214/aoap/1177005207 ◽

1994 ◽

Vol 4 (1) ◽

pp. 194-199 ◽

Cited By ~ 44

Author(s):

John N. Tsitsiklis

Keyword(s):

Short Proof ◽

Index Theorem ◽

Gittins Index

Download Full-text