Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy

We show that the fluid approximation to Whittle's index policy for restless bandits has a globally asymptotically stable equilibrium point when the bandits move on just three states. It follows that in this case the index policy is asymptotic optimal.

Download Full-text

An Index Policy for Multiarmed Multimode Restless Bandits

Proceedings of the 3rd International Conference on Performance Evaluation Methodologies and Tools ◽

10.4108/icst.valuetools2008.4410 ◽

2008 ◽

Author(s):

José Niño-Mora

Keyword(s):

Index Policy ◽

Restless Bandits

Download Full-text

Index policies for a class of discounted restless bandits

Advances in Applied Probability ◽

10.1017/s0001867800011903 ◽

2002 ◽

Vol 34 (04) ◽

pp. 754-774 ◽

Cited By ~ 8

Author(s):

K. D. Glazebrook ◽

J. Niño-Mora ◽

P. S. Ansell

Keyword(s):

Conservation Laws ◽

Special Class ◽

Computational Study ◽

Bandit Problems ◽

Index Policy ◽

Restless Bandit ◽

Restless Bandits ◽

Index Policies ◽

Strong Performance ◽

Dual Speed

The paper concerns a class of discounted restless bandit problems which possess an indexability property. Conservation laws yield an expression for the reward suboptimality of a general policy. These results are utilised to study the closeness to optimality of an index policy for a special class of simple and natural dual speed restless bandits for which indexability is guaranteed. The strong performance of the index policy is confirmed by a computational study.

Download Full-text

Opportunistic Scheduling as Restless Bandits

IEEE Transactions on Control of Network Systems ◽

10.1109/tcns.2017.2774046 ◽

2018 ◽

Vol 5 (4) ◽

pp. 1952-1961 ◽

Cited By ~ 9

Author(s):

Vivek S. Borkar ◽

Gaurav S. Kasbekar ◽

Sarath Pattathil ◽

Priyesh Y. Shetty

Keyword(s):

Opportunistic Scheduling ◽

Restless Bandits

Download Full-text

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

Mathematics ◽

10.3390/math8122226 ◽

2020 ◽

Vol 8 (12) ◽

pp. 2226 ◽

Cited By ~ 1

Author(s):

José Niño-Mora

Keyword(s):

Numerical Study ◽

Index Policy ◽

State Spaces ◽

Restless Bandit ◽

Restless Bandits ◽

Pivoting Algorithm ◽

Markov Decision ◽

Whittle Index ◽

Decision Epoch ◽

Change State

The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and which can change state while passive. It further provides a practical heuristic priority-index policy for the computationally intractable multi-armed restless bandit problem, which has been widely applied over the last three decades in multifarious settings, yet mostly restricted to project models with a one-dimensional state. This is due in part to the difficulty of establishing indexability (existence of the index) and of computing the index for projects with large state spaces. This paper draws on the author’s prior results on sufficient indexability conditions and an adaptive-greedy algorithmic scheme for restless bandits to obtain a new fast-pivoting algorithm that computes the n Whittle index values of an n-state restless bandit by performing, after an initialization stage, n steps that entail (2/3)n3+O(n2) arithmetic operations. This algorithm also draws on the parametric simplex method, and is based on elucidating the pattern of parametric simplex tableaux, which allows to exploit special structure to substantially simplify and reduce the complexity of simplex pivoting steps. A numerical study demonstrates substantial runtime speed-ups versus alternative algorithms.

Download Full-text

Addendum to ‘On an index policy for restless bandits'

Advances in Applied Probability ◽

10.1017/s0001867800023582 ◽

1991 ◽

Vol 23 (02) ◽

pp. 429-430 ◽

Cited By ~ 1

Author(s):

Richard R. Weber ◽

Gideon Weiss

Keyword(s):

Equilibrium Point ◽

Stable Equilibrium ◽

Stable Equilibrium Point ◽

Asymptotically Stable ◽

Index Policy ◽

Fluid Approximation ◽

Globally Asymptotically Stable ◽

Restless Bandits

We show that the fluid approximation to Whittle's index policy for restless bandits has a globally asymptotically stable equilibrium point when the bandits move on just three states. It follows that in this case the index policy is asymptotic optimal.

Download Full-text

Index policies for a class of discounted restless bandits

Advances in Applied Probability ◽

10.1239/aap/1037990952 ◽

2002 ◽

Vol 34 (4) ◽

pp. 754-774 ◽

Cited By ~ 18

Author(s):

K. D. Glazebrook ◽

J. Niño-Mora ◽

P. S. Ansell

Keyword(s):

Conservation Laws ◽

Special Class ◽

Computational Study ◽

Bandit Problems ◽

Index Policy ◽

Restless Bandit ◽

Restless Bandits ◽

Index Policies ◽

Strong Performance ◽

Dual Speed

The paper concerns a class of discounted restless bandit problems which possess an indexability property. Conservation laws yield an expression for the reward suboptimality of a general policy. These results are utilised to study the closeness to optimality of an index policy for a special class of simple and natural dual speed restless bandits for which indexability is guaranteed. The strong performance of the index policy is confirmed by a computational study.

Download Full-text

Whittle Index Policy for Opportunistic Scheduling: Heterogeneous Two-State Channels

Restless Multi-Armed Bandit in Opportunistic Scheduling ◽

10.1007/978-3-030-69959-8_3 ◽

2021 ◽

pp. 37-77

Author(s):

Kehao Wang ◽

Lin Chen

Keyword(s):

Opportunistic Scheduling ◽

Index Policy ◽

Whittle Index

Download Full-text