Whittle Index Policy for Opportunistic Scheduling: Heterogeneous Two-State Channels

We consider the multi-armed bandit problem with penalties for switching that include setup delays and costs, extending the former results of the author for the special case with no switching delays. A priority index for projects with setup delays that characterizes, in part, optimal policies was introduced by Asawa and Teneketzis in 1996, yet without giving a means of computing it. We present a fast two-stage index computing method, which computes the continuation index (which applies when the project has been set up) in a first stage and certain extra quantities with cubic (arithmetic-operation) complexity in the number of project states and then computes the switching index (which applies when the project is not set up), in a second stage, with quadratic complexity. The approach is based on new methodological advances on restless bandit indexation, which are introduced and deployed herein, being motivated by the limitations of previous results, exploiting the fact that the aforementioned index is the Whittle index of the project in its restless reformulation. A numerical study demonstrates substantial runtime speed-ups of the new two-stage index algorithm versus a general one-stage Whittle index algorithm. The study further gives evidence that, in a multi-project setting, the index policy is consistently nearly optimal.

Download Full-text

Whittle Index Policy for Multichannel Scheduling in Queueing Systems

2019 IEEE International Symposium on Information Theory (ISIT) ◽

10.1109/isit.2019.8849774 ◽

2019 ◽

Cited By ~ 1

Author(s):

Saad Kriouile ◽

Maialen Larranaga ◽

Mohamad Assaad

Keyword(s):

Queueing Systems ◽

Index Policy ◽

Whittle Index

Download Full-text

Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy

GLOBECOM 2017 - 2017 IEEE Global Communications Conference ◽

10.1109/glocom.2017.8254159 ◽

2017 ◽

Cited By ~ 3

Author(s):

Kehao Wang ◽

Jihong Yu ◽

Lin Chen ◽

Moe Win

Keyword(s):

Opportunistic Scheduling ◽

Index Policy ◽

Restless Bandits

Download Full-text

Index policies for discounted bandit problems with availability constraints

Advances in Applied Probability ◽

10.1017/s0001867800002573 ◽

2008 ◽

Vol 40 (02) ◽

pp. 377-400 ◽

Cited By ~ 1

Author(s):

Savas Dayanik ◽

Warren Powell ◽

Kazutoshi Yamazaki

Keyword(s):

Bandit Problem ◽

Bandit Problems ◽

Index Policy ◽

State Action ◽

Index Policies ◽

Availability Constraints ◽

Whittle Index ◽

Multiarmed Bandit

A multiarmed bandit problem is studied when the arms are not always available. The arms are first assumed to be intermittently available with some state/action-dependent probabilities. It is proven that no index policy can attain the maximum expected total discounted reward in every instance of that problem. The Whittle index policy is derived, and its properties are studied. Then it is assumed that the arms may break down, but repair is an option at some cost, and the new Whittle index policy is derived. Both problems are indexable. The proposed index policies cannot be dominated by any other index policy over all multiarmed bandit problems considered here. Whittle indices are evaluated for Bernoulli arms with unknown success probabilities.

Download Full-text

Whittle index policy for crawling ephemeral content

2015 54th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc.2015.7403283 ◽

2015 ◽

Cited By ~ 3

Author(s):

Konstantin E. Avrachenkov ◽

Vivek S. Borkar

Keyword(s):

Index Policy ◽

Whittle Index

Download Full-text

Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy

IEEE Transactions on Wireless Communications ◽

10.1109/twc.2019.2931690 ◽

2019 ◽

Vol 18 (10) ◽

pp. 4997-5010

Author(s):

Kehao Wang ◽

Jihong Yu ◽

Lin Chen ◽

Pan Zhou ◽

Xiaohu Ge ◽

...

Keyword(s):

Opportunistic Scheduling ◽

Index Policy ◽

Restless Bandits

Download Full-text

Index policies for discounted bandit problems with availability constraints

Advances in Applied Probability ◽

10.1239/aap/1214950209 ◽

2008 ◽

Vol 40 (2) ◽

pp. 377-400 ◽

Cited By ~ 5

Author(s):

Savas Dayanik ◽

Warren Powell ◽

Kazutoshi Yamazaki

Keyword(s):

Bandit Problem ◽

Bandit Problems ◽

Index Policy ◽

State Action ◽

Index Policies ◽

Availability Constraints ◽

Whittle Index ◽

Multiarmed Bandit

A multiarmed bandit problem is studied when the arms are not always available. The arms are first assumed to be intermittently available with some state/action-dependent probabilities. It is proven that no index policy can attain the maximum expected total discounted reward in every instance of that problem. The Whittle index policy is derived, and its properties are studied. Then it is assumed that the arms may break down, but repair is an option at some cost, and the new Whittle index policy is derived. Both problems are indexable. The proposed index policies cannot be dominated by any other index policy over all multiarmed bandit problems considered here. Whittle indices are evaluated for Bernoulli arms with unknown success probabilities.

Download Full-text

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

Mathematics ◽

10.3390/math8122226 ◽

2020 ◽

Vol 8 (12) ◽

pp. 2226 ◽

Cited By ~ 1

Author(s):

José Niño-Mora

Keyword(s):

Numerical Study ◽

Index Policy ◽

State Spaces ◽

Restless Bandit ◽

Restless Bandits ◽

Pivoting Algorithm ◽

Markov Decision ◽

Whittle Index ◽

Decision Epoch ◽

Change State

The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and which can change state while passive. It further provides a practical heuristic priority-index policy for the computationally intractable multi-armed restless bandit problem, which has been widely applied over the last three decades in multifarious settings, yet mostly restricted to project models with a one-dimensional state. This is due in part to the difficulty of establishing indexability (existence of the index) and of computing the index for projects with large state spaces. This paper draws on the author’s prior results on sufficient indexability conditions and an adaptive-greedy algorithmic scheme for restless bandits to obtain a new fast-pivoting algorithm that computes the n Whittle index values of an n-state restless bandit by performing, after an initialization stage, n steps that entail (2/3)n3+O(n2) arithmetic operations. This algorithm also draws on the parametric simplex method, and is based on elucidating the pattern of parametric simplex tableaux, which allows to exploit special structure to substantially simplify and reduce the complexity of simplex pivoting steps. A numerical study demonstrates substantial runtime speed-ups versus alternative algorithms.

Download Full-text

Whittle Index Policy for Dynamic Multichannel Allocation in Remote State Estimation

IEEE Transactions on Automatic Control ◽

10.1109/tac.2019.2912492 ◽

2020 ◽

Vol 65 (2) ◽

pp. 591-603 ◽

Cited By ~ 2

Author(s):

Jiazheng Wang ◽

Xiaoqiang Ren ◽

Yilin Mo ◽

Ling Shi

Keyword(s):

State Estimation ◽

Index Policy ◽

Whittle Index ◽

Remote State Estimation

Download Full-text

Whittle Index Policy for Opportunistic Scheduling: Heterogeneous Two-State Channels

Whittle Index Policy for Opportunistic Scheduling: Heterogeneous Multistate Channels

Fast Two-Stage Computation of an Index Policy for Multi-Armed Bandits with Setup Delays

Whittle Index Policy for Multichannel Scheduling in Queueing Systems

Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy

Index policies for discounted bandit problems with availability constraints

Whittle index policy for crawling ephemeral content

Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy

Index policies for discounted bandit problems with availability constraints

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

Whittle Index Policy for Dynamic Multichannel Allocation in Remote State Estimation

Export Citation Format