Distributed Recommender Profiling and Selection with Gittins Indices

Author(s):  
Li-tung Weng ◽  
Yue Xu ◽  
Yuefeng Li ◽  
Richi Nayak

2007 ◽  
Vol 44 (2) ◽  
pp. 554-559 ◽  
Author(s):  
Roger Filliger ◽  
Max-Olivier Hongler

We explicitly calculate the dynamic allocation indices (i.e. the Gittins indices) for multi-armed bandit processes driven by superdiffusive noise sources. This class of models generalizes earlier results derived by Karatzas for diffusive processes. In particular, in this soluble class of superdiffusive models the Gittins indices explicitly depend on the noise state.



1991 ◽  
Vol 23 (4) ◽  
pp. 975-977
Author(s):  
You-Gan Wang ◽  
John Gittins

The Bernoulli/exponential target process is considered. Such processes have been found useful in modelling the search for active compounds in pharmaceutical research. An inequality is presented which improves a result of Gittins (1989), thus providing a better approximation to the Gittins indices which define the optimal search policy.
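The optimal search policy referred to above is defined by Gittins indices of Bayesian arms. As a rough illustration (a minimal sketch, not the Bernoulli/exponential target process itself), the following code approximates the Gittins index of a single Bernoulli arm with a Beta(a, b) posterior via the standard retirement formulation: value iteration over a truncated horizon, plus bisection on the per-step retirement reward. The discount factor `beta` and the truncation `horizon` are illustrative choices.

```python
import numpy as np

def gittins_bernoulli(a, b, beta=0.9, horizon=100, tol=1e-6):
    """Approximate Gittins index of a Bernoulli arm with Beta(a, b) posterior.

    Retirement formulation: the index is the per-step retirement reward at
    which the agent is indifferent between retiring forever and pulling the
    arm at least once more.  The dynamic programme is truncated after
    `horizon` further observations, so the result is an approximation.
    """
    def prefers_continuing(lam):
        retire = lam / (1.0 - beta)            # value of retiring forever
        # Value at the truncation depth: assume the agent retires there.
        V = np.full(horizon + 1, retire)
        for depth in range(horizon - 1, -1, -1):
            newV = np.empty(depth + 1)
            for s in range(depth + 1):         # s successes in `depth` pulls
                p = (a + s) / (a + b + depth)  # posterior mean success prob.
                cont = p * (1.0 + beta * V[s + 1]) + (1.0 - p) * beta * V[s]
                newV[s] = max(retire, cont)
            V = newV
        return V[0] > retire + 1e-12

    lo, hi = 0.0, 1.0                          # the index lies in [0, 1] here
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if prefers_continuing(mid):
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For a uniform Beta(1, 1) prior and beta = 0.9 this gives an index above the posterior mean 0.5, reflecting the value of exploration; improved closed-form approximations of the kind discussed in the abstract avoid this dynamic-programming computation entirely.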



2020 ◽  
Author(s):  
Daniel Russo

This note gives a short, self-contained proof of a sharp connection between Gittins indices and Bayesian upper confidence bound algorithms. I consider a Gaussian multiarmed bandit problem with discount factor γ. The Gittins index of an arm is shown to equal the γ-quantile of the posterior distribution of the arm's mean plus an error term that vanishes as γ → 1. In this sense, for sufficiently patient agents, a Gittins index measures the highest plausible mean reward of an arm in a manner equivalent to an upper confidence bound.
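Taking γ for the discount factor (the listing above lost the original formulas, so the symbol is an assumption), the quantile rule described in this note reduces to a one-line Bayes-UCB index for a Gaussian posterior. A minimal sketch:

```python
from statistics import NormalDist

def bayes_ucb_index(mu, sigma, gamma):
    """gamma-quantile of a N(mu, sigma^2) posterior over an arm's mean.

    Per the note above, for patient agents (gamma near 1) this quantile
    approximates the arm's Gittins index: the highest plausible mean
    reward of the arm at confidence level gamma.
    """
    return mu + sigma * NormalDist().inv_cdf(gamma)
```

An agent would pull the arm maximizing this index; as γ → 1 the exploration bonus sigma · Φ⁻¹(γ) grows, favouring arms whose means are still uncertain.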



Biometrika ◽  
1991 ◽  
Vol 78 (1) ◽  
pp. 101-111 ◽  
Author(s):  
You-Gan Wang


1988 ◽  
Vol 20 (2) ◽  
pp. 447-472 ◽  
Author(s):  
Tze Leung Lai ◽  
Zhiliang Ying

Asymptotic approximations are developed herein for the optimal policies in discounted multi-armed bandit problems in which new projects are continually appearing, commonly known as ‘open bandit problems’ or ‘arm-acquiring bandits’. It is shown that under certain stability assumptions the open bandit problem is asymptotically equivalent to a closed bandit problem in which there is no arrival of new projects, as the discount factor approaches 1. Applications of these results to optimal scheduling of queueing networks are given. In particular, Klimov's priority indices for scheduling queueing networks are shown to be limits of the Gittins indices for the associated closed bandit problem, and extensions of Klimov's results to preemptive policies and to unstable queueing systems are given.




2001 ◽  
Vol 33 (2) ◽  
pp. 365-390 ◽  
Author(s):  
R. T. Dunn ◽  
K. D. Glazebrook

We consider generalisations of two classical stochastic scheduling models, namely the discounted branching bandit and the discounted multi-armed bandit, to the case where the collection of machines available for processing is itself a stochastic process. Under rather mild conditions on the machine availability process we obtain performance guarantees for a range of controls based on Gittins indices. Various forms of asymptotic optimality are established for index-based limit policies as the discount rate approaches 0.





DGOR ◽  
1986 ◽  
pp. 548-548
Author(s):  
Lodewijk C. M. Kallenberg


