On Optimality of Myopic Policy for Restless Multi-Armed Bandit Problem: An Axiomatic Approach

2012 ◽ Vol 60 (1) ◽ pp. 300-309
Author(s): Kehao Wang ◽ Lin Chen

1999 ◽ Vol 12 (2) ◽ pp. 151-160
Author(s): Doncho S. Donchev

We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a β-discounted reward and prove that a myopic policy is optimal.


Optimization ◽ 1976 ◽ Vol 7 (3) ◽ pp. 471-475
Author(s): P.W. Jones

2007
Author(s): Dennis Garlick ◽ Aaron P. Blaisdell
