Exact solution of the Bellman equation for a β-discounted reward in a two-armed bandit with switching arms
1999 ◽
Vol 12
(2)
◽
pp. 151-160
◽
We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a β-discounted reward and prove that a myopic policy is optimal.
2012 ◽
Vol 60
(1)
◽
pp. 300-309
◽
Keyword(s):
Keyword(s):
1986 ◽
Vol 47
(6)
◽
pp. 1029-1034
◽
2006 ◽
Vol 106
(2)
◽
pp. 115-130
◽
Keyword(s):