On effectiveness of the Mirror Decent Algorithm for a stochastic multi-armed bandit governed by a stationary finite Markov chain
2006 ◽
Vol 172
(1)
◽
pp. 267-285
◽
1980 ◽
Vol 25
(1)
◽
pp. 70-81
◽
Keyword(s):
1984 ◽
Vol 16
(04)
◽
pp. 804-818
◽
1999 ◽
Vol 21
(1)
◽
pp. 81-93
◽
1962 ◽
Vol 58
(2)
◽
pp. 286-298
◽
1996 ◽
Vol 91
(436)
◽
pp. 1595
◽
Keyword(s):