Mirror decent algorithm for a multi-armed bandit governed by a stationary finite state Markov chain
2003 ◽
Vol 17
(4)
◽
pp. 487-501
◽
2014 ◽
Vol 51
(4)
◽
pp. 1114-1132
◽
2005 ◽
Vol 37
(4)
◽
pp. 1015-1034
◽
Keyword(s):
2019 ◽
Vol 22
(08)
◽
pp. 1950047
◽
Keyword(s):
2004 ◽
Vol 2004
(3)
◽
pp. 197-208
◽
1982 ◽
Vol 19
(02)
◽
pp. 272-288
◽
Keyword(s):