Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
1987, Vol 32 (11), pp. 977-982
1987, Vol 32 (11), pp. 968-976
1988, Vol 33 (10), pp. 899-906
1985, Vol 6 (1), pp. 4-22
1997, Vol 11 (1), pp. 65-78
2008, Vol 40 (02), pp. 377-400