A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
2013 ◽
Vol 163
(2)
◽
pp. 674-684
◽
Keyword(s):
2015 ◽
Vol 52
(02)
◽
pp. 419-440
◽
1991 ◽
Vol 7
(1)
◽
pp. 6-16
◽
Keyword(s):
2007 ◽
pp. 263-277
◽
2000 ◽
Vol 14
(4)
◽
pp. 533-548
1995 ◽
Vol 41
(1)
◽
pp. 89-108
◽