The policy iteration algorithm for average reward Markov decision processes with general state space
1997 ◽
Vol 42
(12)
◽
pp. 1663-1680
◽
2016 ◽
Vol 133
(10)
◽
pp. 28-33
◽
Keyword(s):
2012 ◽
Vol 388
(2)
◽
pp. 1254-1267
◽
Keyword(s):
Keyword(s):
2000 ◽
Vol 14
(4)
◽
pp. 533-548
2003 ◽
Vol 17
(2)
◽
pp. 213-234
◽
Keyword(s):