A unified approach to adaptive control of average reward Markov decision processes
Keyword(s):
1991 ◽
Vol 23
(1)
◽
pp. 193-207
◽
2003 ◽
Vol 286
(2)
◽
pp. 636-651
◽
Keyword(s):
2017 ◽
pp. 768-777
Keyword(s):
Keyword(s):
2007 ◽
pp. 373-387
◽
Keyword(s):
The policy iteration algorithm for average reward Markov decision processes with general state space
1997 ◽
Vol 42
(12)
◽
pp. 1663-1680
◽
2004 ◽
Vol 29
(2)
◽
pp. 339-352
◽
1991 ◽
Vol 28
(1)
◽
pp. 229-242
◽
Keyword(s):