An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
Keyword(s):
Keyword(s):
2007 ◽
Vol 178
(3)
◽
pp. 808-818
◽
Keyword(s):
2012 ◽
Vol 38
(5)
◽
pp. 673-687
◽
1992 ◽
Vol 43
(11)
◽
pp. 1095-1102