An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions

Author(s):  
Yao Ma ◽  
Tingting Zhao ◽  
Kohei Hatano ◽  
Masashi Sugiyama
2015 ◽  
Vol 100 (2-3) ◽  
pp. 255-283 ◽  
Author(s):  
Matteo Pirotta ◽  
Marcello Restelli ◽  
Luca Bascetta

Sign in / Sign up

Export Citation Format

Share Document