Policy gradient reinforcement learning method for discrete-time linear quadratic regulation problem using estimated state value function

Author(s):  
Tomotake Sasaki ◽  
Eiji Uchibe ◽  
Hidenao Iwane ◽  
Hitoshi Yanami ◽  
Hirokazu Anai ◽  
...  
Sign in / Sign up

Export Citation Format

Share Document