Fast gradient-descent methods for temporal-difference learning with linear function approximation
2021 ◽ Vol 1827 (1) ◽ pp. 012186
Keyword(s):