Actor-Critic Learning Control Based on $\ell_{2}$-Regularized Temporal-Difference Prediction With Gradient Correction
2018, Vol 29 (12), pp. 5899-5909