A data-based online reinforcement learning algorithm satisfying probably approximately correct principle
2014 ◽
Vol 26
(4)
◽
pp. 775-787
◽
2021 ◽
pp. 1-11