A novel Q-learning algorithm with function approximation for constrained Markov decision processes
2012 ◽
Vol 153
(3)
◽
pp. 688-708
◽
2010 ◽
Vol 59
(12)
◽
pp. 760-766
◽
2007 ◽
Vol 55
(5)
◽
pp. 2170-2181
◽
2019 ◽
Vol 57
(5)
◽
pp. 3118-3136
◽