Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach
2019 ◽
Vol 33
◽
pp. 7949-7956
Keyword(s):
Reinforcement learning (RL) agents have traditionally been tasked with maximizing the value function of a Markov decision process (MDP), either in continuous settings, with fixed discount factor γ
1997 ◽
Vol 22
(4)
◽
pp. 872-885
◽
2014 ◽
Vol 46
(01)
◽
pp. 121-138
◽
2014 ◽
Vol 46
(1)
◽
pp. 121-138
◽
2022 ◽
Vol 16
◽
pp. 115-121
2021 ◽
2010 ◽
Vol 44-47
◽
pp. 3611-3615
◽
Keyword(s):