The Successful Ingredients of Policy Gradient Algorithms
Keyword(s):
Despite the sublime success in recent years, the underlying mechanisms powering the advances of reinforcement learning are yet poorly understood. In this paper, we identify these mechanisms - which we call ingredients - in on-policy policy gradient methods and empirically determine their impact on the learning. To allow an equitable assessment, we conduct our experiments based on a unified and modular implementation. Our results underline the significance of recent algorithmic advances and demonstrate that reaching state-of-the-art performance may not need sophisticated algorithms but can also be accomplished by the combination of a few simple ingredients.
2020 ◽
Vol 34
(04)
◽
pp. 3316-3323
2009 ◽
Vol 5
(12)
◽
pp. e1000586
◽
Keyword(s):
Keyword(s):
2020 ◽
Vol 34
(04)
◽
pp. 6770-6777
Keyword(s):
2016 ◽
Vol 57
◽
pp. 187-227
◽