Constrained Cross-Entropy Method for Safe Reinforcement Learning

Author(s):  
Min Wen ◽  
Ufuk Topcu
2006 ◽  
Vol 18 (12) ◽  
pp. 2936-2941 ◽  
Author(s):  
István Szita ◽  
András Lörincz

The cross-entropy method is an efficient and general optimization algorithm. However, its applicability in reinforcement learning (RL) seems to be limited because it often converges to suboptimal policies. We apply noise for preventing early convergence of the cross-entropy method, using Tetris, a computer game, for demonstration. The resulting policy outperforms previous RL algorithms by almost two orders of magnitude.


Sign in / Sign up

Export Citation Format

Share Document