scholarly journals Weak Human Preference Supervision for Deep Reinforcement Learning

Author(s):  
Zehong Cao ◽  
KaiChiu Wong ◽  
Chin-Teng Lin
Decision ◽  
2016 ◽  
Vol 3 (2) ◽  
pp. 115-131 ◽  
Author(s):  
Helen Steingroever ◽  
Ruud Wetzels ◽  
Eric-Jan Wagenmakers

Sign in / Sign up

Export Citation Format

Share Document