Weak Human Preference Supervision for Deep Reinforcement Learning
Zehong Cao, KaiChiu Wong, Chin-Teng Lin
Jinyoung Choi, Christopher Dance, Jung-eun Kim, Kyung-sik Park, Jaehun Han, ...
2016, Vol 3 (2), pp. 115-131
Helen Steingroever, Ruud Wetzels, Eric-Jan Wagenmakers
James Foster, Matt Jones
Adnane Ez-Zizi, Simon Farrell, David Leslie
Jinglu Chen, Ling Tan, Lu Liu, Ling Wang
Yang Zhao, Jian-Ming Hu, Ming-Yang Gao, Zuo Zhang
Yilong Ren, Le Zhang, Han Jiang, Chengsheng Liu