Preference-Based Reinforcement Learning Using Dyad Ranking
2018
pp. 161-175