Optimistic Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds
Keyword(s):
In the paper, “Optimistic Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds,” the authors propose an extension to Monte Carlo tree search that uses the idea of “sampling the future” to produce noisy upper bounds on nodes in the decision tree. These upper bounds can help guide the tree expansion process and produce decision trees that are deeper rather than wider, in effect concentrating computation toward more useful parts of the state space. The algorithm’s effectiveness is illustrated in a ride-sharing setting, where a driver/vehicle needs to make dynamic decisions regarding trip acceptance and relocations.
2020 ◽
Vol 2674
(8)
◽
pp. 167-178
Keyword(s):
2018 ◽
Vol 42
(4)
◽
pp. 1-5
◽
Keyword(s):
Keyword(s):