Point-Based Monte Carto Online Planning in POMDPs
2013 ◽
Vol 846-847
◽
pp. 1388-1391
Keyword(s):
The online planning and learning in partially observable Markov decision processes are often intractable because belief states space has two curses: dimensionality and history. In order to address this problem, this paper proposes a point-based Monte Carto online planning approach in POMDPs. This approach involves performing value backup at specific reachable belief points, rather than over the entire belief simplex, to speed up computation processes. Then Monte Carlo tree search algorithm is exploited to share the value of actions across each subtree of the search tree so as to minimise the mean squared error. The experimental results show that the proposed algorithm is effective in real-time system.
2014 ◽
Vol 51
◽
pp. 165-205
◽
Keyword(s):
Keyword(s):
Keyword(s):
2018 ◽
Vol 38
(2-3)
◽
pp. 162-181
◽