Monte Carlo Tree Search and Cognitive Hierarchy Theory for Interactive-Behavior Prediction in Fast Trajectory Planning and Automated Lane Change

Abstract Predicting the states of the surrounding traffic is one of the major problems in automated driving. Maneuvers such as lane change, merge, and exit management could pose challenges in the absence of intervehicular communications and can benefit from driver behavior prediction. Predicting the motion of surrounding vehicles and trajectory planning need to be computationally efficient for real-time implementation. The main goal of this paper is to develop a fast algorithm that predicts the future states of the neighboring vehicles. The proposed workflow employs Monte Carlo Tree Search (MCTS) along with an on-policy learning technique for fast trajectory planning in multi-lane highway traffic scenarios. Also, for the inclusion of behavioral aspects, cognitive hierarchy and level-K game theories are utilized to predict the reaction and decision of the surrounding drivers. Simulation case studies demonstrate that our proposed approach is real-time implementable and can often avoid collision in difficult simulated confrontations.

Download Full-text

Real-Time Monte Carlo Tree Search in Ms Pac-Man

IEEE Transactions on Computational Intelligence and AI in Games ◽

10.1109/tciaig.2013.2291577 ◽

2014 ◽

Vol 6 (3) ◽

pp. 245-257 ◽

Cited By ~ 24

Author(s):

Tom Pepels ◽

Mark H. M. Winands ◽

Marc Lanctot

Keyword(s):

Monte Carlo ◽

Real Time ◽

Tree Search ◽

Monte Carlo Tree Search

Download Full-text

Genetic Optimizing Method for Real-time Monte Carlo Tree Search Problem

10.1145/3426020.3426030 ◽

2020 ◽

Author(s):

Man-Je Kim ◽

Jong-Hyun Lee ◽

Chang Wook Ahn

Keyword(s):

Monte Carlo ◽

Real Time ◽

Search Problem ◽

Tree Search ◽

Monte Carlo Tree Search

Download Full-text

Multiobjective Monte Carlo Tree Search for Real-Time Games

IEEE Transactions on Computational Intelligence and AI in Games ◽

10.1109/tciaig.2014.2345842 ◽

2015 ◽

Vol 7 (4) ◽

pp. 347-360 ◽

Cited By ~ 7

Author(s):

Diego Perez ◽

Sanaz Mostaghim ◽

Spyridon Samothrakis ◽

Simon M. Lucas

Keyword(s):

Monte Carlo ◽

Real Time ◽

Tree Search ◽

Monte Carlo Tree Search

Download Full-text

Enhancements for real-time Monte-Carlo Tree Search in General Video Game Playing

2016 IEEE Conference on Computational Intelligence and Games (CIG) ◽

10.1109/cig.2016.7860448 ◽

2016 ◽

Cited By ~ 9

Author(s):

Dennis J. N. J. Soemers ◽

Chiara F. Sironi ◽

Torsten Schuster ◽

Mark H. M. Winands

Keyword(s):

Monte Carlo ◽

Real Time ◽

Video Game ◽

Tree Search ◽

Game Playing ◽

Monte Carlo Tree Search ◽

Video Game Playing

Download Full-text

Tackling Sparse Rewards in Real-Time Games with Statistical Forward Planning Methods

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33011691 ◽

2019 ◽

Vol 33 ◽

pp. 1691-1698 ◽

Cited By ~ 5

Author(s):

Raluca D. Gaina ◽

Simon M. Lucas ◽

Diego Pérez-Liébana

Keyword(s):

Monte Carlo ◽

Evolutionary Algorithms ◽

Real Time ◽

Fitness Landscape ◽

Tree Search ◽

Rolling Horizon ◽

Reward Systems ◽

Monte Carlo Tree Search ◽

Planning Methods ◽

High Level

One of the issues general AI game players are required to deal with is the different reward systems in the variety of games they are expected to be able to play at a high level. Some games may present plentiful rewards which the agents can use to guide their search for the best solution, whereas others feature sparse reward landscapes that provide little information to the agents. The work presented in this paper focuses on the latter case, which most agents struggle with. Thus, modifications are proposed for two algorithms, Monte Carlo Tree Search and Rolling Horizon Evolutionary Algorithms, aiming at improving performance in this type of games while maintaining overall win rate across those where rewards are plentiful. Results show that longer rollouts and individual lengths, either fixed or responsive to changes in fitness landscape features, lead to a boost of performance in the games during testing without being detrimental to non-sparse reward scenarios.

Download Full-text

Informed Monte Carlo Tree Search for Real-Time Strategy games

2016 IEEE Conference on Computational Intelligence and Games (CIG) ◽

10.1109/cig.2016.7860394 ◽

2016 ◽

Cited By ~ 3

Author(s):

Santiago Ontanon

Keyword(s):

Monte Carlo ◽

Real Time ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Strategy Games

Download Full-text

Monte Carlo Tree Search-Based Mixed Traffic Flow Control Algorithm for Arterial Intersections

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/0361198120919746 ◽

2020 ◽

Vol 2674 (8) ◽

pp. 167-178

Author(s):

Yanqiu Cheng ◽

Xianbiao Hu ◽

Qing Tang ◽

Hongsheng Qi ◽

Hong Yang

Keyword(s):

Monte Carlo ◽

Real Time ◽

Traffic Flow ◽

Tree Search ◽

Mixed Traffic ◽

Solution Quality ◽

Monte Carlo Tree Search ◽

Mixed Traffic Flow ◽

Model Free ◽

Tree Expansion

A model-free approach is presented, based on the Monte Carlo tree search (MCTS) algorithm, for the control of mixed traffic flow of human-driven vehicles (HDV) and connected and autonomous vehicles (CAV), named MCTS-MTF, on a one-lane roadway with signalized intersection control. Previous research has often simplified the problem with certain assumptions to reduce computational burden, such as dividing a vehicle trajectory into several segments with constant speed or linear acceleration/deceleration, which was rather unrealistic. This study departs from the existing research in that minimum constraints on CAV trajectory control were required, as long as the basic rules such as safety considerations and vehicular performance limits were followed. Modeling efforts were made to improve the algorithm solution quality and the run time efficiency over the naïve MCTS algorithm. This was achieved by an exploration-exploitation balance calibration module, and a tree expansion determination module to expand the tree more effectively along the desired direction. Results of a case study found that the proposed algorithm was able to achieve a travel time saving of 3.5% and a fuel consumption saving of 6.5%. It was also demonstrated to run at eight times the speed of a naïve MCTS model, suggesting a promising potential for real-time or near real-time applications.

Download Full-text