A reinforcement learning application of a guided Monte Carlo Tree Search algorithm for beam orientation selection in radiation therapy

Gomoku is a two-player board game that originated in ancient China. There are various cases of developing Gomoku using artificial intelligence, such as a genetic algorithm and a tree search algorithm. Alpha-Gomoku, Gomoku AI built with Alpha-Go’s algorithm, defines all possible situations in the Gomoku board using Monte-Carlo tree search (MCTS), and minimizes the probability of learning other correct answers in the duplicated Gomoku board situation. However, in the tree search algorithm, the accuracy drops, because the classification criteria are manually set. In this paper, we propose an improved reinforcement learning-based high-level decision approach using convolutional neural networks (CNN). The proposed algorithm expresses each state as One-Hot Encoding based vectors and determines the state of the Gomoku board by combining the similar state of One-Hot Encoding based vectors. Thus, in a case where a stone that is determined by CNN has already been placed or cannot be placed, we suggest a method for selecting an alternative. We verify the proposed method of Gomoku AI in GuPyEngine, a Python-based 3D simulation platform.

Download Full-text

Deep learning inspired routing in ICN using Monte Carlo Tree Search algorithm

Journal of Parallel and Distributed Computing ◽

10.1016/j.jpdc.2020.12.014 ◽

2021 ◽

Author(s):

Nitul Dutta ◽

Shobhit K. Patel ◽

Vadim Samusenkov ◽

Vigneswaran D.

Keyword(s):

Monte Carlo ◽

Deep Learning ◽

Search Algorithm ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Tree Search Algorithm

Download Full-text

Development of rehabilitation system (RehabGame) through Monte-Carlo tree search algorithm using kinect and Myo sensor interface

2017 Computing Conference ◽

10.1109/sai.2017.8252217 ◽

2017 ◽

Cited By ~ 3

Author(s):

Shabnam Sadeghi Esfahlani ◽

George Wilson

Keyword(s):

Monte Carlo ◽

Search Algorithm ◽

Tree Search ◽

Sensor Interface ◽

Monte Carlo Tree Search ◽

Rehabilitation System ◽

Tree Search Algorithm

Download Full-text

Adjustment of Difficulty Level on Wobble Board-Based Game Using Monte Carlo Tree Search Algorithm

2018 5th International Conference on Data and Software Engineering (ICoDSE) ◽

10.1109/icodse.2018.8705843 ◽

2018 ◽

Author(s):

Adi Purnama ◽

Saiful Akbar ◽

Dody Dharma

Keyword(s):

Monte Carlo ◽

Search Algorithm ◽

Difficulty Level ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Tree Search Algorithm

Download Full-text

A modified Monte-Carlo Tree Search Algorithm for Two-sided Assembly Line Balancing Problem

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2019.11.483 ◽

2019 ◽

Vol 52 (13) ◽

pp. 1920-1924

Author(s):

Chuanxun Wu ◽

Xiaofeng Hu ◽

Yahui Zhang ◽

Pengfei Wang

Keyword(s):

Monte Carlo ◽

Assembly Line ◽

Search Algorithm ◽

Assembly Line Balancing ◽

Line Balancing ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Assembly Line Balancing Problem ◽

Tree Search Algorithm

Download Full-text

Heuristic Model Checking using a Monte-Carlo Tree Search Algorithm

Proceedings of the 2015 on Genetic and Evolutionary Computation Conference - GECCO '15 ◽

10.1145/2739480.2754767 ◽

2015 ◽

Cited By ~ 5

Author(s):

Simon Poulding ◽

Robert Feldt

Keyword(s):

Monte Carlo ◽

Model Checking ◽

Search Algorithm ◽

Tree Search ◽

Heuristic Model ◽

Monte Carlo Tree Search ◽

Tree Search Algorithm

Download Full-text

Enhancing the Monte Carlo Tree Search Algorithm for Video Game Testing

2020 IEEE Conference on Games (CoG) ◽

10.1109/cog47356.2020.9231670 ◽

2020 ◽

Author(s):

Sinan Ariyurek ◽

Aysu Betin-Can ◽

Elif Surer

Keyword(s):

Monte Carlo ◽

Video Game ◽

Search Algorithm ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Tree Search Algorithm

Download Full-text

Using Supervised Learning and Guided Monte Carlo Tree Search for Beam Orientation Optimization in Radiation Therapy

Artificial Intelligence in Radiation Therapy - Lecture Notes in Computer Science ◽

10.1007/978-3-030-32486-5_1 ◽

2019 ◽

pp. 1-9

Author(s):

Azar Sadeghnejad Barkousaraie ◽

Olalekan Ogunmolu ◽

Steve Jiang ◽

Dan Nguyen

Keyword(s):

Monte Carlo ◽

Radiation Therapy ◽

Supervised Learning ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Beam Orientation Optimization ◽

Beam Orientation ◽

Orientation Optimization

Download Full-text

Enhanced strategic Monte-Carlo Tree Search algorithm to play the game of Tic-Tac-Toe

Journal of Korea Game Society ◽

10.7583/jkgs.2016.16.4.79 ◽

2016 ◽

Vol 16 (4) ◽

pp. 79-86

Author(s):

Byung-Doo Lee ◽

Keyword(s):

Monte Carlo ◽

Search Algorithm ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Tree Search Algorithm

Download Full-text

ALTERNATIVE SELECTION FUNCTIONS FOR INFORMATION SET MONTE CARLO TREE SEARCH

Acta Polytechnica ◽

10.14311/ap.2014.54.0333 ◽

2014 ◽

Vol 54 (5) ◽

pp. 333-340

Author(s):

Viliam Lisy

Keyword(s):

Monte Carlo ◽

Imperfect Information ◽

Search Algorithm ◽

Superior Performance ◽

Tree Search ◽

Monte Carlo Tree Search ◽

Information Set ◽

Imperfect Information Games ◽

Zero Sum ◽

Tree Search Algorithm

We evaluate the performance of various selection methods for the Monte Carlo Tree Search algorithm in two-player zero-sum extensive-form games with imperfect information. We compare the standard Upper Confident Bounds applied to Trees (UCT) along with the less common Exponential Weights for Exploration and Exploitation (Exp3) and novel Regret matching (RM) selection in two distinct imperfect information games: Imperfect Information Goofspiel and Phantom Tic-Tac-Toe. We show that UCT after initial fast convergence towards a Nash equilibrium computes increasingly worse strategies after some point in time. This is not the case with Exp3 and RM, which also show superior performance in head-to-head matches.

Download Full-text