A Monte-Carlo AIXI Approximation

Journal of Artificial Intelligence Research ◽ 10.1613/jair.3125 ◽ 2011 ◽ Vol 40 ◽ pp. 95-142 ◽ Cited By ~ 46

Author(s): J. Veness ◽ K.S. Ng ◽ M. Hutter ◽ W. Uther ◽ D. Silver

Keyword(s): Monte Carlo ◽ Reinforcement Learning ◽ Search Algorithm ◽ Future Research ◽ Monte Carlo Tree Search ◽ Practical Algorithms ◽ Learning Agent ◽ Context Tree ◽ Direct Approximation ◽ Partially Observable

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a new Monte-Carlo Tree Search algorithm along with an agent-specific extension to the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a variety of stochastic and partially observable domains. We conclude by proposing a number of directions for future research.

A reinforcement learning application of a guided Monte Carlo Tree Search algorithm for beam orientation selection in radiation therapy

Machine Learning: Science and Technology ◽ 10.1088/2632-2153/abe528 ◽ 2021

Author(s): Azar Sadeghnejad Barkousaraie ◽ Gyanendra Bohara ◽ Steve B Jiang ◽ Dan Nguyen

Keyword(s): Monte Carlo ◽ Radiation Therapy ◽ Reinforcement Learning ◽ Search Algorithm ◽ Tree Search ◽ Orientation Selection ◽ Monte Carlo Tree Search ◽ Beam Orientation ◽ Tree Search Algorithm

Enhanced Reinforcement Learning Method Combining One-Hot Encoding-Based Vectors for CNN-Based Alternative High-Level Decisions

Applied Sciences ◽ 10.3390/app11031291 ◽ 2021 ◽ Vol 11 (3) ◽ pp. 1291

Author(s): Bonwoo Gu ◽ Yunsick Sung

Keyword(s): Reinforcement Learning ◽ Search Algorithm ◽ Classification Criteria ◽ Tree Search ◽ Learning Method ◽ Board Game ◽ Ancient China ◽ Monte Carlo Tree Search ◽ High Level ◽ Tree Search Algorithm

Gomoku is a two-player board game that originated in ancient China. Gomoku-playing programs have been developed with various artificial intelligence techniques, such as genetic algorithms and tree search algorithms. Alpha-Gomoku, a Gomoku AI built with AlphaGo's algorithm, defines all possible situations on the Gomoku board using Monte-Carlo tree search (MCTS), and minimizes the probability of learning other correct answers in duplicated board situations. However, the accuracy of the tree search algorithm drops because the classification criteria are set manually. In this paper, we propose an improved reinforcement learning-based high-level decision approach using convolutional neural networks (CNNs). The proposed algorithm expresses each state as one-hot encoding-based vectors and determines the state of the Gomoku board by combining similar states of those vectors. When the stone position chosen by the CNN is already occupied or cannot be played, we suggest a method for selecting an alternative. We verify the proposed Gomoku AI method in GuPyEngine, a Python-based 3D simulation platform.

Deep learning inspired routing in ICN using Monte Carlo Tree Search algorithm

Journal of Parallel and Distributed Computing ◽ 10.1016/j.jpdc.2020.12.014 ◽ 2021

Author(s): Nitul Dutta ◽ Shobhit K. Patel ◽ Vadim Samusenkov ◽ Vigneswaran D.

Keyword(s): Monte Carlo ◽ Deep Learning ◽ Search Algorithm ◽ Tree Search ◽ Monte Carlo Tree Search ◽ Tree Search Algorithm

Monte Carlo Tree Search for Bayesian Reinforcement Learning

2012 11th International Conference on Machine Learning and Applications ◽ 10.1109/icmla.2012.30 ◽ 2012 ◽ Cited By ~ 2

Author(s): Ngo Anh Vien ◽ Wolfgang Ertel

Keyword(s): Monte Carlo ◽ Reinforcement Learning ◽ Tree Search ◽ Monte Carlo Tree Search ◽ Bayesian Reinforcement Learning

Development of rehabilitation system (RehabGame) through Monte-Carlo tree search algorithm using kinect and Myo sensor interface

2017 Computing Conference ◽ 10.1109/sai.2017.8252217 ◽ 2017 ◽ Cited By ~ 3

Author(s): Shabnam Sadeghi Esfahlani ◽ George Wilson

Keyword(s): Monte Carlo ◽ Search Algorithm ◽ Tree Search ◽ Sensor Interface ◽ Monte Carlo Tree Search ◽ Rehabilitation System ◽ Tree Search Algorithm

Adjustment of Difficulty Level on Wobble Board-Based Game Using Monte Carlo Tree Search Algorithm

2018 5th International Conference on Data and Software Engineering (ICoDSE) ◽ 10.1109/icodse.2018.8705843 ◽ 2018

Author(s): Adi Purnama ◽ Saiful Akbar ◽ Dody Dharma

Keyword(s): Monte Carlo ◽ Search Algorithm ◽ Difficulty Level ◽ Tree Search ◽ Monte Carlo Tree Search ◽ Tree Search Algorithm

Towards efficient discovery of green synthetic pathways with Monte Carlo tree search and reinforcement learning

Chemical Science ◽ 10.1039/d0sc04184j ◽ 2020 ◽ Vol 11 (40) ◽ pp. 10959-10972

Author(s): Xiaoxue Wang ◽ Yujie Qian ◽ Hanyu Gao ◽ Connor W. Coley ◽ Yiming Mo ◽ ...

Keyword(s): Monte Carlo ◽ Reinforcement Learning ◽ Prediction Model ◽ Tree Search ◽ Monte Carlo Tree Search ◽ Value Network ◽ Synthesis Routes

A new MCTS variant with a reinforcement learning value network and solvent prediction model proposes shorter synthesis routes with greener solvents.

A modified Monte-Carlo Tree Search Algorithm for Two-sided Assembly Line Balancing Problem

IFAC-PapersOnLine ◽ 10.1016/j.ifacol.2019.11.483 ◽ 2019 ◽ Vol 52 (13) ◽ pp. 1920-1924

Author(s): Chuanxun Wu ◽ Xiaofeng Hu ◽ Yahui Zhang ◽ Pengfei Wang

Keyword(s): Monte Carlo ◽ Assembly Line ◽ Search Algorithm ◽ Assembly Line Balancing ◽ Line Balancing ◽ Tree Search ◽ Monte Carlo Tree Search ◽ Assembly Line Balancing Problem ◽ Tree Search Algorithm

Heuristic Model Checking using a Monte-Carlo Tree Search Algorithm

Proceedings of the 2015 on Genetic and Evolutionary Computation Conference - GECCO '15 ◽ 10.1145/2739480.2754767 ◽ 2015 ◽ Cited By ~ 5

Author(s): Simon Poulding ◽ Robert Feldt

Keyword(s): Monte Carlo ◽ Model Checking ◽ Search Algorithm ◽ Tree Search ◽ Heuristic Model ◽ Monte Carlo Tree Search ◽ Tree Search Algorithm

Contracts for Difference: A Reinforcement Learning Approach

Journal of Risk and Financial Management ◽ 10.3390/jrfm13040078 ◽ 2020 ◽ Vol 13 (4) ◽ pp. 78

Author(s): Nico Zengeler ◽ Uwe Handmann

Keyword(s): Reinforcement Learning ◽ Short Term Memory ◽ Learning Agents ◽ Learning Framework ◽ Learning Agent ◽ Markov Decision ◽ Economic Trends ◽ Model Size ◽ Contracts For Difference ◽ Partially Observable

We present a deep reinforcement learning framework for the automated, high-frequency trading of contracts for difference (CfD) on indices. Our contribution proves that reinforcement learning agents with recurrent long short-term memory (LSTM) networks can learn from recent market history and outperform the market. Usually, these approaches depend on low latency. In a real-world example, we show that an increased model size may compensate for higher latency. As the noisy nature of economic trends complicates predictions, especially in speculative assets, our approach does not predict prices but instead uses a reinforcement learning agent to learn an overall lucrative trading policy. To this end, we simulate a virtual market environment based on historical trading data. Our environment presents a partially observable Markov decision process (POMDP) to reinforcement learners and allows the training of various strategies.
