Artificial intelligence‐based radiotherapy machine parameter optimization using reinforcement learning

Reinforcement Learning for Cloud Computing Digital Library

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.571-572.105 ◽

2014 ◽

Vol 571-572 ◽

pp. 105-108

Author(s):

Lin Xu

Keyword(s):

Artificial Intelligence ◽

Cloud Computing ◽

Reinforcement Learning ◽

Digital Library ◽

Learning Algorithms ◽

Experimental Results ◽

Current Status ◽

Self Learning ◽

New Framework

This paper proposes a new framework of combining reinforcement learning with cloud computing digital library. Unified self-learning algorithms, which includes reinforcement learning, artificial intelligence and etc, have led to many essential advances. Given the current status of highly-available models, analysts urgently desire the deployment of write-ahead logging. In this paper we examine how DNS can be applied to the investigation of superblocks, and introduce the reinforcement learning to improve the quality of current cloud computing digital library. The experimental results show that the method works more efficiency.

Download Full-text

Pemanfaatan Asynchronous Advantage Actor-Critic Dalam Pembuatan AI Game Bot Pada Game Arcade

Journal of Intelligent System and Computation ◽

10.52985/insyst.v1i2.82 ◽

2019 ◽

Vol 1 (2) ◽

pp. 74-84

Author(s):

Evan Kusuma Susanto ◽

Yosi Kristian

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Reinforcement Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Trial And Error ◽

Short Term ◽

Term Memory ◽

Memory Network ◽

Long Short Term Memory

Asynchronous Advantage Actor-Critic (A3C) adalah sebuah algoritma deep reinforcement learning yang dikembangkan oleh Google DeepMind. Algoritma ini dapat digunakan untuk menciptakan sebuah arsitektur artificial intelligence yang dapat menguasai berbagai jenis game yang berbeda melalui trial and error dengan mempelajari tempilan layar game dan skor yang diperoleh dari hasil tindakannya tanpa campur tangan manusia. Sebuah network A3C terdiri dari Convolutional Neural Network (CNN) di bagian depan, Long Short-Term Memory Network (LSTM) di tengah, dan sebuah Actor-Critic network di bagian belakang. CNN berguna sebagai perangkum dari citra output layar dengan mengekstrak fitur-fitur yang penting yang terdapat pada layar. LSTM berguna sebagai pengingat keadaan game sebelumnya. Actor-Critic Network berguna untuk menentukan tindakan terbaik untuk dilakukan ketika dihadapkan dengan suatu kondisi tertentu. Dari hasil percobaan yang dilakukan, metode ini cukup efektif dan dapat mengalahkan pemain pemula dalam memainkan 5 game yang digunakan sebagai bahan uji coba.

Download Full-text

Cloud Load Balancing and Reinforcement Learning

Advances in Business Information Systems and Analytics - Cloud Computing Technologies for Green Enterprises ◽

10.4018/978-1-5225-3038-1.ch011 ◽

2018 ◽

pp. 266-291

Author(s):

Abdelghafour Harraz ◽

Mostapha Zbakh

Keyword(s):

Artificial Intelligence ◽

Reinforcement Learning ◽

Load Balancing ◽

Decision Process ◽

Cloud System ◽

Human Intervention ◽

Q Learning ◽

State Action ◽

Learning Techniques ◽

Markov Decision

Artificial Intelligence allows to create engines that are able to explore, learn environments and therefore create policies that permit to control them in real time with no human intervention. It can be applied, through its Reinforcement Learning techniques component, using frameworks such as temporal differences, State-Action-Reward-State-Action (SARSA), Q Learning to name a few, to systems that are be perceived as a Markov Decision Process, this opens door in front of applying Reinforcement Learning to Cloud Load Balancing to be able to dispatch load dynamically to a given Cloud System. The authors will describe different techniques that can used to implement a Reinforcement Learning based engine in a cloud system.

Download Full-text

Predictability of AI Decisions

Analyzing Future Applications of AI, Sensors, and Robotics in Society - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-3499-1.ch002 ◽

2021 ◽

pp. 17-28

Author(s):

Grzegorz Musiolik

Keyword(s):

Artificial Intelligence ◽

Reinforcement Learning ◽

Free Will ◽

Intelligent Agents ◽

Intelligent Agent ◽

Mathematical Structure ◽

General Question ◽

Safety Issues ◽

Robotic Applications ◽

Mathematics And Physics

Artificial intelligence evolves rapidly and will have a great impact on the society in the future. One important question which still cannot be addressed with satisfaction is whether the decision of an intelligent agent can be predicted. As a consequence of this, the general question arises if such agents can be controllable and future robotic applications can be safe. This chapter shows that unpredictable systems are very common in mathematics and physics although the underlying mathematical structure can be very simple. It also shows that such unpredictability can also emerge for intelligent agents in reinforcement learning, especially for complex tasks with various input parameters. An observer would not be capable to distinguish this unpredictability from a free will of the agent. This raises ethical questions and safety issues which are briefly presented.

Download Full-text

Model‐Informed Artificial Intelligence: Reinforcement Learning for Precision Dosing

Clinical Pharmacology & Therapeutics ◽

10.1002/cpt.1777 ◽

2020 ◽

Vol 107 (4) ◽

pp. 853-857 ◽

Cited By ~ 7

Author(s):

Benjamin Ribba ◽

Sherri Dudal ◽

Thierry Lavé ◽

Richard W. Peck

Keyword(s):

Artificial Intelligence ◽

Reinforcement Learning

Download Full-text

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

Science ◽

10.1126/science.aar6404 ◽

2018 ◽

Vol 362 (6419) ◽

pp. 1140-1144 ◽

Cited By ~ 388

Author(s):

David Silver ◽

Thomas Hubert ◽

Julian Schrittwieser ◽

Ioannis Antonoglou ◽

Matthew Lai ◽

...

Keyword(s):

Artificial Intelligence ◽

Reinforcement Learning ◽

Domain Knowledge ◽

Learning Algorithm ◽

Search Techniques ◽

Domain Specific ◽

Evaluation Functions ◽

History Of ◽

World Champion ◽

Reinforcement Learning Algorithm

The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play. In this paper, we generalize this approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games. Starting from random play and given no domain knowledge except the game rules, AlphaZero convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.

Download Full-text

SU-GG-J-140: Online Re-Planning Using Direct Machine Parameter Optimization: A Non-Human Primate Lung Irradiation Study

Medical Physics ◽

10.1118/1.2961689 ◽

2008 ◽

Vol 35 (6Part7) ◽

pp. 2711-2711

Author(s):

F Lerma ◽

B Liu ◽

Z Wang ◽

C Yu ◽

B Yi ◽

...

Keyword(s):

Parameter Optimization ◽

Machine Parameter ◽

Lung Irradiation ◽

Direct Machine Parameter Optimization ◽

Human Primate

Download Full-text

TH-D-AUD B-02: A Technique For Creating VMAT Plans Using Direct Machine Parameter Optimization

Medical Physics ◽

10.1118/1.2962904 ◽

2008 ◽

Vol 35 (6Part27) ◽

pp. 2984-2984

Author(s):

C Ramsey ◽

Y Charara ◽

D Chase

Keyword(s):

Parameter Optimization ◽

Machine Parameter ◽

Direct Machine Parameter Optimization

Download Full-text

SU-FF-T-106: Does Intrafraction Motion Increase Dose to the Inframammary Fold Areas in Whole Breast Treatment Using Direct Machine Parameter Optimization IMRT?

Medical Physics ◽

10.1118/1.3181580 ◽

2009 ◽

Vol 36 (6Part10) ◽

pp. 2544-2544

Author(s):

S Hossain ◽

P Xia ◽

M Descovich ◽

J Chen ◽

L Ma

Keyword(s):

Parameter Optimization ◽

Inframammary Fold ◽

Machine Parameter ◽

Intrafraction Motion ◽

Direct Machine Parameter Optimization

Download Full-text

BOOSTR: A Dataset for Accelerator Control Systems

Data ◽

10.3390/data6040042 ◽

2021 ◽

Vol 6 (4) ◽

pp. 42

Author(s):

Diana Kafkes ◽

Jason St. John

Keyword(s):

Artificial Intelligence ◽

Time Series ◽

Reinforcement Learning ◽

Control Systems ◽

Power Supply ◽

Cycle Time ◽

Rapid Cycling ◽

Operation Optimization ◽

Advanced Control ◽

Rapid Cycling Synchrotron

The Booster Operation Optimization Sequential Time-series for Regression (BOOSTR) dataset was created to provide a cycle-by-cycle time series of readings and settings from instruments and controllable devices of the Booster, Fermilab’s Rapid-Cycling Synchrotron (RCS) operating at 15 Hz. BOOSTR provides a time series from 55 device readings and settings that pertain most directly to the high-precision regulation of the Booster’s gradient magnet power supply (GMPS). To our knowledge, this is one of the first well-documented datasets of accelerator device parameters made publicly available. We are releasing it in the hopes that it can be used to demonstrate aspects of artificial intelligence for advanced control systems, such as reinforcement learning and autonomous anomaly detection.

Download Full-text