Reinforcement learning-based control of drug dosing for cancer chemotherapy treatment

2017 ◽  
Vol 293 ◽  
pp. 11-20 ◽  
Author(s):  
Regina Padmanabhan ◽  
Nader Meskin ◽  
Wassim M. Haddad
2012 ◽  
pp. 1434-1444
Author(s):  
Adam E. Gaweda

This chapter presents application of reinforcement learning to drug dosing personalization in treatment of chronic conditions. Reinforcement learning is a machine learning paradigm that mimics the trialand- error skill acquisition typical for humans and animals. In treatment of chronic illnesses, finding the optimal dose amount for an individual is also a process that is usually based on trial-and-error. In this chapter, the author focuses on the challenge of personalized anemia treatment with recombinant human erythropoietin. The author demonstrates the application of a standard reinforcement learning method, called Q-learning, to guide the physician in selecting the optimal erythropoietin dose. The author further addresses the issue of random exploration in Q-learning from the drug dosing perspective and proposes a “smart” exploration method. Finally, the author performs computer simulations to compare the outcomes from reinforcement learning-based anemia treatment to those achieved by a standard dosing protocol used at a dialysis unit.


1986 ◽  
Vol 22 (1) ◽  
pp. 3-8 ◽  
Author(s):  
Brian G Birkhead ◽  
Walter M Gregory ◽  
Maurice L Slevin ◽  
Vernon J Harvey

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Brydon Eastman ◽  
Michelle Przedborski ◽  
Mohammad Kohandel

AbstractThe in-silico development of a chemotherapeutic dosing schedule for treating cancer relies upon a parameterization of a particular tumour growth model to describe the dynamics of the cancer in response to the dose of the drug. In practice, it is often prohibitively difficult to ensure the validity of patient-specific parameterizations of these models for any particular patient. As a result, sensitivities to these particular parameters can result in therapeutic dosing schedules that are optimal in principle not performing well on particular patients. In this study, we demonstrate that chemotherapeutic dosing strategies learned via reinforcement learning methods are more robust to perturbations in patient-specific parameter values than those learned via classical optimal control methods. By training a reinforcement learning agent on mean-value parameters and allowing the agent periodic access to a more easily measurable metric, relative bone marrow density, for the purpose of optimizing dose schedule while reducing drug toxicity, we are able to develop drug dosing schedules that outperform schedules learned via classical optimal control methods, even when such methods are allowed to leverage the same bone marrow measurements.


2020 ◽  
Vol 53 (2) ◽  
pp. 16353-16358
Author(s):  
Joseph T. Liparulo ◽  
Timothy D. Knab ◽  
Robert S. Parker

Author(s):  
Siqi Liu ◽  
Kay Choong See ◽  
Kee Yuan Ngiam ◽  
Leo Anthony Celi ◽  
Xingzhi Sun ◽  
...  

BACKGROUND Decision support systems based on reinforcement learning (RL) have been implemented to facilitate the delivery of personalized care. This paper aimed to provide a comprehensive review of RL applications in the critical care setting. OBJECTIVE This review aimed to survey the literature on RL applications for clinical decision support in critical care and to provide insight into the challenges of applying various RL models. METHODS We performed an extensive search of the following databases: PubMed, Google Scholar, Institute of Electrical and Electronics Engineers (IEEE), ScienceDirect, Web of Science, Medical Literature Analysis and Retrieval System Online (MEDLINE), and Excerpta Medica Database (EMBASE). Studies published over the past 10 years (2010-2019) that have applied RL for critical care were included. RESULTS We included 21 papers and found that RL has been used to optimize the choice of medications, drug dosing, and timing of interventions and to target personalized laboratory values. We further compared and contrasted the design of the RL models and the evaluation metrics for each application. CONCLUSIONS RL has great potential for enhancing decision making in critical care. Challenges regarding RL system design, evaluation metrics, and model choice exist. More importantly, further work is required to validate RL in authentic clinical environments.


2019 ◽  
Vol 309 ◽  
pp. 131-142 ◽  
Author(s):  
Regina Padmanabhan ◽  
Nader Meskin ◽  
Wassim M. Haddad

Sign in / Sign up

Export Citation Format

Share Document