Optimizing Daily Service Scheduling for Medical Diagnostic Equipment Considering Patient Satisfaction and Hospital Revenue

2018, Vol 10 (9), pp. 3349
Author(s): Gang Du, Xinyue Li, Hui Hu, Xiaoling Ouyang

Against the background of unbalanced supply and demand for medical diagnostic equipment and rising health care costs, this study aims to optimize the service scheduling of medical diagnostic equipment so as to improve patient satisfaction while maintaining equipment utilization and hospital revenue. A finite-horizon Markov decision process (MDP) was adopted to solve this problem. On the basis of field research, we divided patients into four categories according to illness severity and appointment status: emergency patients, inpatients, appointed outpatients, and randomly arriving outpatients. In constructing the MDP model, we accounted for the possibility of cancellation (no-show patients) in the scheduling optimization. Taking patient satisfaction and hospital revenue as the objective functions, and incorporating the benefits and costs related to patient satisfaction, we solved the model with the value iteration algorithm. Results indicate that, compared with the current scheduling strategy, the integrated strategy proposed in this study performs better and can sustain both the utilization rate of large medical equipment and patient satisfaction.
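
A minimal sketch of the kind of finite-horizon value iteration (backward induction) described here, for a toy accept/reject scheduling MDP. The state is simply the number of free diagnostic slots remaining, and the patient classes, arrival probabilities, rewards, and no-show rates below are illustrative assumptions, not the calibrated model from the paper.

```python
import numpy as np

# Toy finite-horizon MDP: at each epoch one request of a random class arrives
# and the scheduler either accepts it (consuming a slot unless the patient
# no-shows) or rejects it. All numbers are illustrative placeholders.
T = 20                                           # decision epochs in the day
MAX_SLOTS = 10                                   # capacity of the device
ARRIVAL_P = np.array([0.10, 0.30, 0.40, 0.20])   # emergency, inpatient, appointed, walk-in
REWARD    = np.array([8.0, 4.0, 3.0, 2.0])       # value of serving each class
NO_SHOW   = np.array([0.00, 0.05, 0.15, 0.10])   # cancellation probability after acceptance

V = np.zeros((T + 1, MAX_SLOTS + 1))             # V[t, s]: value with s free slots at epoch t
policy = np.zeros((T, MAX_SLOTS + 1, len(REWARD)), dtype=bool)

for t in range(T - 1, -1, -1):                   # backward induction over the horizon
    for s in range(MAX_SLOTS + 1):
        value = 0.0
        for c, p in enumerate(ARRIVAL_P):
            reject = V[t + 1, s]
            if s > 0:
                # If accepted, the slot is consumed unless the patient no-shows.
                accept = (REWARD[c]
                          + (1 - NO_SHOW[c]) * V[t + 1, s - 1]
                          + NO_SHOW[c] * V[t + 1, s])
            else:
                accept = -np.inf                 # no capacity left
            policy[t, s, c] = accept >= reject
            value += p * max(accept, reject)
        V[t, s] = value

print("Expected value of a full day's schedule:", V[0, MAX_SLOTS])
```

The resulting `policy[t, s, c]` table plays the role of the scheduling strategy: it says whether to accept a class-`c` request at epoch `t` with `s` slots left.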

Author(s): Damien Ernst, Mevludin Glavic, Pierre Geurts, Louis Wehenkel

In this paper, we explain how to design intelligent agents able to process information acquired from interaction with a system in order to learn a good control policy, and we show how the methodology can be applied to control devices designed to damp electrical power oscillations. The control problem is formalized as a discrete-time optimal control problem, and the information acquired from interaction with the system is a set of samples, each composed of four elements: a state, the action taken in that state, the instantaneous reward observed, and the successor state of the system. To process this information, we consider reinforcement learning algorithms that determine an approximation of the so-called Q-function by mimicking the behavior of the value iteration algorithm. Simulations are first carried out on a benchmark power system modeled with two state variables. We then present a more complex case study on a four-machine power system in which the reinforcement learning algorithm controls a Thyristor Controlled Series Capacitor (TCSC) aimed at damping power system oscillations.
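
A sketch of the batch-mode idea described here: fitted Q-iteration over a set of four-tuples (s, a, r, s'), where a supervised regressor mimics the value iteration update. The random dataset and the choice of `ExtraTreesRegressor` as the function approximator are illustrative assumptions; any regression algorithm could play the same role.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor

rng = np.random.default_rng(0)
N, STATE_DIM = 1000, 2                  # e.g. two state variables, as in the benchmark system
ACTIONS = np.array([-1.0, 0.0, 1.0])    # small discrete set of control actions
GAMMA = 0.95

# Synthetic stand-in for the samples gathered from interaction with the system.
S  = rng.normal(size=(N, STATE_DIM))                  # states
A  = rng.choice(ACTIONS, size=(N, 1))                 # actions taken
R  = -np.sum(S**2, axis=1)                            # instantaneous rewards
S2 = S + 0.1 * A + 0.05 * rng.normal(size=S.shape)    # successor states

X = np.hstack([S, A])                   # regression inputs: (state, action)
Q = None
for _ in range(10):                     # each iteration builds Q_k from Q_{k-1}
    if Q is None:
        y = R                           # first iteration: Q_1(s, a) = r
    else:
        # Bellman-style target: r + gamma * max_a' Q_{k-1}(s', a')
        q_next = np.column_stack([
            Q.predict(np.hstack([S2, np.full((N, 1), a)])) for a in ACTIONS
        ])
        y = R + GAMMA * q_next.max(axis=1)
    Q = ExtraTreesRegressor(n_estimators=30, random_state=0).fit(X, y)

# Greedy control: pick the action maximizing the learned Q at a given state.
s = np.zeros((1, STATE_DIM))
best = ACTIONS[np.argmax([Q.predict(np.hstack([s, [[a]]])) for a in ACTIONS])]
print("greedy action at the origin:", best)
```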


2014, Vol 513-517, pp. 1092-1095
Author(s): Bo Wu, Yan Peng Feng, Hong Yan Zheng

Bayesian reinforcement learning has proved to be an effective solution to the optimal tradeoff between exploration and exploitation. In practical applications, however, the exponential growth in the number of learning parameters is the main impediment to online planning and learning. To overcome this problem, we bring together factored representations, model-based learning, and Bayesian reinforcement learning in a new approach. First, we exploit a factored representation of the state to reduce the number of learning parameters, and adopt Bayesian inference to learn the unknown structure and parameters simultaneously. We then use an online point-based value iteration algorithm to plan and learn. Experimental results show that the proposed approach is an effective way to improve learning efficiency in large-scale state spaces.
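
A toy illustration of the model-based Bayesian idea behind this approach: maintain Dirichlet counts over the transition model, plan with value iteration on the posterior-mean model, act, then update the counts. This flat-state sketch ignores the factored representation and the point-based solver used in the paper; all sizes and rewards are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
N_S, N_A, GAMMA = 6, 2, 0.95
true_P = rng.dirichlet(np.ones(N_S), size=(N_S, N_A))   # hidden environment dynamics
R = rng.uniform(size=(N_S, N_A))                         # known reward table

counts = np.ones((N_S, N_A, N_S))                        # Dirichlet(1, ..., 1) prior

def plan(P, iters=200):
    """Standard value iteration on a known model P[s, a, s']."""
    V = np.zeros(N_S)
    for _ in range(iters):
        Q = R + GAMMA * P @ V                            # shape (N_S, N_A)
        V = Q.max(axis=1)
    return Q.argmax(axis=1)

s = 0
for step in range(500):
    P_mean = counts / counts.sum(axis=2, keepdims=True)  # posterior-mean model
    policy = plan(P_mean)                                 # plan on current belief
    a = policy[s]
    s_next = rng.choice(N_S, p=true_P[s, a])              # interact with the environment
    counts[s, a, s_next] += 1                             # Bayesian update of the model
    s = s_next

print("learned policy:", plan(counts / counts.sum(axis=2, keepdims=True)))
```

The factored representation in the paper replaces the flat `counts` table with per-variable counts, which is what keeps the number of learning parameters from growing exponentially.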


2016, Vol 138 (6)
Author(s): Thai Duong, Duong Nguyen-Huu, Thinh Nguyen

The Markov decision process (MDP) is a well-known framework for devising optimal decision-making strategies under uncertainty. Typically, the decision maker assumes a stationary environment characterized by a time-invariant transition probability matrix. In many real-world scenarios, however, this assumption is not justified, and the optimal strategy may not deliver the expected performance. In this paper, we study the performance of the classic value iteration algorithm for solving an MDP under nonstationary environments. Specifically, the nonstationary environment is modeled as a sequence of time-variant transition probability matrices governed by an adiabatic evolution inspired by quantum mechanics. We characterize the performance of the value iteration algorithm subject to the rate of change of the underlying environment, measured in terms of the convergence rate to the optimal average reward. We present two examples of queuing systems that make use of our analysis framework.
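
A sketch of the setting studied here: value iteration tracking a slowly drifting environment, where the transition matrix interpolates between two chains over the horizon and one Bellman backup is performed per epoch. The two random chains, the linear drift schedule, and the sup-norm gap measure are illustrative assumptions rather than the paper's adiabatic model.

```python
import numpy as np

rng = np.random.default_rng(2)
N_S, N_A, GAMMA, T = 5, 3, 0.9, 200
P0 = rng.dirichlet(np.ones(N_S), size=(N_S, N_A))   # environment at the start
P1 = rng.dirichlet(np.ones(N_S), size=(N_S, N_A))   # environment at the end
R = rng.uniform(size=(N_S, N_A))

def bellman(V, P):
    """One value-iteration sweep under transition model P[s, a, s']."""
    return (R + GAMMA * P @ V).max(axis=1)

def solve(P, iters=2000):
    """Run value iteration to (numerical) convergence on a fixed model."""
    V = np.zeros(N_S)
    for _ in range(iters):
        V = bellman(V, P)
    return V

V_track = np.zeros(N_S)
for t in range(T):
    alpha = t / (T - 1)                       # slow, linear drift from P0 to P1
    P_t = (1 - alpha) * P0 + alpha * P1       # current transition matrix
    V_track = bellman(V_track, P_t)           # one backup per epoch
    if t % 50 == 0 or t == T - 1:
        gap = np.max(np.abs(V_track - solve(P_t)))
        print(f"t={t:3d}  sup-norm gap to current optimum: {gap:.4f}")
```

The slower the drift (larger T for the same endpoints), the smaller the tracking gap stays, which is the intuition the paper makes precise.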


2015, Vol 13 (3), pp. 47-57
Author(s): Sanaa Chafik, Cherki Daoui

Because many real applications involve a very large number of states, classical methods are intractable for solving large Markov decision processes. Decomposition techniques based on the topology of each state in the associated graph, together with parallelization, are useful ways to cope with this problem. In this paper, the authors propose a Modified Value Iteration algorithm augmented with parallelism. They test their implementation on artificial data using OpenMP and obtain a significant speed-up.
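
A sketch of the parallelization idea: each value-iteration sweep splits the state set into blocks and computes the Bellman backup for each block on a separate worker. The paper parallelizes in C/C++ with OpenMP; this stand-in uses Python's process pool, and the random MDP below is an illustrative assumption.

```python
import numpy as np
from concurrent.futures import ProcessPoolExecutor

rng = np.random.default_rng(3)
N_S, N_A, GAMMA = 400, 4, 0.95
P = rng.dirichlet(np.ones(N_S), size=(N_S, N_A))    # P[s, a, s']
R = rng.uniform(size=(N_S, N_A))

def backup_block(args):
    """Bellman backup restricted to a contiguous block of states [lo, hi)."""
    lo, hi, V = args
    return (R[lo:hi] + GAMMA * P[lo:hi] @ V).max(axis=1)

def parallel_value_iteration(n_workers=4, tol=1e-6, max_iters=1000):
    V = np.zeros(N_S)
    blocks = np.array_split(np.arange(N_S), n_workers)
    bounds = [(b[0], b[-1] + 1) for b in blocks]
    with ProcessPoolExecutor(max_workers=n_workers) as pool:
        for _ in range(max_iters):
            # Each worker backs up its own block of states in parallel.
            parts = pool.map(backup_block, [(lo, hi, V) for lo, hi in bounds])
            V_new = np.concatenate(list(parts))
            if np.max(np.abs(V_new - V)) < tol:
                return V_new
            V = V_new
    return V

if __name__ == "__main__":           # guard required for process-based parallelism
    V = parallel_value_iteration()
    print("converged value of state 0:", V[0])
```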

