An Application of Markov Decision Processes to the Seat Inventory Control Problem

Author(s):  
Christiane Barz ◽  
Karl-Heinz Waldmann
2015 ◽  
Vol 47 (1) ◽  
pp. 106-127 ◽  
Author(s):  
François Dufour ◽  
Alexei B. Piunovskiy

In this paper our objective is to study continuous-time Markov decision processes on a general Borel state space, with both impulsive and continuous controls, for the infinite-horizon discounted cost. The continuous-time controlled process is shown to be nonexplosive under appropriate hypotheses. The Bellman equation associated with this control problem is studied, and sufficient conditions are provided ensuring the existence and uniqueness of a bounded measurable solution to this optimality equation. Moreover, the value function of the optimization problem under consideration is shown to satisfy this optimality equation. Sufficient conditions are also presented ensuring, on the one hand, the existence of an optimal control strategy and, on the other hand, the existence of an ε-optimal control strategy. A decomposition of the state space into two disjoint subsets is exhibited on which, roughly speaking, one should apply a gradual action or an impulsive action, respectively, to obtain an optimal or ε-optimal strategy. An interesting consequence of these results is the following: the set of strategies that allow interventions at time t = 0 and only immediately after natural jumps is a sufficient set for the control problem under consideration.
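
To fix ideas, here is a minimal finite-state sketch of the optimality equation described above: the Bellman operator takes, at each state, the better of the best gradual action (handled here by uniformization) and the best impulsive jump. All rates, costs, and the uniformization constant are illustrative assumptions; the paper itself works on a general Borel state space under much weaker hypotheses.

```python
import numpy as np

# Toy data: 3 states, 2 gradual actions, 2 impulses.  Everything below
# is an illustrative assumption, not the paper's model.
alpha = 0.5                                  # discount rate
n = 3

# Gradual actions: conservative transition-rate matrices Q[a] (rows sum
# to zero) and running-cost vectors c[a].
Q = {0: np.array([[-1.0, 1.0, 0.0],
                  [0.0, -2.0, 2.0],
                  [0.5, 0.5, -1.0]]),
     1: np.array([[-3.0, 2.0, 1.0],
                  [1.0, -1.0, 0.0],
                  [0.0, 2.0, -2.0]])}
c = {0: np.array([1.0, 2.0, 0.5]),
     1: np.array([2.0, 0.5, 1.5])}

# Impulses: instantaneous jump to a target state at a lump cost.
impulses = {0: (0, 1.0), 1: (2, 0.8)}

Lam = 5.0    # uniformization constant, >= max_a max_x |Q[a][x, x]|

def bellman(V):
    """One sweep of the optimality operator: at each state take the
    better of the best gradual action (uniformized, discounted) and the
    best impulsive jump (lump cost plus value at the target)."""
    new_V = np.empty_like(V)
    for x in range(n):
        grad = min((c[a][x] + Lam * ((np.eye(n) + Q[a] / Lam) @ V)[x])
                   / (alpha + Lam) for a in Q)
        imp = min(cost + V[y] for y, cost in impulses.values())
        new_V[x] = min(grad, imp)
    return new_V

V = np.zeros(n)
for _ in range(300):          # iterate towards the bounded fixed point
    V = bellman(V)
print("approximate value function:", V)
```

For this toy data the iterates increase monotonically from V = 0 to a bounded fixed point; the states at which the impulsive branch attains the minimum correspond to the "impulsive" subset in the decomposition mentioned above.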


2021 ◽  
Vol 53 (2) ◽  
pp. 301-334
Author(s):  
Xin Guo ◽  
Aiko Kurushima ◽  
Alexey Piunovskiy ◽  
Yi Zhang

We consider a gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We show, under natural conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies that allow multiple simultaneous impulses, randomized selection of impulses with random effects, and accumulation of jumps. After characterizing the value function using the optimality equation, we reduce the gradual-impulse control problem to an equivalent simple discrete-time Markov decision process, whose action space is the union of the sets of gradual and impulsive actions.
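
The reduced discrete-time model with the union action space can be sketched in a few lines. Below is an illustrative finite-horizon, risk-sensitive value iteration in which each step minimizes exp(cost) times the expected continuation value over the union of gradual and impulsive actions. Treating each impulse as consuming one period is a simplification (the paper allows multiple simultaneous impulses and accumulation of jumps), and all transition data are toy assumptions.

```python
import numpy as np

# Toy data for the reduced discrete-time model: 3 states; the action set
# at each state is the union of gradual and impulsive actions.  Horizon,
# costs, and transitions are illustrative assumptions only.
n, T = 3, 25

# Gradual actions: row-stochastic transition matrix and per-step cost.
gradual = {
    "g0": (np.array([[0.6, 0.3, 0.1],
                     [0.2, 0.5, 0.3],
                     [0.3, 0.3, 0.4]]), np.array([1.0, 2.0, 0.5])),
    "g1": (np.array([[0.1, 0.8, 0.1],
                     [0.5, 0.4, 0.1],
                     [0.2, 0.2, 0.6]]), np.array([0.8, 1.5, 1.0])),
}
# Impulsive actions: deterministic jump to a target state at a lump cost.
impulsive = {"i0": (1, 0.7), "i1": (2, 1.2)}

# Risk-sensitive (multiplicative) value iteration for E[exp(total cost)]:
#   V_T = exp(0) = 1,
#   V_t(x) = min over the union of actions of exp(cost) * E[V_{t+1}].
V = np.ones(n)
for _ in range(T):
    new_V = np.empty(n)
    for x in range(n):
        grad = min(np.exp(cost[x]) * (P[x] @ V)
                   for P, cost in gradual.values())
        imp = min(np.exp(cost) * V[y] for y, cost in impulsive.values())
        new_V[x] = min(grad, imp)
    V = new_V
print("minimal expected exponential utility per starting state:", V)
```

The multiplicative form of the recursion is what the exponential utility buys: costs enter through exp(cost) as factors rather than summands, so the one-step operator acts on the expected continuation value exactly as in an ordinary discrete-time MDP.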


2012 ◽  
Vol 2012 ◽  
pp. 1-16 ◽  
Author(s):  
H. Cruz-Suárez ◽  
G. Zacarías-Espinoza ◽  
V. Vázquez-Guevara

This paper deals with Markov decision processes (MDPs) on Euclidean spaces with an infinite horizon. These MDPs are studied via the dynamic programming (DP) technique, under which the optimal value function is characterized through the value iteration functions. The paper provides conditions that guarantee the convergence of the maximizers of the value iteration functions to the optimal policy. Then, using the Euler equation and an envelope formula, the optimal solution of the optimal control problem is obtained. Finally, this theory is applied to a linear-quadratic control problem in order to find its optimal policy.
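
For the linear-quadratic application, the convergence of the maximizers of the value iteration functions can be made concrete in the scalar discounted case: each value iteration function is a quadratic V_k(x) = p_k·x², and its minimizer u = -K_k·x converges to the optimal linear policy as p_k approaches the Riccati fixed point. The sketch below uses illustrative parameter values, not an example taken from the paper.

```python
# Scalar discounted LQ problem: dynamics x' = a*x + b*u, stage cost
# q*x**2 + r*u**2, discount factor beta.  Parameter values are
# illustrative assumptions.
a, b = 1.1, 0.5
q, r = 1.0, 0.2
beta = 0.95

# Value iteration on the quadratic coefficient: V_k(x) = p_k * x**2.
p = 0.0                       # V_0 = 0
for k in range(100):
    # Minimizer of q*x^2 + r*u^2 + beta*p*(a*x + b*u)^2 over u is the
    # linear policy u = -K*x (the maximizer of the iteration step):
    K = beta * a * b * p / (r + beta * b**2 * p)
    # Plugging the minimizer back in gives the next coefficient:
    p = q + r * K**2 + beta * p * (a - b * K)**2

K = beta * a * b * p / (r + beta * b**2 * p)   # limiting optimal gain
print(f"Riccati coefficient p ~ {p:.4f}, optimal gain K ~ {K:.4f}")
```

The gains K_k computed along the way are exactly the maximizers of the value iteration functions, so their convergence to the final K illustrates the convergence result described in the abstract.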

