Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality

Operations Research ◽

10.1287/opre.2021.2181 ◽

2021 ◽

Author(s):

David B. Brown ◽

Jingwei Zhang

Keyword(s):

Inventory Management ◽

Fluid Model ◽

Asymptotic Optimality ◽

Capital Budgeting ◽

Stochastic Dynamic ◽

Performance Bounds ◽

Sequential Decision ◽

Shared Resources ◽

Management Problems ◽

Dynamic Programs

Allocating Resources Across Systems Coupled by Shared Information Many sequential decision problems involve repeatedly allocating a limited resource across subsystems that are jointly affected by randomly evolving exogenous factors. For example, in adaptive clinical trials, a decision maker needs to allocate patients to treatments in an effort to learn about the efficacy of treatments, but the number of available patients may vary randomly over time. In capital budgeting problems, firms may allocate resources to conduct R&D on new products, but funding budgets may evolve randomly. In many inventory management problems, firms need to allocate limited production capacity to satisfy uncertain demands at multiple locations, and these demands may be correlated due to vagaries in shared market conditions. In this paper, we develop a model involving “shared resources and signals” that captures these and potentially many other applications. The framework is naturally described as a stochastic dynamic program, but this problem is quite difficult to solve. We develop an approximation method based on a “dynamic fluid relaxation”: in this approximation, the subsystem state evolution is approximated by a deterministic fluid model, but the exogenous states (the signals) retain their stochastic evolution. We develop an algorithm for solving the dynamic fluid relaxation. We analyze the corresponding feasible policies and performance bounds from the dynamic fluid relaxation and show that these are asymptotically optimal as the number of subsystems grows large. We show that competing state-of-the-art approaches used in the literature on weakly coupled dynamic programs in general fail to provide asymptotic optimality. Finally, we illustrate the approach on the aforementioned dynamic capital budgeting and multilocation inventory management problems.

Download Full-text

Index Policies and Performance Bounds for Dynamic Selection Problems

Management Science ◽

10.1287/mnsc.2019.3342 ◽

2020 ◽

Vol 66 (7) ◽

pp. 3029-3050 ◽

Cited By ~ 2

Author(s):

David B. Brown ◽

James E. Smith

Keyword(s):

Large Scale ◽

Stochastic Dynamic ◽

Performance Bounds ◽

Index Policy ◽

Dynamic Selection ◽

Demand Learning ◽

Selection Problems ◽

Index Policies ◽

Dynamic Programs ◽

And Performance

We consider dynamic selection problems, where a decision maker repeatedly selects a set of items from a larger collection of available items. A classic example is the dynamic assortment problem with demand learning, where a retailer chooses items to offer for sale subject to a display space constraint. The retailer may adjust the assortment over time in response to the observed demand. These dynamic selection problems are naturally formulated as stochastic dynamic programs (DPs) but are difficult to solve because the optimal selection decisions depend on the states of all items. In this paper, we study heuristic policies for dynamic selection problems and provide upper bounds on the performance of an optimal policy that can be used to assess the performance of a heuristic policy. The policies and bounds that we consider are based on a Lagrangian relaxation of the DP that relaxes the constraint limiting the number of items that may be selected. We characterize the performance of the Lagrangian index policy and bound and show that, under mild conditions, these policies and bounds are asymptotically optimal for problems with many items; mixed policies and tiebreaking play an essential role in the analysis of these index policies and can have a surprising impact on performance. We demonstrate these policies and bounds in two large scale examples: a dynamic assortment problem with demand learning and an applicant screening problem. This paper was accepted by Yinyu Ye, optimization.

Download Full-text

Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality

SSRN Electronic Journal ◽

10.2139/ssrn.3728111 ◽

2020 ◽

Author(s):

David B. Brown ◽

Jingwei Zhang

Keyword(s):

Asymptotic Optimality ◽

Shared Resources ◽

Dynamic Programs

Download Full-text

On the structure of solutions of stochastic dynamic programs

Proceedings of the Seventh Conference on Probability Theory ◽

10.1515/9783112314036-020 ◽

1985 ◽

pp. 173-182

Keyword(s):

Stochastic Dynamic ◽

Structure Of Solutions ◽

Dynamic Programs

Download Full-text

Analysis of Inventory Management in Reverse Supply Chain Using Stochastic Dynamic Programming Model

Volume 8: Energy Systems: Analysis, Thermodynamics and Sustainability; Sustainable Products and Processes ◽

10.1115/imece2008-67374 ◽

2008 ◽

Author(s):

Badr O. Johar ◽

Surendra M. Gupta

Keyword(s):

Dynamic Programming ◽

Inventory Management ◽

Reverse Logistics ◽

Stochastic Dynamic Programming ◽

Public Awareness ◽

Financial Burden ◽

Probabilistic Approach ◽

Complex Nature ◽

Stochastic Dynamic

Reverse logistics is a critical topic that has captured the attention of government, private entities and researchers in recent years. This increase in the concern was driven by current set of government regulations, increase of public awareness, and the attractive economic opportunities. Also, environmentalists have always demanded Original Equipment Manufacturers (OEMs) to be more involved and be responsible of their products at the end of its life cycle. However, the uncertainty in quality of items returned, and its quantity discourage OEMs from participating in such programs. Because of the unique problems associated and the complex nature of the reverse logistics activities, numerous studies have been carried out in this field. One of those crucial areas is inventory management of End-of-Life (EOL) products. The take back program could possibly bring financial burden to OEM if it is not managed well. Thus, an efficient yet cost effective system should be implemented to appropriately manage the overwhelming number of returns. Previously, we have analyzed the problem based on the assumption that the number of core products returned and disassembled parts and subassemblies are known in advance. In this paper, we introduce a probabilistic approach where different quality levels of for every component disassembled are considered and different probabilities of these qualities given the quality of the returned product. The model utilizes a multi-period stochastic dynamic programming in a disassembly line context to solve the problem, and generate the best option that will maximize the system total profit. A numerical example is given to illustrate the approach. Finally, directions for future research are suggested.

Download Full-text

Application of orthogonal arrays and MARS to inventory forecasting stochastic dynamic programs

Computational Statistics & Data Analysis ◽

10.1016/s0167-9473(98)00084-x ◽

1999 ◽

Vol 30 (3) ◽

pp. 317-341 ◽

Cited By ~ 19

Author(s):

Victoria C.P Chen

Keyword(s):

Orthogonal Arrays ◽

Stochastic Dynamic ◽

Dynamic Programs

Download Full-text

Fixed-Dimensional Stochastic Dynamic Programs: An Approximation Scheme and an Inventory Application

SSRN Electronic Journal ◽

10.2139/ssrn.2193021 ◽

2012 ◽

Cited By ~ 1

Author(s):

Wei Chen ◽

Milind Dawande ◽

Ganesh Janakiraman

Keyword(s):

Approximation Scheme ◽

Stochastic Dynamic ◽

Dynamic Programs

Download Full-text

On the Strength of Relaxations of Weakly Coupled Stochastic Dynamic Programs

SSRN Electronic Journal ◽

10.2139/ssrn.3849573 ◽

2021 ◽

Author(s):

David B. Brown ◽

Jingwei Zhang

Keyword(s):

Stochastic Dynamic ◽

Weakly Coupled ◽

Dynamic Programs

Download Full-text

Choosing Alternatives to Contaminated Groundwater Supplies: A Sequential Decision Framework Under Uncertainty

Northeastern Journal of Agricultural and Resource Economics ◽

10.1017/s0899367x00001434 ◽

1987 ◽

Vol 16 (2) ◽

pp. 102-112 ◽

Cited By ~ 2

Author(s):

Carol L. Sarnat ◽

Cleve E. Willis ◽

Carolyn R. Harper

Keyword(s):

Stochastic Dynamic Programming ◽

Septic Systems ◽

Stochastic Dynamic ◽

Sequential Decision ◽

Decision Framework ◽

Agricultural Pesticides ◽

Long Run ◽

Fuel Storage ◽

Uncertain Outcomes ◽

Benefit Cost

In increasing numbers, communities that rely on groundwater for drinking supplies have discovered contamination from agricultural pesticides and herbicides, road salt, underground fuel storage, and septic systems. A variety of short- and long-run remedies are available with highly uncertain outcomes. An appropriate technique for solving a benefit-cost problem of this type is a sequential decision framework using stochastic dynamic programming procedures for solution. The approach is illustrated here by means of an application to the problem of the recent contamination of the groundwater of Whately, Massachusetts by the agricultural fumigant EDB and the pesticide aldicarb.

Download Full-text