The optimal value of Markov stopping problems with one-step look ahead policy

This paper treats stopping problems on Markov chains in which the OLA (one-step look ahead) policy is optimal. Its associated optimal value can be explicitly expressed by a potential for a charge function of the difference between the immediate reward and the one-step-after reward. As an application to the best choice problem, we shall obtain the value of three problems: the classical secretary problem, a problem with a refusal probability and a problem with a random number of objects.
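To make the classical secretary case concrete, here is a minimal sketch; it is not the paper's own derivation. It assumes the usual embedding of the problem as a Markov chain on the positions of candidates (relatively best objects): stopping at a candidate in position i earns g(i) = i/n, and the chain moves from a candidate at i to the next candidate at j > i with probability i/(j(j-1)). The function name `secretary_ola` is illustrative only.

```python
def secretary_ola(n):
    """OLA rule and its value for the classical secretary problem with n objects.

    Assumed model: states are candidate positions 1..n, stopping reward
    g(i) = i/n, transition probability p(i, j) = i / (j * (j - 1)) for j > i.
    """
    g = [0.0] + [i / n for i in range(1, n + 1)]  # g[i], 1-indexed

    def p(i, j):
        return i / (j * (j - 1))

    # One-step-after reward: expected reward of stopping at the next candidate.
    pg = [0.0] * (n + 1)
    for i in range(1, n + 1):
        pg[i] = sum(p(i, j) * g[j] for j in range(i + 1, n + 1))

    # OLA stopping set: stop when the immediate reward is at least as large.
    stop = [False] + [g[i] >= pg[i] for i in range(1, n + 1)]

    # Value of following the OLA rule, computed backwards over positions.
    v = [0.0] * (n + 1)
    for i in range(n, 0, -1):
        if stop[i]:
            v[i] = g[i]
        else:
            v[i] = sum(p(i, j) * v[j] for j in range(i + 1, n + 1))

    threshold = min(i for i in range(1, n + 1) if stop[i])
    # The first object is always a candidate, so the chain starts in state 1.
    return threshold, v[1]


if __name__ == "__main__":
    print(secretary_ola(10))
```

Under these assumptions, for n = 10 the rule stops at the first candidate from position 4 onward, with value about 0.3987; the quantity g(i) - pg[i] computed along the way is the difference between the immediate and the one-step-after reward that the abstract refers to.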

Urn sampling distributions giving alternate correspondences between two optimal stopping problems

Advances in Applied Probability ◽

10.1017/apr.2016.25 ◽

2016 ◽

Vol 48 (3) ◽

pp. 726-743 ◽

Cited By ~ 1

Author(s):

Mitsushi Tamaki

Keyword(s):

Optimal Stopping ◽

Random Variable ◽

Planning Horizon ◽

Information Model ◽

Secretary Problem ◽

Choice Problem ◽

Sampling Distributions ◽

Best Choice Problem ◽

Optimal Stopping Problems ◽

Bounded Random Variable

Abstract. The best-choice problem and the duration problem, known as versions of the secretary problem, are concerned with choosing an object from those that appear sequentially. Let (B,p) denote the best-choice problem and (D,p) the duration problem when the total number N of objects is a bounded random variable with prior p=(p1, p2,...,pn) for a known upper bound n. Gnedin (2005) discovered the correspondence relation between these two quite different optimal stopping problems. That is, for any given prior p, there exists another prior q such that (D,p) is equivalent to (B,q). In this paper, motivated by his discovery, we attempt to find the alternate correspondence {p(m),m≥0}, i.e. an infinite sequence of priors such that (D,p(m-1)) is equivalent to (B,p(m)) for all m≥1, starting with p(0)=(0,...,0,1). To be more precise, the duration problem is distinguished into (D1,p) or (D2,p), referred to as model 1 or model 2, depending on whether the planning horizon is N or n. The aforementioned problem is model 1. For model 2 as well, we can find a similar alternate correspondence {p[m],m≥0}. We treat both the no-information model and the full-information model and examine the limiting behaviors of their optimal rules and optimal values related to the alternate correspondences as n→∞. A generalization of the no-information model is given. It is worth mentioning that the alternate correspondences for model 1 and model 2 are respectively related to the urn sampling models without replacement and with replacement.
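As a rough illustration of the no-information best-choice problem (B,p) with a bounded random number of objects, the sketch below computes its optimal value by backward induction over candidate positions. It is an assumption-laden sketch rather than the paper's method: it assumes N is independent of the arrival order, that the decision maker learns N only when the sequence ends, and that ending without a selection counts as a loss. The duration problem (D,p) and the correspondence between priors are not covered. The name `best_choice_value` is hypothetical.

```python
def best_choice_value(p):
    """Optimal win probability for the best-choice problem with random N.

    p[k-1] = P(N = k) for k = 1..n (assumed to sum to 1).
    State i means: the i-th object has appeared and is relatively best.
    """
    n = len(p)
    # tail[i] = P(N >= i), 1-indexed, with tail[n + 1] = 0.
    tail = [0.0] * (n + 2)
    for i in range(n, 0, -1):
        tail[i] = tail[i + 1] + p[i - 1]

    v = [0.0] * (n + 1)
    for i in range(n, 0, -1):
        if tail[i] == 0.0:
            continue  # position i can never be reached under this prior
        # Probability that a candidate at i is the overall best, given N >= i.
        stop = sum(p[k - 1] * i / k for k in range(i, n + 1)) / tail[i]
        # Expected value of waiting for the next candidate at some j > i;
        # the factor tail[j] / tail[i] accounts for the sequence ending early.
        cont = sum(
            tail[j] / tail[i] * i / (j * (j - 1)) * v[j]
            for j in range(i + 1, n + 1)
        )
        v[i] = max(stop, cont)
    # The first object always exists (N >= 1) and is always a candidate.
    return v[1]


if __name__ == "__main__":
    print(best_choice_value([0.5, 0.5]))        # N uniform on {1, 2}
    print(best_choice_value([0.0] * 9 + [1.0])) # N = 10 with certainty
```

With the degenerate prior p(0)=(0,...,0,1) the sketch reduces to the classical secretary problem. Full backward induction is used instead of a one-step look-ahead rule because, for a general prior, the look-ahead rule need not be optimal.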

The Best Choice Problem for a Random Number of Objects

Theory of Probability and Its Applications ◽

10.1137/1117078 ◽

1973 ◽

Vol 17 (4) ◽

pp. 657-668 ◽

Cited By ~ 50

Author(s):

E. L. Presman ◽

I. M. Sonin

Keyword(s):

Random Number ◽

Choice Problem ◽

Best Choice Problem

Optimality of the one step look-ahead stopping times

Journal of Applied Probability ◽

10.2307/3213269 ◽

1977 ◽

Vol 14 (1) ◽

pp. 162-169 ◽

Cited By ~ 5

Author(s):

M. Abdel-Hameed

Keyword(s):

Markov Process ◽

Discrete Time ◽

Measurable Function ◽

Infinitesimal Generator ◽

Stopping Time ◽

Stopping Times ◽

Homogeneous Markov Process ◽

Look Ahead ◽

One Step ◽

The One

The optimality of the one step look-ahead stopping rule is shown to hold under conditions different from those discussed by Chow, Robbins and Siegmund [5]. These results are corollaries of the following theorem: Let $\{X_n, n = 0, 1, \ldots\}$, $X_0 = x$, be a discrete-time homogeneous Markov process with state space $(E, \mathcal{B})$. For any $\mathcal{B}$-measurable function $g$ and $\alpha$ in $(0, 1]$, define $A_\alpha g(x) = \alpha E_x g(X_1) - g(x)$ to be the infinitesimal generator of $g$. If $\tau$ is any stopping time satisfying the conditions: $E_x[\alpha^N g(X_N) I(\tau > N)] \to 0$ as $N \to \infty$, then […]. Applications of the results are considered.
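The generator defined in this abstract is easy to evaluate on a finite state space. The sketch below is an illustration under assumptions, not the paper's construction: it takes a finite chain with transition matrix `P`, a reward vector `g` and a discount factor `alpha`, computes αE_x g(X_1) − g(x) for each state, and forms the one step look-ahead stopping set as the states where that quantity is non-positive. The closedness check reflects the familiar "monotone case" sufficient condition; the function names and the three-state example are hypothetical.

```python
def generator(P, g, alpha):
    """Return [alpha * E_x g(X_1) - g(x)] for every state x of a finite chain."""
    n = len(g)
    return [
        alpha * sum(P[x][y] * g[y] for y in range(n)) - g[x]
        for x in range(n)
    ]


def ola_stopping_set(P, g, alpha):
    """States where stopping now is at least as good as stopping one step later."""
    return {x for x, a in enumerate(generator(P, g, alpha)) if a <= 0}


def is_closed(P, states):
    """True if the chain cannot leave `states` once it has entered it."""
    return all(
        P[x][y] == 0
        for x in states
        for y in range(len(P))
        if y not in states
    )


if __name__ == "__main__":
    # Hypothetical chain that drifts upward and is absorbed in state 2.
    P = [
        [0.5, 0.5, 0.0],
        [0.0, 0.5, 0.5],
        [0.0, 0.0, 1.0],
    ]
    g = [0.0, 1.0, 3.0]
    B = ola_stopping_set(P, g, alpha=0.9)
    print(generator(P, g, 0.9), B, is_closed(P, B))
```

In this example the generator is positive in states 0 and 1 and negative in state 2, so the look-ahead rule waits until the chain reaches state 2; that set is closed, which is the situation in which look-ahead rules are typically optimal.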

Influence of the calibration on experimental UV index at a midlatitude site, Granada (Spain)

Atmospheric Measurement Techniques Discussions ◽

10.5194/amtd-3-5645-2010 ◽

2010 ◽

Vol 3 (6) ◽

pp. 5645-5670

Author(s):

M. Antón ◽

J. E. Gil ◽

A. Cazorla ◽

J. M. Vilaplana ◽

F. J. Olmo ◽

...

Keyword(s):

Calibration Method ◽

Conversion Factors ◽

Uv Index ◽

Calibration Methods ◽

One Step ◽

The Difference ◽

Mean Differences ◽

Experimental Values ◽

The One

Abstract. The ultraviolet (UV) index is the variable most commonly used to inform the general public about the levels and potential harmful effects of UV radiation incident at Earth's surface. This variable is derived from the output signal of UV radiometers by applying conversion factors obtained by calibration methods. This paper focuses on the influence of two of these methods (called the one-step and two-steps methods) on the resulting experimental UV Index (UVI) as measured by a YES UVB-1 radiometer located at a midlatitude station, Granada (Spain), for the period 2006–2009. In addition, the difference from the UVI values obtained when the calibration factors provided by the manufacturer are used is also analyzed. For this purpose, the detailed characterization of the UVB-1 radiometer obtained in the first Spanish calibration campaign of broadband UV radiometers at the "El Arenosillo" INTA station in 2007 is used. In addition, modeled UVI data derived from the LibRadtran/UVSPEC radiative transfer code are compared with the experimental values recorded at Granada for cloud-free conditions. The absolute mean differences between the measured and modeled UVI data at Granada are around 5% using the one-step and two-steps calibration methods. This result indicates the excellent performance of these two techniques for obtaining UVI data from the UVB-1 radiometer. In contrast, the application of the calibration factor supplied by the manufacturer produces a high overestimation (~14%) of the UVI values. This produces unreliable, alarmingly high UVI data in summer when the manufacturer's factor is used. Thus, days with an extreme erythemal risk (UVI higher than 10) rise to 46% of all cases measured between May and September at Granada when the manufacturer's factor is applied. This percentage is reduced to a more reliable value of 3% when the conversion factors obtained with the two-steps calibration method are used. These results point to the need for a sound calibration of broadband UV instruments in order to obtain reliable measurements.

Asymptotic results for the best-choice problem with a random number of objects

Journal of Applied Probability ◽

10.2307/3213614 ◽

1984 ◽

Vol 21 (3) ◽

pp. 521-536 ◽

Cited By ~ 5

Author(s):

Masami Yasuda

Keyword(s):

Integral Equation ◽

Markov Decision Processes ◽

Random Number ◽

Scaling Limit ◽

Decision Processes ◽

Choice Problem ◽

Asymptotic Results ◽

Optimality Equation ◽

Best Choice Problem ◽

Markov Decision

This paper considers the best-choice problem with a random number of objects having a known distribution. The optimality equation of the problem reduces to an integral equation by a scaling limit. The equation is explicitly solved under conditions on the distribution, which relate to the condition for an OLA policy to be optimal in Markov decision processes. This technique is then applied to three different versions of the problem and an exact value for the asymptotic optimal strategy is found.

The full-information best choice problem with a random number of observations

Stochastic Processes and their Applications ◽

10.1016/0304-4149(87)90020-2 ◽

1987 ◽

Vol 24 (2) ◽

pp. 293-307 ◽

Cited By ~ 33

Author(s):

Zdzisław Porosiński

Keyword(s):

Random Number ◽

Full Information ◽

Choice Problem ◽

Best Choice Problem

Characterization of the monotone case for a best choice problem with a random number of objects

Statistics & Probability Letters ◽

10.1016/s0167-7152(02)00038-x ◽

2002 ◽

Vol 56 (4) ◽

pp. 419-423

Author(s):

Zdzisław Porosiński

Keyword(s):

Random Number ◽

Choice Problem ◽

Best Choice Problem

Asymptotic results for the best-choice problem with a random number of objects

Journal of Applied Probability ◽

10.1017/s0021900200028722 ◽

1984 ◽

Vol 21 (03) ◽

pp. 521-536 ◽

Cited By ~ 3

Author(s):

Masami Yasuda

Keyword(s):

Integral Equation ◽

Markov Decision Processes ◽

Random Number ◽

Scaling Limit ◽

Decision Processes ◽

Choice Problem ◽

Asymptotic Results ◽

Optimality Equation ◽

Best Choice Problem ◽

Markov Decision

This paper considers the best-choice problem with a random number of objects having a known distribution. The optimality equation of the problem reduces to an integral equation by a scaling limit. The equation is explicitly solved under conditions on the distribution, which relate to the condition for an OLA policy to be optimal in Markov decision processes. This technique is then applied to three different versions of the problem and an exact value for the asymptotic optimal strategy is found.

The Reliability Analysis of Mechanical Leg (2UPS+UP) Based on a Discrete Time Repairable Markov Model

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.713-715.760 ◽

2015 ◽

Vol 713-715 ◽

pp. 760-763

Author(s):

Jia Lei Zhang ◽

Zhen Lin Jin ◽

Dong Mei Zhao

Keyword(s):

Steady State ◽

Discrete Time ◽

Transition Probability ◽

Continuous Model ◽

Transition Probability Matrix ◽

State Equations ◽

One Step ◽

The Difference ◽

Step Transition ◽

The One

In our previous work we analyzed some reliability problems of the 2UPS+UP mechanism using a continuous-time Markov repairable model. Because the robot is checked and repaired periodically, a discrete-time Markov repairable model should be more appropriate. First, we built the discrete-time repairable model and obtained its one-step transition probability matrix. Second, we solved the steady-state equations to obtain the steady-state availability of the mechanical leg, and obtained the reliability and the mean time to first failure by solving the difference equations. Finally, we compared the reliability indexes with those of the continuous model.
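The workflow described here (one-step transition matrix, steady-state availability, reliability, mean time to first failure) can be sketched for a generic discrete-time repairable chain. The example below is only an illustration: the two-state matrix, its failure probability 0.1 and repair probability 0.4 per inspection period are made-up values, not the 2UPS+UP model or its data, and the function names are hypothetical. The stationary distribution is found by simple iteration rather than by solving the steady-state equations in closed form.

```python
def steady_state(P, iters=10_000, tol=1e-13):
    """Approximate the stationary distribution by iterating pi <- pi P."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


def reliability_curve(P, up, start, horizon):
    """R(k) = P(no failure during the first k periods | X_0 = start)."""
    dist = {s: (1.0 if s == start else 0.0) for s in up}
    curve = []
    for _ in range(horizon + 1):
        curve.append(sum(dist.values()))
        # Advance one period, keeping only the mass that stays in up states.
        dist = {j: sum(dist[i] * P[i][j] for i in up) for j in up}
    return curve


if __name__ == "__main__":
    # Hypothetical chain: state 0 = working, state 1 = failed (under repair).
    P = [
        [0.9, 0.1],
        [0.4, 0.6],
    ]
    up = [0]
    pi = steady_state(P)
    availability = sum(pi[s] for s in up)
    curve = reliability_curve(P, up, start=0, horizon=500)
    mttff = sum(curve)  # mean time to first failure, truncated at the horizon
    print(availability, curve[1], mttff)
```

For these assumed numbers the availability is 0.4 / (0.1 + 0.4) = 0.8 and the mean time to first failure is about 10 periods, which matches the geometric waiting time 1 / 0.1.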
