The Best Choice Problem for a Random Number of Objects

This paper treats stopping problems on Markov chains in which the OLA (one-step look ahead) policy is optimal. Its associated optimal value can be explicitly expressed by a potential for a charge function of the difference between the immediate reward and the one-step-after reward. As an application to the best choice problem, we shall obtain the value of three problems: the classical secretary problem, a problem with a refusal probability and a problem with a random number of objects.


Asymptotic results for the best-choice problem with a random number of objects

Journal of Applied Probability ◽

10.2307/3213614 ◽

1984 ◽

Vol 21 (3) ◽

pp. 521-536 ◽

Cited By ~ 5

Author(s):

Masami Yasuda

Keyword(s):

Integral Equation ◽

Markov Decision Processes ◽

Random Number ◽

Scaling Limit ◽

Decision Processes ◽

Choice Problem ◽

Asymptotic Results ◽

Optimality Equation ◽

Best Choice Problem ◽

Markov Decision

This paper considers the best-choice problem with a random number of objects having a known distribution. The optimality equation of the problem reduces to an integral equation by a scaling limit. The equation is explicitly solved under conditions on the distribution, which relate to the condition for an OLA policy to be optimal in Markov decision processes. This technique is then applied to three different versions of the problem and an exact value for the asymptotic optimal strategy is found.
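To make the setting concrete, here is a minimal Python sketch that evaluates simple threshold rules ("skip the first r-1 objects, then take the first relatively best one") when the number of objects N has a known prior. It is not the scaling-limit method of the paper. The function names and the uniform prior in the usage example are illustrative assumptions, and threshold rules are not guaranteed to be optimal for every prior.

```python
def win_given_size(m, r):
    """P(select the overall best | N = m) for the rule: skip the first
    r-1 objects, then stop at the first relatively best object."""
    if m < r:
        return 0.0      # the sequence ends before the rule is allowed to stop
    if r == 1:
        return 1.0 / m  # the first object is taken; it is best with prob. 1/m
    # Standard secretary-problem formula for a fixed number m of objects.
    return (r - 1) / m * sum(1.0 / j for j in range(r - 1, m))


def threshold_value(prior, r):
    """Win probability of threshold r, averaged over the prior {m: P(N = m)}."""
    return sum(p * win_given_size(m, r) for m, p in prior.items())


def best_threshold(prior):
    """Best rule within the threshold family (assumption: we search only
    this family, which need not contain the overall optimal rule)."""
    n = max(prior)
    return max(range(1, n + 1), key=lambda r: threshold_value(prior, r))


if __name__ == "__main__":
    n = 200
    uniform_prior = {m: 1.0 / n for m in range(1, n + 1)}  # illustrative prior
    r = best_threshold(uniform_prior)
    print(r, threshold_value(uniform_prior, r))
```

With a prior concentrated on a single value of N, the sketch reduces to the classical secretary problem.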


The full-information best choice problem with a random number of observations

Stochastic Processes and their Applications ◽

10.1016/0304-4149(87)90020-2 ◽

1987 ◽

Vol 24 (2) ◽

pp. 293-307 ◽

Cited By ~ 33

Author(s):

Zdzisław Porosiński

Keyword(s):

Random Number ◽

Full Information ◽

Choice Problem ◽

Best Choice Problem


Characterization of the monotone case for a best choice problem with a random number of objects

Statistics & Probability Letters ◽

10.1016/s0167-7152(02)00038-x ◽

2002 ◽

Vol 56 (4) ◽

pp. 419-423

Author(s):

Zdzisław Porosiński

Keyword(s):

Random Number ◽

Choice Problem ◽

Best Choice Problem


Asymptotic results for the best-choice problem with a random number of objects

Journal of Applied Probability ◽

10.1017/s0021900200028722 ◽

1984 ◽

Vol 21 (03) ◽

pp. 521-536 ◽

Cited By ~ 3

Author(s):

Masami Yasuda

Keyword(s):

Integral Equation ◽

Markov Decision Processes ◽

Random Number ◽

Scaling Limit ◽

Decision Processes ◽

Choice Problem ◽

Asymptotic Results ◽

Optimality Equation ◽

Best Choice Problem ◽

Markov Decision

This paper considers the best-choice problem with a random number of objects having a known distribution. The optimality equation of the problem reduces to an integral equation by a scaling limit. The equation is explicitly solved under conditions on the distribution, which relate to the condition for an OLA policy to be optimal in Markov decision processes. This technique is then applied to three different versions of the problem and an exact value for the asymptotic optimal strategy is found.


The optimal value of Markov stopping problems with one-step look ahead policy

Journal of Applied Probability ◽

10.1017/s0021900200041267 ◽

1988 ◽

Vol 25 (03) ◽

pp. 544-552 ◽

Cited By ~ 1

Author(s):

Masami Yasuda

Keyword(s):

Random Number ◽

Secretary Problem ◽

Choice Problem ◽

Best Choice Problem ◽

Look Ahead ◽

Optimal Value ◽

One Step

This paper treats stopping problems on Markov chains in which the OLA (one-step look ahead) policy is optimal. Its associated optimal value can be explicitly expressed by a potential for a charge function of the difference between the immediate reward and the one-step-after reward. As an application to the best choice problem, we shall obtain the value of three problems: the classical secretary problem, a problem with a refusal probability and a problem with a random number of objects.
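For the classical secretary problem mentioned above, the one-step look ahead comparison can be sketched in a few lines of Python. The sketch assumes the standard textbook formulas rather than the potential-theoretic expression of the paper: stopping on a relatively best object at position r wins with probability r/n, while waiting for the next relatively best object wins with probability (r/n) times the sum of 1/j for j = r, ..., n-1. The function names are illustrative.

```python
from fractions import Fraction


def ola_threshold(n):
    """First position r at which stopping on a relatively best object is
    at least as good as stopping on the next relatively best object."""
    for r in range(1, n + 1):
        # Ratio (one-step-after reward) / (immediate reward) at position r.
        waiting_ratio = sum(Fraction(1, j) for j in range(r, n))
        if waiting_ratio <= 1:
            return r


def win_probability(n, r):
    """P(select the best of n) for the rule: skip the first r-1 objects,
    then stop at the first relatively best object."""
    if r == 1:
        return Fraction(1, n)
    return Fraction(r - 1, n) * sum(Fraction(1, j) for j in range(r - 1, n))


if __name__ == "__main__":
    for n in (4, 10, 100):
        r = ola_threshold(n)
        print(n, r, float(win_probability(n, r)))
```

Exact fractions are used so that the comparison with 1 is not affected by rounding; for large n, floating-point sums would be faster.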


The best choice problem with random arrivals: How to beat the 1/e-strategy

Stochastic Processes and their Applications ◽

10.1016/j.spa.2021.12.008 ◽

2021 ◽

Author(s):

Alexander Gnedin

Keyword(s):

Choice Problem ◽

Best Choice Problem


Estimating Optimal Stopping Rules in the Multiple Best Choice Problem with Minimal Summarized Rank via the Cross-Entropy Method

Evolutionary Learning and Optimization - Exploitation of Linkage Learning in Evolutionary Algorithms ◽

10.1007/978-3-642-12834-9_11 ◽

2010 ◽

pp. 227-241 ◽

Cited By ~ 1

Author(s):

T. V. Polushina

Keyword(s):

Optimal Stopping ◽

Entropy Method ◽

Cross Entropy ◽

Stopping Rules ◽

Choice Problem ◽

Best Choice Problem ◽

Cross Entropy Method ◽

Optimal Stopping Rules


The Full-Information Best Choice Problem with Two Choices

Operations Research ’91 ◽

10.1007/978-3-642-48417-9_77 ◽

1992 ◽

pp. 278-281 ◽

Cited By ~ 1

Author(s):

Zdzisław Porosiński

Keyword(s):

Full Information ◽

Choice Problem ◽

Best Choice Problem


Urn sampling distributions giving alternate correspondences between two optimal stopping problems

Advances in Applied Probability ◽

10.1017/apr.2016.25 ◽

2016 ◽

Vol 48 (3) ◽

pp. 726-743 ◽

Cited By ~ 1

Author(s):

Mitsushi Tamaki

Keyword(s):

Optimal Stopping ◽

Random Variable ◽

Planning Horizon ◽

Information Model ◽

Secretary Problem ◽

Choice Problem ◽

Sampling Distributions ◽

Best Choice Problem ◽

Optimal Stopping Problems ◽

Bounded Random Variable

The best-choice problem and the duration problem, known as versions of the secretary problem, are concerned with choosing an object from those that appear sequentially. Let (B, p) denote the best-choice problem and (D, p) the duration problem when the total number N of objects is a bounded random variable with prior p = (p_1, p_2, ..., p_n) for a known upper bound n. Gnedin (2005) discovered a correspondence between these two quite different optimal stopping problems: for any given prior p, there exists another prior q such that (D, p) is equivalent to (B, q). In this paper, motivated by his discovery, we attempt to find the alternate correspondence {p(m), m ≥ 0}, i.e. an infinite sequence of priors such that (D, p(m-1)) is equivalent to (B, p(m)) for all m ≥ 1, starting with p(0) = (0, ..., 0, 1). To be more precise, the duration problem is divided into (D_1, p) and (D_2, p), referred to as model 1 and model 2, depending on whether the planning horizon is N or n. The aforementioned problem is model 1. For model 2 as well, we can find a similar alternate correspondence {p[m], m ≥ 0}. We treat both the no-information model and the full-information model and examine the limiting behavior of their optimal rules and optimal values related to the alternate correspondences as n → ∞. A generalization of the no-information model is given. It is worth mentioning that the alternate correspondences for model 1 and model 2 are related to urn sampling models without replacement and with replacement, respectively.
