Refined knowledge-gradient policy for learning probabilities

The performance of clustering depends on an appropriately defined similarity between two items. When similarity is measured by human perception, human workers are often employed to estimate a similarity score between items in order to support clustering, a procedure called crowdsourced clustering. When a monetary reward is paid to a worker for each similarity score, and both the similarities between pairs and the workers' reliability vary widely, a limited budget makes it critical to assign pairs of items to workers wisely in order to optimize the clustering result. We model this budget allocation problem as a Markov decision process in which item pairs are dynamically assigned to workers based on the similarity scores they have provided so far. We propose an optimistic knowledge gradient policy in which the assignment of items at each stage is based on the minimum-weight K-cut defined on a similarity graph. We provide simulation studies and real data analysis to demonstrate the performance of the proposed method.

The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery

INFORMS Journal on Computing ◽

10.1287/ijoc.1100.0417 ◽

2011 ◽

Vol 23 (3) ◽

pp. 346-363 ◽

Cited By ~ 43

Author(s):

Diana M. Negoescu ◽

Peter I. Frazier ◽

Warren B. Powell

Keyword(s):

Drug Discovery ◽

Gradient Algorithm ◽

Knowledge Gradient

The conjunction of the knowledge gradient and the economic approach to simulation selection

Proceedings of the 2009 Winter Simulation Conference (WSC) ◽

10.1109/wsc.2009.5429722 ◽

2009 ◽

Cited By ~ 10

Author(s):

Stephen E. Chick ◽

Peter Frazier

Keyword(s):

Economic Approach ◽

Knowledge Gradient

The knowledge-gradient stopping rule for ranking and selection

2008 Winter Simulation Conference ◽

10.1109/wsc.2008.4736082 ◽

2008 ◽

Cited By ~ 13

Author(s):

Peter Frazier ◽

Warren B. Powell

Keyword(s):

Stopping Rule ◽

Ranking And Selection ◽

Knowledge Gradient

On the Identification and Mitigation of Weaknesses in the Knowledge Gradient Policy for Multi-Armed Bandits

Probability in the Engineering and Informational Sciences ◽

10.1017/s0269964816000279 ◽

2016 ◽

Vol 31 (2) ◽

pp. 239-263 ◽

Cited By ~ 2

Author(s):

James Edwards ◽

Paul Fearnhead ◽

Kevin Glazebrook

Keyword(s):

Decision Making ◽

Exponential Family ◽

Numerical Study ◽

Ranking And Selection ◽

Gittins Index ◽

Selection Problems ◽

Index Policies ◽

Online Decision Making ◽

New Policies ◽

Knowledge Gradient

The knowledge gradient (KG) policy was originally proposed for online ranking and selection problems but has recently been adapted for use in online decision-making in general and multi-armed bandit problems (MABs) in particular. We study its use in a class of exponential family MABs and identify weaknesses, including a propensity to take actions which are dominated with respect to both exploitation and exploration. We propose variants of KG which avoid such errors. These new policies include an index heuristic, which deploys a KG approach to develop an approximation to the Gittins index. A numerical study shows this policy to perform well over a range of MABs including those for which index policies are not optimal. While KG does not take dominated actions when bandits are Gaussian, it fails to be index consistent and appears not to enjoy a performance advantage over competitor policies when arms are correlated to compensate for its greater computational demands.
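
To make the idea concrete, the following is a minimal sketch of a one-step knowledge gradient rule for a Bernoulli bandit with independent Beta priors, one member of the exponential family the abstract refers to. It is not the authors' code and does not implement their proposed variants. The function names `kg_factor` and `kg_choose`, the Beta(a, b) representation of each arm, and the weighting of the KG factor by the number of remaining pulls are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Prior = Tuple[float, float]  # hypothetical representation: Beta(a, b) per arm


def kg_factor(priors: Sequence[Prior], arm: int) -> float:
    """Expected one-step gain in the best posterior mean from pulling `arm`."""
    means = [a / (a + b) for a, b in priors]
    a, b = priors[arm]
    # Best posterior mean among the other arms (unchanged by this pull).
    best_other = max(
        (m for j, m in enumerate(means) if j != arm), default=float("-inf")
    )
    p_success = means[arm]
    mean_up = (a + 1) / (a + b + 1)   # posterior mean after a success
    mean_down = a / (a + b + 1)       # posterior mean after a failure
    expected_best = (
        p_success * max(mean_up, best_other)
        + (1 - p_success) * max(mean_down, best_other)
    )
    return expected_best - max(means)


def kg_choose(priors: Sequence[Prior], remaining: int) -> int:
    """Pick the arm maximizing mean + remaining * KG factor (assumed weighting)."""
    scores: List[float] = [
        a / (a + b) + remaining * kg_factor(priors, i)
        for i, (a, b) in enumerate(priors)
    ]
    return max(range(len(scores)), key=scores.__getitem__)
```

With priors `[(1, 1), (5, 5)]` both arms have posterior mean 0.5, but the first arm is far less certain, so its KG factor is larger and `kg_choose` prefers it whenever `remaining` is positive. With `remaining` set to 0 the rule reduces to greedy selection on posterior means.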

Asynchronous knowledge gradient policy for ranking and selection

Proceedings of the Winter Simulation Conference 2014 ◽

10.1109/wsc.2014.7020206 ◽

2014 ◽

Author(s):

Bogumil Kaminski ◽

Przemyslaw Szufel

Keyword(s):

Ranking And Selection ◽

Knowledge Gradient

Expertise as a domain of epistemics in intensive care shift-handovers

Discourse Studies ◽

10.1177/14614456211016801 ◽

2021 ◽

pp. 146144562110168

Author(s):

Paulien Harms ◽

Tom Koole ◽

Ninke Stukker ◽

Jaap Tulleken

Keyword(s):

Intensive Care ◽

Medical Information ◽

Professional Competence ◽

Factual Knowledge ◽

Clinical Impression ◽

Knowing How ◽

Clinical Procedures ◽

Resident Physicians ◽

Horizontal Expertise ◽

Knowledge Gradient

This paper examines how expertise is treated as a separable domain of epistemics by looking at simulated intensive care shift-handovers between resident physicians. In these handovers, medical information about a patient is transferred from an outgoing physician (OP) to an incoming physician (IP). These handovers contain different interactional activities, such as discussing the patient identifiers, giving a clinical impression, and discussing tasks and focus points. We found that with respect to (factual) knowledge about the patient, the OPs display an orientation to a knowledge imbalance, but with respect to (clinical) procedures, reasoning, and activities, they display an orientation to a knowledge balance. We use ‘expertise’ to refer to this latter type of knowledge. ‘Expertise’ differs from, and adds to, how knowledge is often treated in epistemics in that it is concerned with professional competence or ‘knowing how’. In terms of epistemics, the participants in the handovers orient to a steep epistemic or knowledge gradient when it concerns the patient, while simultaneously displaying an orientation to a horizontal expertise gradient.
