When to consult precision-recall curves

Receiver operating characteristic (ROC) curves are commonly used to evaluate predictions of binary outcomes. When there is a small percentage of items of interest (as would be the case with fraud detection, for example), ROC curves can provide an inflated view of performance. This can cause challenges in determining which set of predictions is better. In this article, we discuss the conditions under which precision-recall curves may be preferable to ROC curves. As an illustrative example, we compare two commonly used fraud predictors (Beneish’s [1999, Financial Analysts Journal 55: 24–36] M score and Dechow et al.’s [2011, Contemporary Accounting Research 28: 17–82] F score) using both ROC and precision-recall curves. To aid the reader with using precision-recall curves, we also introduce the command prcurve to plot them.

Download Full-text

Use of Receiver Operating Characteristic (ROC) Curves to Evaluate Computer Confidence Threshold and Clinical Performance in the Diagnosis of Appendicitis

Methods of Information in Medicine ◽

10.1055/s-0038-1636435 ◽

1978 ◽

Vol 17 (03) ◽

pp. 157-161 ◽

Cited By ~ 12

Author(s):

F. T. De Dombal ◽

Jane C. Horrocks

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Clinical Performance ◽

Roc Curves ◽

Final Diagnosis ◽

Confidence Threshold ◽

Computer Aided ◽

Computer Confidence ◽

Overall Performance ◽

Receiver Operating

This paper uses simple receiver operating characteristic (ROC) curves (i) to study the effect of varying computer confidence of threshold levels and (ii) to evaluate clinical performance in the diagnosis of acute appendicitis. Over 1300 patients presenting to five centres with abdominal pain of short duration were studied in varying detail. Clinical and computer-aided diagnostic predictions were compared with the »final« diagnosis. From these studies it is concluded the simplistic setting of a 50/50 confidence threshold for the computer program is as »good« as any other. The proximity of a computer-aided system changed clinical behaviour patterns; a higher overall performance level was achieved and clinicians performance levels became associated with the »mildly conservative« end of the computers ROC curve. Prior forecasts of over-confidence or ultra-caution amongst clinicians using the computer-aided system have not been fulfilled.

Download Full-text

MicroRNAs-1299, -126-3p and -30e-3p as Potential Diagnostic Biomarkers for Prediabetes

Diagnostics ◽

10.3390/diagnostics11060949 ◽

2021 ◽

Vol 11 (6) ◽

pp. 949

Author(s):

Cecil J. Weale ◽

Don M. Matshazi ◽

Saarah F. G. Davids ◽

Shanel Raghubeer ◽

Rajiv T. Erasmus ◽

...

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Characteristic Curve ◽

Predictive Ability ◽

Roc Curves ◽

Tolerance Test ◽

Cross Sectional Study ◽

Oral Glucose ◽

Cross Sectional ◽

Receiver Operating

This cross-sectional study investigated the association of miR-1299, -126-3p and -30e-3p with and their diagnostic capability for dysglycaemia in 1273 (men, n = 345) South Africans, aged >20 years. Glycaemic status was assessed by oral glucose tolerance test (OGTT). Whole blood microRNA (miRNA) expressions were assessed using TaqMan-based reverse transcription quantitative-PCR (RT-qPCR). Receiver operating characteristic (ROC) curves assessed the ability of each miRNA to discriminate dysglycaemia, while multivariable logistic regression analyses linked expression with dysglycaemia. In all, 207 (16.2%) and 94 (7.4%) participants had prediabetes and type 2 diabetes mellitus (T2DM), respectively. All three miRNAs were significantly highly expressed in individuals with prediabetes compared to normotolerant patients, p < 0.001. miR-30e-3p and miR-126-3p were also significantly more expressed in T2DM versus normotolerant patients, p < 0.001. In multivariable logistic regressions, the three miRNAs were consistently and continuously associated with prediabetes, while only miR-126-3p was associated with T2DM. The ROC analysis indicated all three miRNAs had a significant overall predictive ability to diagnose prediabetes, diabetes and the combination of both (dysglycaemia), with the area under the receiver operating characteristic curve (AUC) being significantly higher for miR-126-3p in prediabetes. For prediabetes diagnosis, miR-126-3p (AUC = 0.760) outperformed HbA1c (AUC = 0.695), p = 0.042. These results suggest that miR-1299, -126-3p and -30e-3p are associated with prediabetes, and measuring miR-126-3p could potentially contribute to diabetes risk screening strategies.

Download Full-text

Correction to: Cut-off points between pain intensities of the postoperative pain using receiver operating characteristic (ROC) curves

BMC Anesthesiology ◽

10.1186/s12871-021-01410-w ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Sooyoung Cho ◽

Youn Jin Kim ◽

Minjin Lee ◽

Jae Hee Woo ◽

Hyun Jung Lee

Keyword(s):

Postoperative Pain ◽

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Roc Curves ◽

Receiver Operating

Download Full-text

SOME EFFECTS OF IMAGE SEGMENTATION ON SUBSPACE-BASED AND COVARIANCE-BASED DETECTION OF ANOMALOUS SUB-PIXEL MATERIALS

International Journal of High Speed Electronics and Systems ◽

10.1142/s0129156408005394 ◽

2008 ◽

Vol 18 (02) ◽

pp. 349-367

Author(s):

CHRISTOPHER GITTINS ◽

DAISEI KONNO ◽

MICHAEL HOKE ◽

ANTHONY RATKOWSKI

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Roc Curves ◽

Synthetic Spectrum ◽

Ratio Test ◽

Probability Of False Alarm ◽

Near Ir ◽

Data Segmentation ◽

Material Detection ◽

Receiver Operating

In this paper we assess the effect that clustering pixels into spectrally-similar background types, for example, soil, vegetation, and water in hyperspectral visible/near-IR/SWIR imagery, prior to applying a detection methodology has on material detection statistics. Specifically, we examine the effects of data segmentation on two statistically-based detection metrics, the Subspace Generalized Likelihood Ratio Test (Subspace GLRT) and the Adaptive Cosine Estimator (ACE), applied to a publicly-available AVIRIS datacube augmented with a synthetic material spectrum in selected pixels. The use of synthetic spectrum-augmented data enables quantitative comparison of Subspace-GLRT and ACE using Receiver Operating Characteristic (ROC) curves. For all cases investigated, Receiver Operating Characteristic (ROC) curves generated using ACE were as good as or superior to those generated using Subspace-GLRT. The favorability of ACE over Subspace-GLRT was more pronounced as the synthetic spectrum mixing fraction decreased. For probabilities of detection in the range of 50-80%, segmentation reduced the probability of false alarm by a factor of 3–5 when using ACE. In contrast, segmentation had no apparent effect on detection statistics using Subspace-GLRT, in this example.

Download Full-text

Minimum-Norm Estimation for Binormal Receiver Operating Characteristic (ROC) Curves

Biometrical Journal ◽

10.1002/bimj.200900128 ◽

2009 ◽

pp. NA-NA

Author(s):

Ori Davidov ◽

Yuval Nov

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Roc Curves ◽

Minimum Norm ◽

Receiver Operating

Download Full-text

A Simulation Based Study for Comparing Tests Associated With Receiver Operating Characteristic (ROC) Curves

Communications in Statistics - Simulation and Computation ◽

10.1080/03610918.2012.752840 ◽

2014 ◽

Vol 43 (10) ◽

pp. 2444-2467 ◽

Cited By ~ 1

Author(s):

D. N. Jayasekara ◽

M. R. Sooriyarachchi

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Roc Curves ◽

Simulation Based ◽

Receiver Operating

Download Full-text

Receiver Operating Characteristic (ROC) Curves

Wiley StatsRef: Statistics Reference Online ◽

10.1002/9781118445112.stat05255 ◽

2014 ◽

Cited By ~ 3

Author(s):

J. A. Hanley

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Roc Curves ◽

Receiver Operating

Download Full-text

Identifying the Effects of Sex on Reactive Strength Scores using Receiver Operating Characteristic (ROC) Curves

Medicine & Science in Sports & Exercise ◽

10.1249/01.mss.0000536527.61375.b1 ◽

2018 ◽

Vol 50 (5S) ◽

pp. 439

Author(s):

Lara Boman ◽

Jordan Preuss ◽

Jake Rosburg ◽

Nile Banks ◽

Talin Louder

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Roc Curves ◽

Receiver Operating

Download Full-text

A Bayesian approach to incorporating maximum entropy‐derived signal parameter statistics into the receiver operating characteristic (ROC) curves

The Journal of the Acoustical Society of America ◽

10.1121/1.4809120 ◽

2005 ◽

Vol 118 (3) ◽

pp. 1935-1935

Author(s):

R. Lee Culver ◽

Leon H. Sibul ◽

David L. Bradley ◽

Jeffrey A. Ballard ◽

H. John Camin

Keyword(s):

Receiver Operating Characteristic ◽

Maximum Entropy ◽

Bayesian Approach ◽

Operating Characteristic ◽

Roc Curves ◽

Signal Parameter ◽

Receiver Operating

Download Full-text

Comparative Assessment of Three Common Algorithms for Estimating the Variance of the Area under the Nonparametric Receiver Operating Characteristic Curve

The Stata Journal Promoting communications on statistics and Stata ◽

10.1177/1536867x0200200304 ◽

2002 ◽

Vol 2 (3) ◽

pp. 280-289 ◽

Cited By ~ 8

Author(s):

Mario A. Cleves

Keyword(s):

Receiver Operating Characteristic ◽

Diagnostic Test ◽

Roc Curve ◽

Operating Characteristic ◽

Characteristic Curve ◽

Simulated Data ◽

Small Samples ◽

Binary Outcomes ◽

Discriminatory Accuracy ◽

Receiver Operating

The area under the receiver operating characteristic (ROC) curve is often used to summarize and compare the discriminatory accuracy of a diagnostic test or modality, and to evaluate the predictive power of statistical models for binary outcomes. Parametric maximum likelihood methods for fitting of the ROC curve provide direct estimates of the area under the ROC curve and its variance. Nonparametric methods, on the other hand, provide estimates of the area under the ROC curve, but do not directly estimate its variance. Three algorithms for computing the variance for the area under the nonparametric ROC curve are commonly used, although ambiguity exists about their behavior under diverse study conditions. Using simulated data, we found similar asymptotic performance between these algorithms when the diagnostic test produces results on a continuous scale, but found notable differences in small samples, and when the diagnostic test yields results on a discrete diagnostic scale.

Download Full-text