Probability Score Decompositions as Complements or Alternatives to ROC Analyses

1989 ◽  
Author(s):  
J. Frank Yates ◽  
Ilan Yaniv ◽  
Juwhei Lee ◽  
J. E. Keith Smith
Keyword(s):  
2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Edward Wheatcroft

Abstract A scoring rule is a function of a probabilistic forecast and a corresponding outcome used to evaluate forecast performance. There is some debate as to which scoring rules are most appropriate for evaluating forecasts of sporting events. This paper focuses on forecasts of the outcomes of football matches. The ranked probability score (RPS) is often recommended since it is ‘sensitive to distance’, that is it takes into account the ordering in the outcomes (a home win is ‘closer’ to a draw than it is to an away win). In this paper, this reasoning is disputed on the basis that it adds nothing in terms of the usual aims of using scoring rules. A local scoring rule is one that only takes the probability placed on the outcome into consideration. Two simulation experiments are carried out to compare the performance of the RPS, which is non-local and sensitive to distance, the Brier score, which is non-local and insensitive to distance, and the Ignorance score, which is local and insensitive to distance. The Ignorance score outperforms both the RPS and the Brier score, casting doubt on the value of non-locality and sensitivity to distance as properties of scoring rules in this context.


2013 ◽  
Vol 22 (2) ◽  
pp. 73-79 ◽  
Author(s):  
Michele Iester ◽  
Francesco Oddone ◽  
Mirko Prato ◽  
Marco Centofanti ◽  
Paolo Fogagnolo ◽  
...  

CHEST Journal ◽  
2019 ◽  
Vol 156 (4) ◽  
pp. A1003
Author(s):  
Audrey Jernigan ◽  
Xian Qiao ◽  
Victoria Okhomina ◽  
Kathleen Waybill ◽  
Markos Kashiouris ◽  
...  

2021 ◽  
Author(s):  
Surya P. Bhatt ◽  
Pallavi P. Balte ◽  
Joseph E. Schwartz ◽  
Byron C. Jaeger ◽  
Patricia A. Cassano ◽  
...  

2015 ◽  
pp. 373-378 ◽  
Author(s):  
Seyedeh Atefeh Mohammadi ◽  
Morteza Rahmani ◽  
Majid Azadi

2020 ◽  
Vol 16 (1) ◽  
pp. 27-39
Author(s):  
Mitchell Pearson ◽  
Glen Livingston Jr ◽  
Robert King

AbstractPredictive football modelling has become progressively popular over the last two decades. Due to this, numerous studies have proposed different types of statistical models to predict the outcome of a football match. This study provides a review of three different models published in the academic literature and then implements these on recent match data from the top football leagues in Europe. These models are then compared utilising the rank probability score to assess their predictive capability. Additionally, a modification is proposed which includes the travel distance of the away team. When tested on football leagues from both Australia and Russia, it is shown to improve predictive capability according to the rank probability score.


2010 ◽  
Vol 14 (11) ◽  
pp. 2303-2317 ◽  
Author(s):  
J. A. Velázquez ◽  
F. Anctil ◽  
C. Perrin

Abstract. This work investigates the added value of ensembles constructed from seventeen lumped hydrological models against their simple average counterparts. It is thus hypothesized that there is more information provided by all the outputs of these models than by their single aggregated predictors. For all available 1061 catchments, results showed that the mean continuous ranked probability score of the ensemble simulations were better than the mean average error of the aggregated simulations, confirming the added value of retaining all the components of the model outputs. Reliability of the simulation ensembles is also achieved for about 30% of the catchments, as assessed by rank histograms and reliability plots. Nonetheless this imperfection, the ensemble simulations were shown to have better skills than the deterministic simulations at discriminating between events and non-events, as confirmed by relative operating characteristic scores especially for larger streamflows. From 7 to 10 models are deemed sufficient to construct ensembles with improved performance, based on a genetic algorithm search optimizing the continuous ranked probability score. In fact, many model subsets were found improving the performance of the reference ensemble. This is thus not essential to implement as much as seventeen lumped hydrological models. The gain in performance of the optimized subsets is accompanied by some improvement of the ensemble reliability in most cases. Nonetheless, a calibration of the predictive distribution is still needed for many catchments.


Sign in / Sign up

Export Citation Format

Share Document