scoring rules: Recently Published Documents

Total documents: 316 (five years: 68)
H-index: 33 (five years: 3)

2022
Author(s): Zachary J. Smith, J. Eric Bickel

In Weighted Scoring Rules and Convex Risk Measures, Dr. Zachary J. Smith and Prof. J. Eric Bickel (both at the University of Texas at Austin) present a general connection between weighted proper scoring rules and investment decisions that involve minimizing a convex risk measure. Weighted scoring rules are quantitative tools for evaluating the accuracy of probabilistic forecasts relative to a baseline distribution. The authors demonstrate that this connection between convex risk measures and weighted scoring rules aligns closely with earlier economic characterizations of weighted scores based on expected-utility maximization. As illustrative examples, they study two families of weighted scoring rules based on phi-divergences (generalizations of the Weighted Power and Weighted Pseudospherical scoring rules) along with their corresponding risk measures. The paper will be of particular interest to the decision analysis and mathematical finance communities, as well as to those interested in the elicitation and evaluation of subjective probabilistic forecasts.
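
As a rough illustration of the baseline-relative idea (not the authors' phi-divergence families), the Python sketch below scores a three-outcome forecast with the ordinary logarithmic score and with a baseline-weighted variant that reduces to the log-likelihood ratio between forecast and baseline; the numerical values are made up.

```python
import numpy as np

def log_score(p, outcome):
    """Logarithmic score of forecast p for the realized outcome (higher is better)."""
    return np.log(p[outcome])

def weighted_log_score(p, q, outcome):
    """Log score of forecast p measured relative to a baseline q.

    Equals the log-likelihood ratio log(p_i / q_i) for the realized outcome i;
    positive values mean the forecast beat the baseline.
    """
    return np.log(p[outcome]) - np.log(q[outcome])

# Hypothetical three-outcome example.
forecast = np.array([0.6, 0.3, 0.1])
baseline = np.array([1 / 3, 1 / 3, 1 / 3])   # uninformative baseline
outcome = 0

print(log_score(forecast, outcome))                      # ~ -0.51
print(weighted_log_score(forecast, baseline, outcome))   # ~ +0.59
```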


Games, 2021, Vol. 12 (4), pp. 94
Author(s): Alexander Mayer, Stefan Napel

Weighted committees allow shareholders, party leaders, etc. to wield different numbers of votes or voting weights as they decide between multiple candidates by a given social choice method. We consider committees that apply scoring methods such as plurality, Borda, or antiplurality rule. Many different weights induce the same mapping from committee members' preferences to winning candidates. The numbers of respective weight equivalence classes, and hence of structurally distinct plurality committees, Borda committees, etc., differ widely. There are 6, 51, and 5 plurality, Borda, and antiplurality committees, respectively, if three players choose between three candidates, and up to 163 (229) committees for scoring rules in between plurality and Borda (Borda and antiplurality). A key implication is that plurality, Borda, and antiplurality rule are much less sensitive to weight changes than other scoring rules. We illustrate the geometry of weight equivalence classes, including a map of all Borda classes, and identify minimal integer representations.
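
To make the setting concrete, here is a small illustrative Python sketch (players, weights, and preferences are hypothetical) that applies a positional scoring vector to a weighted committee; for three candidates, plurality, Borda, and antiplurality correspond to the vectors (1, 0, 0), (2, 1, 0), and (1, 1, 0).

```python
from collections import defaultdict

def scoring_winner(preferences, weights, score_vector):
    """Winner under a positional scoring rule with weighted voters.

    preferences:  dict mapping voter -> ranking (best candidate first)
    weights:      dict mapping voter -> voting weight
    score_vector: points awarded to the 1st, 2nd, ... ranked candidate
    """
    totals = defaultdict(float)
    for voter, ranking in preferences.items():
        for position, candidate in enumerate(ranking):
            totals[candidate] += weights[voter] * score_vector[position]
    return max(totals, key=totals.get), dict(totals)

# Hypothetical three-player committee choosing among candidates a, b, c.
prefs = {"P1": ["a", "b", "c"], "P2": ["b", "c", "a"], "P3": ["c", "b", "a"]}
weights = {"P1": 5, "P2": 3, "P3": 1}

print(scoring_winner(prefs, weights, (1, 0, 0)))  # plurality
print(scoring_winner(prefs, weights, (2, 1, 0)))  # Borda
print(scoring_winner(prefs, weights, (1, 1, 0)))  # antiplurality
```

With these hypothetical weights, plurality elects a while Borda and antiplurality elect b, illustrating how the same weighted committee can act as structurally different committees under different scoring methods.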


2021
Author(s): Rasmus Anders Xerxes Persson

We compute and compare statistics of five different scoring rules for selected-response test items in which the number of keys is an arbitrary integer and the test-takers are perfectly rational agents. We consider a hypothetical test of factual recognition, in which the underlying ability that we seek to measure is the fraction of the item options that the test-taker truly recognizes (and does not merely guess correctly), assumed to be directly proportional to the test-taker's domain knowledge. From these comparisons, two of the scoring rules are singled out as superior to the others.
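
The paper's five scoring rules are not reproduced here, but the following minimal Python sketch conveys the flavour of such comparisons under one simple assumed model: a rational test-taker who can eliminate some distractors on a k-option item and guesses uniformly among the remaining options, scored by number-right scoring versus formula scoring (negative marking).

```python
def expected_item_scores(k, eliminated):
    """Expected per-item scores for a rational guesser.

    k:          number of answer options
    eliminated: number of distractors the test-taker can rule out
                (a crude stand-in for partial domain knowledge)
    Returns expected scores under number-right scoring (1 / 0) and
    formula scoring (1 for a correct answer, -1/(k - 1) for an incorrect one).
    """
    p_correct = 1.0 / (k - eliminated)            # uniform guess over the rest
    number_right = p_correct
    formula = p_correct - (1 - p_correct) / (k - 1)
    return number_right, formula

for e in range(4):  # 4-option item, 0..3 distractors eliminated
    print(e, expected_item_scores(4, e))
```

Under formula scoring, blind guessing (no distractors eliminated) has zero expected value, which is the usual motivation for negative marking.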


2021, Vol. 8 (24), pp. 297-301
Author(s): Jonas Brehmer

Proper scoring rules enable decision-theoretically principled comparisons of probabilistic forecasts. New scoring rules can be constructed by identifying the predictive distribution with an element of a parametric family and then applying a known scoring rule. We introduce a condition which ensures propriety in this construction and thereby obtain novel proper scoring rules.
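
A classical instance of this construction is the Dawid-Sebastiani score, obtained by identifying the forecast with the Gaussian matching its first two moments and then applying the logarithmic score; the sketch below (with made-up ensemble data) illustrates only that construction, not the new rules derived in the paper.

```python
import numpy as np

def dawid_sebastiani_score(forecast_sample, y):
    """Dawid-Sebastiani score (lower is better).

    Identifies the forecast with the Gaussian matching its sample mean and
    variance, then applies the (negatively oriented) logarithmic score to
    that Gaussian, dropping additive constants.
    """
    mu = np.mean(forecast_sample)
    sigma = np.std(forecast_sample, ddof=1)
    return ((y - mu) / sigma) ** 2 + 2.0 * np.log(sigma)

rng = np.random.default_rng(0)
ensemble = rng.normal(loc=1.0, scale=2.0, size=10_000)  # hypothetical forecast ensemble
print(dawid_sebastiani_score(ensemble, y=1.5))
```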


Author(s): Julie Bessac, Philippe Naveau

The field of statistics has become one of the mathematical foundations of forecast evaluation, especially with regard to computing scoring rules. The classical paradigm of scoring rules is to discriminate between two different forecasts by comparing them with observations, with the probability distribution of the observed record assumed to be a perfect verification benchmark. In practice, however, observations are almost always tainted by errors and uncertainties. These may be due to homogenization problems, instrumental deficiencies, the need for indirect reconstructions from other sources (e.g., radar data), model errors in gridded products such as reanalyses, or other data-recording issues. If the yardstick used to compare forecasts is imprecise, one may wonder whether such errors have a strong influence on decisions based on classical scoring rules. We propose a new scoring scheme for models that incorporate errors in the verification data. We build on existing scoring rules and account for uncertainty and error in the verification data through a hidden variable and the conditional expectation of scores viewed as random variables. The proposed scoring framework is applied to standard setups, mainly an additive Gaussian noise model and a multiplicative gamma noise model. These classical examples provide known and tractable conditional distributions and consequently allow us to interpret explicit expressions of our score. Treating scores as random variables gives access to their entire distribution; in particular, we illustrate that the commonly used mean score can be a misleading summary when the distribution is highly skewed or heavy-tailed. In a simulation study, through the power of a statistical test, we demonstrate that the proposed score discriminates better between forecasts than the scores used in practice when the verification data are subject to uncertainty. We then demonstrate the benefit of accounting for verification-data uncertainty in the scoring procedure on a dataset of surface wind speed from measurements and numerical model outputs. Finally, we discuss the use of the proposed scoring framework when conditional distributions are not available in explicit form.
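
As a minimal sketch of the hidden-variable idea, the following Python code estimates the conditional expectation of the logarithmic score given a noisy observation under an additive Gaussian error model, using the simplifying (assumed) flat-prior posterior X | Y = y ~ N(y, sigma_noise^2); this illustrates the general scheme, not the authors' exact derivations.

```python
import numpy as np
from scipy.stats import norm

def noise_adjusted_log_score(forecast_mu, forecast_sigma, y_obs, noise_sigma,
                             n_draws=100_000, rng=None):
    """Monte Carlo estimate of E[ S(F, X) | Y = y_obs ] for the log score,
    where Y = X + eps, eps ~ N(0, noise_sigma^2).

    Assumption of this sketch: a flat prior on the hidden true value X, so
    X | Y = y_obs ~ N(y_obs, noise_sigma^2). The forecast F is Gaussian with
    mean forecast_mu and standard deviation forecast_sigma.
    """
    rng = rng or np.random.default_rng(0)
    x_draws = rng.normal(loc=y_obs, scale=noise_sigma, size=n_draws)
    scores = norm.logpdf(x_draws, loc=forecast_mu, scale=forecast_sigma)
    return scores.mean()

naive = norm.logpdf(2.0, loc=0.0, scale=1.5)                      # score against the raw observation
adjusted = noise_adjusted_log_score(0.0, 1.5, y_obs=2.0, noise_sigma=0.5)
print(naive, adjusted)
```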


Author(s): Sushmita Gupta, Pallavi Jain, Saket Saurabh, Nimrod Talmon

Multiwinner elections have proven to be a fruitful research topic with many real-world applications. We contribute to this line of research by improving the state of the art regarding the computational complexity of computing good committees. More formally, given a set of candidates C, a set of voters V, each ranking the candidates according to their preferences, and an integer k, a multiwinner voting rule identifies a committee of size k based on these voter preferences. In this paper, we consider several utilitarian and egalitarian OWA (ordered weighted average) scoring rules, an extensively researched family of rules (and a subfamily of the committee scoring rules). First, we improve the result of Betzler et al. [JAIR, 2013], which gave an O(n^n) algorithm for computing a winner under the Chamberlin-Courant (CC) rule, where n is the number of voters, to a running time of O(2^n), which is optimal. Furthermore, we study the parameterized complexity of the Pessimist voting rule and describe several tractable and intractable cases. Beyond such utilitarian voting rules, we extend our study to the egalitarian median and egalitarian mean rules (both committee scoring rules), showing tractable and intractable results based on nontrivial structural observations.
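
For orientation, the sketch below gives a brute-force computation of a Chamberlin-Courant winner with Borda misrepresentation scores on hypothetical preferences; it enumerates all size-k committees and is exponential in the number of candidates, unlike the improved voter-parameterized algorithms the paper develops.

```python
from itertools import combinations

def chamberlin_courant_winner(preferences, candidates, k):
    """Brute-force utilitarian Chamberlin-Courant winner.

    Each voter's satisfaction with a committee is the Borda score of their
    best-ranked committee member; we return a size-k committee maximizing
    total satisfaction. Illustrative only.
    """
    m = len(candidates)
    best_committee, best_value = None, -1
    for committee in combinations(candidates, k):
        total = 0
        for ranking in preferences:              # ranking: best candidate first
            positions = [ranking.index(c) for c in committee]
            total += (m - 1) - min(positions)    # Borda score of the best member
        if total > best_value:
            best_committee, best_value = committee, total
    return best_committee, best_value

# Hypothetical profile with 3 voters and 4 candidates.
prefs = [["a", "b", "c", "d"], ["b", "a", "d", "c"], ["c", "d", "a", "b"]]
print(chamberlin_courant_winner(prefs, ["a", "b", "c", "d"], k=2))
```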


2021, Vol. 0 (0)
Author(s): Edward Wheatcroft

A scoring rule is a function of a probabilistic forecast and a corresponding outcome, used to evaluate forecast performance. There is some debate as to which scoring rules are most appropriate for evaluating forecasts of sporting events. This paper focuses on forecasts of the outcomes of football matches. The ranked probability score (RPS) is often recommended because it is 'sensitive to distance', that is, it takes the ordering of the outcomes into account (a home win is 'closer' to a draw than to an away win). This paper disputes that reasoning on the grounds that sensitivity to distance adds nothing in terms of the usual aims of using scoring rules. A local scoring rule is one that takes only the probability placed on the realized outcome into consideration. Two simulation experiments compare the performance of the RPS, which is non-local and sensitive to distance; the Brier score, which is non-local and insensitive to distance; and the Ignorance score, which is local and insensitive to distance. The Ignorance score outperforms both the RPS and the Brier score, casting doubt on the value of non-locality and sensitivity to distance as properties of scoring rules in this context.
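
For reference, the three scores compared in the paper can be written down in a few lines; the sketch below uses common unnormalized definitions (conventions differ by constant factors), and the forecast values are hypothetical, with outcomes ordered home win, draw, away win.

```python
import numpy as np

def brier_score(p, outcome):
    """Multicategory Brier score (lower is better)."""
    o = np.zeros_like(p); o[outcome] = 1.0
    return np.sum((p - o) ** 2)

def ranked_probability_score(p, outcome):
    """RPS: squared differences of cumulative forecast and outcome
    probabilities, hence sensitive to the ordering of the outcomes."""
    o = np.zeros_like(p); o[outcome] = 1.0
    return np.sum((np.cumsum(p) - np.cumsum(o)) ** 2)

def ignorance_score(p, outcome):
    """Ignorance (logarithmic) score in bits; local, since it depends only
    on the probability placed on the realized outcome."""
    return -np.log2(p[outcome])

# Hypothetical match forecast over (home win, draw, away win); a draw occurs.
forecast = np.array([0.5, 0.3, 0.2])
outcome = 1
for score in (brier_score, ranked_probability_score, ignorance_score):
    print(score.__name__, score(forecast, outcome))
```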

