Treatment of sample under-representation and skewed heavy-tailed distributions in survey-based microsimulation: An analysis of redistribution effects in compulsory health care insurance in Switzerland

2020
Vol 14 (3-4)
pp. 267-304
Author(s):  
Tobias Schoch ◽  
André Müller

Abstract
The credibility of microsimulation modeling among the research community and policymakers depends on high-quality baseline surveys. Quality problems with the baseline survey tend to impair the quality of microsimulations built on top of the survey data. We address two potential issues, both of which relate to skewed and heavy-tailed distributions.

First, we find that ultra-high-income households are under-represented in the baseline household survey. Moreover, the sample estimate of average income underestimates the known population average. Although the Deville–Särndal calibration method corrects the under-representation, it cannot align the estimated average income in the right tail of the distribution with known population values without distorting the empirical income distribution. To overcome this problem, we introduce a Pareto tail model. With the help of the tail model, we can adjust the sample income distribution in the tail to meet the alignment targets. Our method can be a useful tool for microsimulation modelers working with survey income data.

The second contribution concerns the treatment of an outlier-prone variable that has been added to the survey by record linkage (our empirical example is health care cost). The nature of the baseline survey is not affected by record linkage, that is, the baseline survey still covers only a small part of the population. Hence, the sampling weights are relatively large. An outlying observation together with a high sampling weight can heavily influence or even ruin an estimate of a population characteristic. Thus, we argue that it is beneficial, in terms of mean square error, to use robust estimation and alignment methods, because robust methods are less affected by the presence of outliers.
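A minimal sketch of the tail-adjustment idea on synthetic data, not the authors' implementation: fit a Pareto model to survey incomes above a high threshold and solve for the shape parameter that would reproduce an external alignment target for the tail mean. Names such as incomes, weights, threshold, and target_tail_mean are illustrative assumptions.

```python
# Sketch: Pareto tail fit to survey incomes and alignment of the tail mean.
import numpy as np

rng = np.random.default_rng(42)
# Synthetic survey incomes with a heavy right tail, plus survey sampling weights.
incomes = np.exp(rng.normal(11.0, 0.7, size=5000))
weights = rng.uniform(50, 500, size=5000)

threshold = np.quantile(incomes, 0.99)              # tail cut-off (e.g. top 1%)
tail = incomes[incomes > threshold]
tail_w = weights[incomes > threshold]

# Weighted Hill-type estimate of the Pareto shape parameter alpha.
alpha_hat = tail_w.sum() / np.sum(tail_w * np.log(tail / threshold))

# Mean implied by a Pareto(alpha, threshold) tail, defined only for alpha > 1.
implied_tail_mean = alpha_hat * threshold / (alpha_hat - 1.0)

# If an external (register-based) target for the tail mean were known, the shape
# parameter follows from target = alpha * u / (alpha - 1), i.e. alpha = m / (m - u).
target_tail_mean = 1.1 * implied_tail_mean          # hypothetical alignment target
alpha_aligned = target_tail_mean / (target_tail_mean - threshold)

print(f"alpha_hat={alpha_hat:.2f}, implied tail mean={implied_tail_mean:,.0f}")
print(f"alpha needed to hit the alignment target: {alpha_aligned:.2f}")
```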

2016
Vol 91 (1-2)
pp. 141-159
Author(s):  
Arthur Charpentier ◽  
Emmanuel Flachaire

Standard kernel density estimation methods are very often used in practice to estimate density functions. They work well in numerous cases. However, they are known not to work so well with skewed, multimodal, and heavy-tailed distributions. Such features are common in income distributions, which are defined over the positive support. In this paper, we show that a preliminary logarithmic transformation of the data, combined with standard kernel density estimation methods, can provide a much better fit of the density.
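A minimal sketch of the log-transformation approach on simulated lognormal income data, not the authors' code: estimate a standard kernel density on log(income), then map it back to the income scale by a change of variables. The data and the default bandwidth are illustrative assumptions.

```python
# Sketch: kernel density estimation on the log scale, back-transformed to incomes.
import numpy as np
from scipy.stats import gaussian_kde, lognorm

rng = np.random.default_rng(0)
income = lognorm(s=0.9, scale=30000).rvs(2000, random_state=rng)

# Standard Gaussian KDE applied to log-income.
log_kde = gaussian_kde(np.log(income))

def density_on_income_scale(x):
    """f_X(x) = f_Y(log x) / x: the back-transformed density, for x > 0."""
    x = np.asarray(x, dtype=float)
    return log_kde(np.log(x)) / x

grid = np.linspace(1e3, 2e5, 200)
fitted = density_on_income_scale(grid)
true = lognorm(s=0.9, scale=30000).pdf(grid)
print("max abs error on grid:", np.max(np.abs(fitted - true)))
```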


2014
Vol 8 (4)
pp. 619-652
Author(s):  
Joseph C. Gardiner ◽  
Zhehui Luo ◽  
Xiaoqin Tang ◽  
R.V. Ramamoorthi

2006
Vol 175 (4S)
pp. 65-65
Author(s):  
Tracey L. Krupski ◽  
Kathleen A. Foley ◽  
Onur Baser ◽  
Stacey R. Long ◽  
David Macarios ◽  
...  

Author(s):  
Stefan Thurner ◽  
Rudolf Hanel ◽  
Peter Klimek

Phenomena, systems, and processes are rarely purely deterministic, but contain stochastic, probabilistic, or random components. For that reason, a probabilistic description of most phenomena is necessary. Probability theory provides us with the tools for this task. Here, we provide a crash course on the most important notions of probability and random processes, such as odds, probability, expectation, variance, and so on. We describe the most elementary stochastic event, the trial, and develop the notion of urn models. We discuss basic facts about random variables and the elementary operations that can be performed on them. We learn how to compose simple stochastic processes from elementary stochastic events, and discuss random processes as temporal sequences of trials, such as Bernoulli and Markov processes. We touch upon the basic logic of Bayesian reasoning. We discuss a number of classical distribution functions, including power laws and other fat- or heavy-tailed distributions.
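An illustrative sketch of the elementary processes mentioned here: a Bernoulli process, a two-state Markov chain, and heavy-tailed (Pareto) draws via the inverse-transform method. All parameter values are arbitrary examples, not taken from the text.

```python
# Sketch: elementary stochastic processes and a heavy-tailed sample.
import numpy as np

rng = np.random.default_rng(1)

# Bernoulli process: independent trials with success probability p.
p = 0.3
bernoulli = rng.random(10_000) < p
print("empirical mean vs p:", bernoulli.mean(), p)

# Two-state Markov chain with transition matrix P[i, j] = P(next=j | current=i).
P = np.array([[0.9, 0.1],
              [0.4, 0.6]])
state, chain = 0, []
for _ in range(10_000):
    state = rng.choice(2, p=P[state])
    chain.append(state)
print("share of time in state 1:", np.mean(chain))   # approaches the stationary share 0.2

# Heavy-tailed (Pareto, x_m = 1) sample via inverse transform: x = (1 - u)^(-1/alpha).
alpha = 1.5
pareto = (1.0 - rng.random(10_000)) ** (-1.0 / alpha)
print("sample max (heavy tail => occasional huge draws):", pareto.max())
```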


Entropy
2021
Vol 23 (1)
pp. 70
Author(s):  
Mei Ling Huang ◽  
Xiang Raney-Yan

The high quantile estimation of heavy-tailed distributions has many important applications. There are theoretical difficulties in studying heavy-tailed distributions since they often have infinite moments. There are also bias issues with the existing methods for confidence intervals (CIs) of high quantiles. This paper proposes a new estimator for high quantiles based on the geometric mean. The new estimator has good asymptotic properties and provides a computational algorithm for estimating confidence intervals of high quantiles. The new estimator avoids these difficulties, improves efficiency, and reduces bias. Comparisons of the efficiencies and biases of the new estimator relative to existing estimators are studied. The theoretical results are confirmed through Monte Carlo simulations. Finally, applications to two real-world examples are provided.
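For orientation only, a sketch of a classical Hill/Weissman-type extrapolation for a high quantile of a heavy-tailed sample; this is a standard baseline, not the geometric-mean estimator proposed in the paper. The simulated sample, the level p, and the number of upper order statistics k are arbitrary assumptions.

```python
# Sketch: Weissman-type high quantile estimate from the k largest order statistics.
import numpy as np

def weissman_quantile(x, p, k):
    """Estimate the (1 - p) quantile by Pareto-tail extrapolation."""
    x = np.sort(x)
    n = len(x)
    tail = x[n - k:]
    u = x[n - k - 1]                                  # threshold order statistic
    gamma_hat = np.mean(np.log(tail) - np.log(u))     # Hill estimate of 1/alpha
    return u * (k / (n * p)) ** gamma_hat

rng = np.random.default_rng(7)
alpha = 2.0
x = rng.pareto(alpha, size=5000) + 1.0                # standard Pareto(alpha) sample

p = 0.001                                             # target: 99.9th percentile
true_q = (1.0 / p) ** (1.0 / alpha)
print("Weissman estimate:", weissman_quantile(x, p, k=200))
print("true quantile    :", true_q)
```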

