scholarly journals ON DERIVING INSTITUTIONAL NORMS FROM DATA COLLECTED THROUGH COMPLEX SAMPLE SURVEYS

1981 ◽  
Vol 1981 (2) ◽  
pp. i-16
Author(s):  
Henry I. Braun
2012 ◽  
Vol 32 (9) ◽  
pp. 1509-1523 ◽  
Author(s):  
Huiping Xu ◽  
Joanne Daggy ◽  
Danni Yu ◽  
Bruce A. Craig ◽  
Laura Sands

2015 ◽  
Vol 31 (2) ◽  
pp. 177-203 ◽  
Author(s):  
Diego Zardetto

Abstract ReGenesees is a new software system for design-based and model-assisted analysis of complex sample surveys, based on R. As compared to traditional estimation platforms, it ensures easier and safer usage and achieves a dramatic reduction in user workload for both the calibration and the variance estimation tasks. Indeed, ReGenesees allows the specification of calibration models in a symbolic way, using R model formulae. Driven by this symbolic metadata, the system automatically and transparently generates the right values and formats for the auxiliary variables at the sample level, and assists the user in defining and calculating the corresponding population totals. Moreover, ReGenesees can handle arbitrary complex estimators, provided they can be expressed as differentiable functions of Horvitz-Thompson or calibration estimators of totals. Complex estimators can be defined in a completely free fashion: the user only needs to provide the system with the symbolic expression of the estimator as a mathematical function. ReGenesees is in fact able to automatically linearize such complex estimators, so that the estimation of their variance comes at no cost at all to the user. Remarkably, all the innovative features sketched above leverage a particular strong point of the R programming language, namely its ability to process symbolic information.


2019 ◽  
Vol 7 (3) ◽  
pp. 334-364 ◽  
Author(s):  
Carolina Franco ◽  
Roderick J A Little ◽  
Thomas A Louis ◽  
Eric V Slud

Abstract The most widespread method of computing confidence intervals (CIs) in complex surveys is to add and subtract the margin of error (MOE) from the point estimate, where the MOE is the estimated standard error multiplied by the suitable Gaussian quantile. This Wald-type interval is used by the American Community Survey (ACS), the largest US household sample survey. For inferences on small proportions with moderate sample sizes, this method often results in marked under-coverage and lower CI endpoint less than 0. We assess via simulation the coverage and width, in complex sample surveys, of seven alternatives to the Wald interval for a binomial proportion with sample size replaced by the ‘effective sample size,’ that is, the sample size divided by the design effect. Building on previous work by the present authors, our simulations address the impact of clustering, stratification, different stratum sampling fractions, and stratum-specific proportions. We show that all intervals undercover when there is clustering and design effects are computed from a simple design-based estimator of sampling variance. Coverage can be better calibrated for the alternatives to Wald by improving estimation of the effective sample size through superpopulation modeling. This approach is more effective in our simulations than previously proposed modifications of effective sample size. We recommend intervals of the Wilson or Bayes uniform prior form, with the Jeffreys prior interval not far behind.


Sign in / Sign up

Export Citation Format

Share Document