scholarly journals Comparing the Ability of Regression Modeling and Bayesian Additive Regression Trees to Predict Costs in a Responsive Survey Design Context

2020 ◽  
Vol 36 (4) ◽  
pp. 907-931
Author(s):  
James Wagner ◽  
Brady T. West ◽  
Michael R. Elliott ◽  
Stephanie Coffey

AbstractResponsive survey designs rely upon incoming data from the field data collection to optimize cost and quality tradeoffs. In order to make these decisions in real-time, survey managers rely upon monitoring tools that generate proxy indicators for cost and quality. There is a developing literature on proxy indicators for the risk of nonresponse bias. However, there is very little research on proxy indicators for costs and almost none aimed at predicting costs under alternative design strategies. Predictions of survey costs and proxy error indicators can be used to optimize survey designs in real time. Using data from the National Survey of Family Growth, we evaluate alternative modeling strategies aimed at predicting survey costs (specifically, interviewer hours). The models include multilevel regression (with random interviewer effects) and Bayesian Additive Regression Trees (BART).

2019 ◽  
Vol 113 (4) ◽  
pp. 1060-1065 ◽  
Author(s):  
JAMES BISBEE

Multilevel regression and post-stratification (MRP) is the current gold standard for extrapolating opinion data from nationally representative surveys to smaller geographic units. However, innovations in nonparametric regularization methods can further improve the researcher’s ability to extrapolate opinion data to a geographic unit of interest. I test an ensemble of regularization algorithms and find that there is room for substantial improvement on the multilevel model via more flexible methods of regularization. I propose a modified version of MRP that replaces the multilevel model with a nonparametric approach called Bayesian additive regression trees (BART or, when combined with post-stratification, BARP). I compare both methods across a number of data contexts, demonstrating the benefits of applying more powerful regularization methods to extrapolate opinion data to target geographical units. I provide an R package that implements the BARP method.


2010 ◽  
Vol 4 (1) ◽  
pp. 266-298 ◽  
Author(s):  
Hugh A. Chipman ◽  
Edward I. George ◽  
Robert E. McCulloch

2021 ◽  
pp. 395-414
Author(s):  
Carlos M. Carvalho ◽  
Edward I. George ◽  
P. Richard Hahn ◽  
Robert E. McCulloch

2020 ◽  
Vol 29 (11) ◽  
pp. 3218-3234 ◽  
Author(s):  
Liangyuan Hu ◽  
Chenyang Gu ◽  
Michael Lopez ◽  
Jiayi Ji ◽  
Juan Wisnivesky

There is a dearth of robust methods to estimate the causal effects of multiple treatments when the outcome is binary. This paper uses two unique sets of simulations to propose and evaluate the use of Bayesian additive regression trees in such settings. First, we compare Bayesian additive regression trees to several approaches that have been proposed for continuous outcomes, including inverse probability of treatment weighting, targeted maximum likelihood estimator, vector matching, and regression adjustment. Results suggest that under conditions of non-linearity and non-additivity of both the treatment assignment and outcome generating mechanisms, Bayesian additive regression trees, targeted maximum likelihood estimator, and inverse probability of treatment weighting using generalized boosted models provide better bias reduction and smaller root mean squared error. Bayesian additive regression trees and targeted maximum likelihood estimator provide more consistent 95% confidence interval coverage and better large-sample convergence property. Second, we supply Bayesian additive regression trees with a strategy to identify a common support region for retaining inferential units and for avoiding extrapolating over areas of the covariate space where common support does not exist. Bayesian additive regression trees retain more inferential units than the generalized propensity score-based strategy, and shows lower bias, compared to targeted maximum likelihood estimator or generalized boosted model, in a variety of scenarios differing by the degree of covariate overlap. A case study examining the effects of three surgical approaches for non-small cell lung cancer demonstrates the methods.


2019 ◽  
Vol 38 (25) ◽  
pp. 5048-5069 ◽  
Author(s):  
Yaoyuan Vincent Tan ◽  
Jason Roy

Sign in / Sign up

Export Citation Format

Share Document