scholarly journals Optimal bandwidth choice for robust bias-corrected inference in regression discontinuity designs

2019 ◽  
Vol 23 (2) ◽  
pp. 192-210 ◽  
Author(s):  
Sebastian Calonico ◽  
Matias D Cattaneo ◽  
Max H Farrell

Summary Modern empirical work in regression discontinuity (RD) designs often employs local polynomial estimation and inference with a mean square error (MSE) optimal bandwidth choice. This bandwidth yields an MSE-optimal RD treatment effect estimator, but is by construction invalid for inference. Robust bias-corrected (RBC) inference methods are valid when using the MSE-optimal bandwidth, but we show that they yield suboptimal confidence intervals in terms of coverage error. We establish valid coverage error expansions for RBC confidence interval estimators and use these results to propose new inference-optimal bandwidth choices for forming these intervals. We find that the standard MSE-optimal bandwidth for the RD point estimator is too large when the goal is to construct RBC confidence intervals with the smaller coverage error rate. We further optimize the constant terms behind the coverage error to derive new optimal choices for the auxiliary bandwidth required for RBC inference. Our expansions also establish that RBC inference yields higher-order refinements (relative to traditional undersmoothing) in the context of RD designs. Our main results cover sharp and sharp kink RD designs under conditional heteroskedasticity, and we discuss extensions to fuzzy and other RD designs, clustered sampling, and pre-intervention covariates adjustments. The theoretical findings are illustrated with a Monte Carlo experiment and an empirical application, and the main methodological results are available in R and Stata packages.

Author(s):  
Matias D. Cattaneo ◽  
Rocío Titiunik ◽  
Gonzalo Vazquez-Bare

In this article, we introduce two commands, rdpow and rdsampsi, that conduct power calculations and survey sample selection when using local polynomial estimation and inference methods in regression-discontinuity designs. rdpow conducts power calculations using modern robust bias-corrected local polynomial inference procedures and allows for new hypothetical sample sizes and bandwidth selections, among other features. rdsampsi uses power calculations to compute the minimum sample size required to achieve a desired level of power, given estimated or user-supplied bandwidths, biases, and variances. Together, these commands are useful when devising new experiments or surveys in regression-discontinuity designs, which will later be analyzed using modern local polynomial techniques for estimation, inference, and falsification. Because our commands use the communitycontributed (and R) package rdrobust for the underlying bandwidths, biases, and variances estimation, all the options currently available in rdrobust can also be used for power calculations and sample-size selection, including preintervention covariate adjustment, clustered sampling, and many bandwidth selectors. Finally, we also provide companion R functions with the same syntax and capabilities.


2020 ◽  
pp. 1-45
Author(s):  
Feng Yao ◽  
Taining Wang

We propose a nonparametric test of significant variables in the partial derivative of a regression mean function. The derivative is estimated by local polynomial estimation and the test statistic is constructed through a variation-based measure of the derivative in the direction of variables of interest. We establish the asymptotic null distribution of the test statistic and demonstrate that it is consistent. Motivated by the null distribution, we propose a wild bootstrap test, and show that it exhibits the same null distribution, whether the null is valid or not. We perform a Monte Carlo study to demonstrate its encouraging finite sample performance. An empirical application is conducted showing how the test can be applied to infer certain aspects of regression structures in a hedonic price model.


2018 ◽  
Vol 108 (8) ◽  
pp. 2277-2304 ◽  
Author(s):  
Michal Kolesár ◽  
Christoph Rothe

We consider inference in regression discontinuity designs when the running variable only takes a moderate number of distinct values. In particular, we study the common practice of using confidence intervals (CIs) based on standard errors that are clustered by the running variable as a means to make inference robust to model misspecification (Lee and Card 2008). We derive theoretical results and present simulation and empirical evidence showing that these CIs do not guard against model misspecification, and that they have poor coverage properties. We therefore recommend against using these CIs in practice. We instead propose two alternative CIs with guaranteed coverage properties under easily interpretable restrictions on the conditional expectation function. (JEL C13, C51, J13, J31, J64, J65)


Test ◽  
2011 ◽  
Vol 20 (3) ◽  
pp. 653-677 ◽  
Author(s):  
Han-Ying Liang ◽  
Jacobo de Uña-Álvarez ◽  
María del Carmen Iglesias-Pérez

2002 ◽  
Vol 18 (2) ◽  
pp. 227-241 ◽  
Author(s):  
Jan Beran ◽  
Yuanhua Feng ◽  
Sucharita Ghosh ◽  
Philipp Sibbertsen

Sign in / Sign up

Export Citation Format

Share Document