Optimal bandwidth choice for robust bias-corrected inference in regression discontinuity designs

Summary: Modern empirical work in regression discontinuity (RD) designs often employs local polynomial estimation and inference with a mean square error (MSE) optimal bandwidth choice. This bandwidth yields an MSE-optimal RD treatment effect estimator, but is by construction invalid for inference. Robust bias-corrected (RBC) inference methods are valid when using the MSE-optimal bandwidth, but we show that they yield suboptimal confidence intervals in terms of coverage error. We establish valid coverage error expansions for RBC confidence interval estimators and use these results to propose new inference-optimal bandwidth choices for forming these intervals. We find that the standard MSE-optimal bandwidth for the RD point estimator is too large when the goal is to construct RBC confidence intervals with the smaller coverage error rate. We further optimize the constant terms behind the coverage error to derive new optimal choices for the auxiliary bandwidth required for RBC inference. Our expansions also establish that RBC inference yields higher-order refinements (relative to traditional undersmoothing) in the context of RD designs. Our main results cover sharp and sharp kink RD designs under conditional heteroskedasticity, and we discuss extensions to fuzzy and other RD designs, clustered sampling, and pre-intervention covariate adjustments. The theoretical findings are illustrated with a Monte Carlo experiment and an empirical application, and the main methodological results are available in R and Stata packages.
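
As a rough illustration of the local polynomial point estimator that the summary above takes as its starting point, the following Python sketch fits a kernel-weighted local linear regression on each side of the cutoff and takes the difference of the two intercepts. It is only a minimal sketch under stated assumptions: the function names, the triangular kernel, and the user-supplied bandwidth `h` are illustrative choices, and the code implements neither the robust bias correction nor the inference-optimal bandwidth selectors proposed in the paper.

```python
def triangular_weight(u):
    """Triangular kernel weight; zero outside |u| < 1 (illustrative kernel choice)."""
    return max(0.0, 1.0 - abs(u))


def local_linear_intercept(xs, ys, cutoff, h):
    """Kernel-weighted least squares of y on (1, x - cutoff); returns the intercept."""
    s0 = s1 = s2 = t0 = t1 = 0.0
    for x, y in zip(xs, ys):
        d = x - cutoff
        w = triangular_weight(d / h)
        s0 += w
        s1 += w * d
        s2 += w * d * d
        t0 += w * y
        t1 += w * d * y
    det = s0 * s2 - s1 * s1
    if det <= 0.0:
        raise ValueError("need at least two distinct points inside the bandwidth")
    # Closed-form solution of the 2x2 weighted normal equations for the intercept.
    return (s2 * t0 - s1 * t1) / det


def sharp_rd_estimate(xs, ys, cutoff, h):
    """Sharp RD point estimate: right-side intercept minus left-side intercept.

    The bandwidth h is supplied by the caller (a hypothetical choice here);
    no bias correction or robust standard error is computed.
    """
    left = [(x, y) for x, y in zip(xs, ys) if x < cutoff]
    right = [(x, y) for x, y in zip(xs, ys) if x >= cutoff]
    mu_left = local_linear_intercept([p[0] for p in left], [p[1] for p in left], cutoff, h)
    mu_right = local_linear_intercept([p[0] for p in right], [p[1] for p in right], cutoff, h)
    return mu_right - mu_left


# Noise-free example: a linear outcome with a jump of 3 at the cutoff 0.
x = [i / 50 for i in range(-50, 51)]
y = [1.0 + 2.0 * xi + (3.0 if xi >= 0 else 0.0) for xi in x]
estimate = sharp_rd_estimate(x, y, cutoff=0.0, h=0.5)
```

Because the example data are exactly linear on each side of the cutoff, the local linear fit recovers the jump up to floating-point error; with real data the estimate depends on the bandwidth, which is the choice the paper studies.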

Power calculations for regression-discontinuity designs

The Stata Journal: Promoting communications on statistics and Stata ◽

10.1177/1536867x19830919 ◽

2019 ◽

Vol 19 (1) ◽

pp. 210-245 ◽

Cited By ~ 10

Author(s):

Matias D. Cattaneo ◽

Rocío Titiunik ◽

Gonzalo Vazquez-Bare

Keyword(s):

Sample Size ◽

Regression Discontinuity ◽

Sample Selection ◽

R Package ◽

Power Calculations ◽

Local Polynomial ◽

Regression Discontinuity Designs ◽

Local Polynomial Estimation ◽

Polynomial Estimation ◽

Inference Methods

In this article, we introduce two commands, rdpow and rdsampsi, that conduct power calculations and survey sample selection when using local polynomial estimation and inference methods in regression-discontinuity designs. rdpow conducts power calculations using modern robust bias-corrected local polynomial inference procedures and allows for new hypothetical sample sizes and bandwidth selections, among other features. rdsampsi uses power calculations to compute the minimum sample size required to achieve a desired level of power, given estimated or user-supplied bandwidths, biases, and variances. Together, these commands are useful when devising new experiments or surveys in regression-discontinuity designs, which will later be analyzed using modern local polynomial techniques for estimation, inference, and falsification. Because our commands use the community-contributed (and R) package rdrobust for the underlying bandwidths, biases, and variances estimation, all the options currently available in rdrobust can also be used for power calculations and sample-size selection, including preintervention covariate adjustment, clustered sampling, and many bandwidth selectors. Finally, we also provide companion R functions with the same syntax and capabilities.
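
The kind of calculation described in this abstract can be sketched with the textbook power formula for a two-sided test based on an asymptotically normal estimator. The Python sketch below is not the rdpow or rdsampsi implementation: the function names are hypothetical, the bias of the estimator is ignored, and the standard error is assumed to shrink as `unit_sd / sqrt(n)`, which is a simplification of how the actual commands use estimated bandwidths, biases, and variances.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal distribution from the standard library


def two_sided_power(tau, se, alpha=0.05):
    """Power of a two-sided level-alpha test of H0: effect = 0 when the true
    effect is tau and the estimator is approximately N(tau, se^2)."""
    z = _STD_NORMAL.inv_cdf(1 - alpha / 2)
    shift = tau / se
    return 1 - _STD_NORMAL.cdf(z - shift) + _STD_NORMAL.cdf(-z - shift)


def min_sample_size(tau, unit_sd, target_power=0.8, alpha=0.05, n_max=1_000_000):
    """Smallest n whose power reaches target_power, assuming (hypothetically)
    that the standard error equals unit_sd / sqrt(n)."""
    for n in range(2, n_max + 1):
        if two_sided_power(tau, unit_sd / n ** 0.5, alpha) >= target_power:
            return n
    raise ValueError("target power not reached within n_max observations")


# With no true effect, the power equals the size of the test.
size_check = two_sided_power(tau=0.0, se=1.0)
# Sample size needed to detect an effect of 0.5 with 80% power when unit_sd = 1.
n_needed = min_sample_size(tau=0.5, unit_sd=1.0)
```

In a regression-discontinuity setting the effective sample is the set of observations near the cutoff, so the simple `1 / sqrt(n)` scaling used here should be read only as a placeholder for the variance estimates that the commands obtain from rdrobust.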

A NONPARAMETRIC TEST OF SIGNIFICANT VARIABLES IN GRADIENTS

Econometric Theory ◽

10.1017/s0266466620000407 ◽

2020 ◽

pp. 1-45

Author(s):

Feng Yao ◽

Taining Wang

Keyword(s):

Null Distribution ◽

Monte Carlo Study ◽

Nonparametric Test ◽

Hedonic Price ◽

Finite Sample ◽

Test Statistic ◽

Bootstrap Test ◽

Local Polynomial Estimation ◽

Polynomial Estimation ◽

Mean Function

We propose a nonparametric test of significant variables in the partial derivative of a regression mean function. The derivative is estimated by local polynomial estimation and the test statistic is constructed through a variation-based measure of the derivative in the direction of variables of interest. We establish the asymptotic null distribution of the test statistic and demonstrate that it is consistent. Motivated by the null distribution, we propose a wild bootstrap test, and show that it exhibits the same null distribution, whether the null is valid or not. We perform a Monte Carlo study to demonstrate its encouraging finite sample performance. An empirical application is conducted showing how the test can be applied to infer certain aspects of regression structures in a hedonic price model.

Local Polynomial Estimation in Multiparameter Likelihood Models

Journal of the American Statistical Association ◽

10.1080/01621459.1997.10473675 ◽

1997 ◽

Vol 92 (440) ◽

pp. 1536-1545 ◽

Cited By ~ 25

Author(s):

Marc Aerts ◽

Gerda Claeskens

Keyword(s):

Local Polynomial ◽

Local Polynomial Estimation ◽

Polynomial Estimation

Local polynomial estimation in partial linear regression models under dependence

Computational Statistics & Data Analysis ◽

10.1016/j.csda.2007.10.009 ◽

2008 ◽

Vol 52 (5) ◽

pp. 2757-2777 ◽

Cited By ~ 9

Author(s):

G. Aneiros-Pérez ◽

J.M. Vilar-Fernández

Keyword(s):

Linear Regression ◽

Regression Models ◽

Linear Regression Models ◽

Local Polynomial ◽

Local Polynomial Estimation ◽

Partial Linear ◽

Polynomial Estimation

Local polynomial estimation of regression functions for mixing processes

Proceedings of 1994 Workshop on Information Theory and Statistics ◽

10.1109/wits.1994.513873 ◽

2002 ◽

Cited By ~ 2

Author(s):

E. Masry ◽

Jianqing Fan

Keyword(s):

Mixing Processes ◽

Local Polynomial ◽

Regression Functions ◽

Local Polynomial Estimation ◽

Polynomial Estimation

Weighted Averages and Local Polynomial Estimation for Fractional Linear ARCH Processes

Journal of Statistical Theory and Practice ◽

10.1080/15598608.2007.10411831 ◽

2007 ◽

Vol 1 (2) ◽

pp. 149-166 ◽

Cited By ~ 6

Author(s):

Jan Beran ◽

Yuanhua Feng

Keyword(s):

Weighted Averages ◽

Local Polynomial ◽

Local Polynomial Estimation ◽

Arch Processes ◽

Polynomial Estimation

Inference in Regression Discontinuity Designs with a Discrete Running Variable

The American Economic Review ◽

10.1257/aer.20160945 ◽

2018 ◽

Vol 108 (8) ◽

pp. 2277-2304 ◽

Cited By ~ 43

Author(s):

Michal Kolesár ◽

Christoph Rothe

Keyword(s):

Confidence Intervals ◽

Conditional Expectation ◽

Regression Discontinuity ◽

Model Misspecification ◽

Standard Errors ◽

Moderate Number ◽

Regression Discontinuity Designs ◽

The Common ◽

Present Simulation ◽

Theoretical Results

We consider inference in regression discontinuity designs when the running variable only takes a moderate number of distinct values. In particular, we study the common practice of using confidence intervals (CIs) based on standard errors that are clustered by the running variable as a means to make inference robust to model misspecification (Lee and Card 2008). We derive theoretical results and present simulation and empirical evidence showing that these CIs do not guard against model misspecification, and that they have poor coverage properties. We therefore recommend against using these CIs in practice. We instead propose two alternative CIs with guaranteed coverage properties under easily interpretable restrictions on the conditional expectation function. (JEL C13, C51, J13, J31, J64, J65)
