An Instrumental Variable Tree Approach for Detecting Heterogeneous Treatment Effects in Observational Studies

Author(s):  
Guihua Wang ◽  
Jun Li ◽  
Wallace J. Hopp
2018 ◽  
Vol 38 (3) ◽  
pp. 339-353
Author(s):  
Hang J. Kim ◽  
Bo Lu ◽  
Edward J. Nehus ◽  
Mi-Ok Kim

2021 ◽  
pp. 1-23
Author(s):  
Hiroyuki Kasahara ◽  
Katsumi Shimotsu

We study identification in nonparametric regression models with a misclassified and endogenous binary regressor when an instrument is correlated with misclassification error. We show that the regression function is nonparametrically identified if one binary instrument variable and one binary covariate satisfy the following conditions. The instrumental variable corrects endogeneity; the instrumental variable must be correlated with the unobserved true underlying binary variable, must be uncorrelated with the error term in the outcome equation, but is allowed to be correlated with the misclassification error. The covariate corrects misclassification; this variable can be one of the regressors in the outcome equation, must be correlated with the unobserved true underlying binary variable, and must be uncorrelated with the misclassification error. We also propose a mixture-based framework for modeling unobserved heterogeneous treatment effects with a misclassified and endogenous binary regressor and show that treatment effects can be identified if the true treatment effect is related to an observed regressor and another observable variable.


2020 ◽  
pp. 096228022097183
Author(s):  
Tao Liu ◽  
Joseph W Hogan

Confounding is a major concern when using data from observational studies to infer the causal effect of a treatment. Instrumental variables, when available, have been used to construct bound estimates on population average treatment effects when outcomes are binary and unmeasured confounding exists. With continuous outcomes, meaningful bounds are more challenging to obtain because the domain of the outcome is unrestricted. In this paper, we propose to unify the instrumental variable and inverse probability weighting methods, together with suitable assumptions in the context of an observational study, to construct meaningful bounds on causal treatment effects. The contextual assumptions are imposed in terms of the potential outcomes that are partially identified by data. The inverse probability weighting component incorporates a sensitivity parameter to encode the effect of unmeasured confounding. The instrumental variable and inverse probability weighting methods are unified using the principal stratification. By solving the resulting system of estimating equations, we are able to quantify both the causal treatment effect and the sensitivity parameter (i.e. the degree of the unmeasured confounding). We demonstrate our method by analyzing data from the HIV Epidemiology Research Study.


2021 ◽  
Author(s):  
Guihua Wang ◽  
Jun Li ◽  
Wallace J. Hopp

This study addresses the ubiquitous challenge of using big observational data to identify heterogeneous treatment effects. This problem arises in precision medicine, targeted marketing, personalized education, and many other environments. Identifying heterogeneous treatment effects presents several analytical challenges including high dimensionality and endogeneity issues. We develop a new instrumental variable tree (IVT) approach that incorporates the instrumental variable method into a causal tree (CT) to correct for potential endogeneity biases that may exist in observational data. Our IVT approach partitions subjects into subgroups with similar treatment effects within subgroups and different treatment effects across subgroups. The estimated treatment effects are asymptotically consistent under a set of mild assumptions. Using simulated data, we show our approach has a better coverage rate and smaller mean-squared error than the conventional CT approach. We also demonstrate that an instrumental variable forest (IVF) constructed using IVTs has better accuracy and stratification than a generalized random forest. Finally, by applying the IVF approach to an empirical assessment of laparoscopic colectomy, we demonstrate the importance of accounting for endogeneity to make accurate comparisons of the heterogeneous effects of the treatment (teaching hospitals) and control (nonteaching hospitals) on different types of patients. This paper was accepted by J. George Shanthikumar, big data analytics.


Biometrika ◽  
2020 ◽  
Author(s):  
Oliver Dukes ◽  
Stijn Vansteelandt

Summary Eliminating the effect of confounding in observational studies typically involves fitting a model for an outcome adjusted for covariates. When, as often, these covariates are high-dimensional, this necessitates the use of sparse estimators, such as the lasso, or other regularization approaches. Naïve use of such estimators yields confidence intervals for the conditional treatment effect parameter that are not uniformly valid. Moreover, as the number of covariates grows with the sample size, correctly specifying a model for the outcome is nontrivial. In this article we deal with both of these concerns simultaneously, obtaining confidence intervals for conditional treatment effects that are uniformly valid, regardless of whether the outcome model is correct. This is done by incorporating an additional model for the treatment selection mechanism. When both models are correctly specified, we can weaken the standard conditions on model sparsity. Our procedure extends to multivariate treatment effect parameters and complex longitudinal settings.


2017 ◽  
Vol 25 (4) ◽  
pp. 413-434 ◽  
Author(s):  
Justin Grimmer ◽  
Solomon Messing ◽  
Sean J. Westwood

Randomized experiments are increasingly used to study political phenomena because they can credibly estimate the average effect of a treatment on a population of interest. But political scientists are often interested in how effects vary across subpopulations—heterogeneous treatment effects—and how differences in the content of the treatment affects responses—the response to heterogeneous treatments. Several new methods have been introduced to estimate heterogeneous effects, but it is difficult to know if a method will perform well for a particular data set. Rather than using only one method, we show how an ensemble of methods—weighted averages of estimates from individual models increasingly used in machine learning—accurately measure heterogeneous effects. Building on a large literature on ensemble methods, we show how the weighting of methods can contribute to accurate estimation of heterogeneous treatment effects and demonstrate how pooling models lead to superior performance to individual methods across diverse problems. We apply the ensemble method to two experiments, illuminating how the ensemble method for heterogeneous treatment effects facilitates exploratory analysis of treatment effects.


Sign in / Sign up

Export Citation Format

Share Document