scholarly journals Recursive partitioning for heterogeneous causal effects

2016 ◽  
Vol 113 (27) ◽  
pp. 7353-7360 ◽  
Author(s):  
Susan Athey ◽  
Guido Imbens

In this paper we propose methods for estimating heterogeneity in causal effects in experimental and observational studies and for conducting hypothesis tests about the magnitude of differences in treatment effects across subsets of the population. We provide a data-driven approach to partition the data into subpopulations that differ in the magnitude of their treatment effects. The approach enables the construction of valid confidence intervals for treatment effects, even with many covariates relative to the sample size, and without “sparsity” assumptions. We propose an “honest” approach to estimation, whereby one sample is used to construct the partition and another to estimate treatment effects for each subpopulation. Our approach builds on regression tree methods, modified to optimize for goodness of fit in treatment effects and to account for honest estimation. Our model selection criterion anticipates that bias will be eliminated by honest estimation and also accounts for the effect of making additional splits on the variance of treatment effect estimates within each subpopulation. We address the challenge that the “ground truth” for a causal effect is not observed for any individual unit, so that standard approaches to cross-validation must be modified. Through a simulation study, we show that for our preferred method honest estimation results in nominal coverage for 90% confidence intervals, whereas coverage ranges between 74% and 84% for nonhonest approaches. Honest estimation requires estimating the model with a smaller sample size; the cost in terms of mean squared error of treatment effects for our preferred method ranges between 7–22%.

Biometrika ◽  
2020 ◽  
Author(s):  
Oliver Dukes ◽  
Stijn Vansteelandt

Summary Eliminating the effect of confounding in observational studies typically involves fitting a model for an outcome adjusted for covariates. When, as often, these covariates are high-dimensional, this necessitates the use of sparse estimators, such as the lasso, or other regularization approaches. Naïve use of such estimators yields confidence intervals for the conditional treatment effect parameter that are not uniformly valid. Moreover, as the number of covariates grows with the sample size, correctly specifying a model for the outcome is nontrivial. In this article we deal with both of these concerns simultaneously, obtaining confidence intervals for conditional treatment effects that are uniformly valid, regardless of whether the outcome model is correct. This is done by incorporating an additional model for the treatment selection mechanism. When both models are correctly specified, we can weaken the standard conditions on model sparsity. Our procedure extends to multivariate treatment effect parameters and complex longitudinal settings.


2009 ◽  
Vol 26 (3) ◽  
pp. 931-951 ◽  
Author(s):  
Yanqin Fan ◽  
Sang Soo Park

In this paper, we propose nonparametric estimators of sharp bounds on the distribution of treatment effects of a binary treatment and establish their asymptotic distributions. We note the possible failure of the standard bootstrap with the same sample size and apply the fewer-than-nbootstrap to making inferences on these bounds. The finite sample performances of the confidence intervals for the bounds based on normal critical values, the standard bootstrap, and the fewer-than-nbootstrap are investigated via a simulation study. Finally we establish sharp bounds on the treatment effect distribution when covariates are available.


2020 ◽  
Vol 29 (9) ◽  
pp. 2445-2469
Author(s):  
Brandon Koch ◽  
David M Vock ◽  
Julian Wolfson ◽  
Laura Boehm Vock

Unbiased estimation of causal effects with observational data requires adjustment for confounding variables that are related to both the outcome and treatment assignment. Standard variable selection techniques aim to maximize predictive ability of the outcome model, but they ignore covariate associations with treatment and may not adjust for important confounders weakly associated to outcome. We propose a novel method for estimating causal effects that simultaneously considers models for both outcome and treatment, which we call the bilevel spike and slab causal estimator (BSSCE). By using a Bayesian formulation, BSSCE estimates the posterior distribution of all model parameters and provides straightforward and reliable inference. Spike and slab priors are used on each covariate coefficient which aim to minimize the mean squared error of the treatment effect estimator. Theoretical properties of the treatment effect estimator are derived justifying the prior used in BSSCE. Simulations show that BSSCE can substantially reduce mean squared error over numerous methods and performs especially well with large numbers of covariates, including situations where the number of covariates is greater than the sample size. We illustrate BSSCE by estimating the causal effect of vasoactive therapy vs. fluid resuscitation on hypotensive episode length for patients in the Multiparameter Intelligent Monitoring in Intensive Care III critical care database.


2020 ◽  
Vol 20 (1) ◽  
Author(s):  
E. Caitlin Lloyd ◽  
Hannah M. Sallis ◽  
Bas Verplanken ◽  
Anne M. Haase ◽  
Marcus R. Munafò

Abstract Background Evidence from observational studies suggests an association between anxiety disorders and anorexia nervosa (AN), but causal inference is complicated by the potential for confounding in these studies. We triangulate evidence across a longitudinal study and a Mendelian randomization (MR) study, to evaluate whether there is support for anxiety disorder phenotypes exerting a causal effect on AN risk. Methods Study One assessed longitudinal associations of childhood worry and anxiety disorders with lifetime AN in the Avon Longitudinal Study of Parents and Children cohort. Study Two used two-sample MR to evaluate: causal effects of worry, and genetic liability to anxiety disorders, on AN risk; causal effects of genetic liability to AN on anxiety outcomes; and the causal influence of worry on anxiety disorder development. The independence of effects of worry, relative to depressed affect, on AN and anxiety disorder outcomes, was explored using multivariable MR. Analyses were completed using summary statistics from recent genome-wide association studies. Results Study One did not support an association between worry and subsequent AN, but there was strong evidence for anxiety disorders predicting increased risk of AN. Study Two outcomes supported worry causally increasing AN risk, but did not support a causal effect of anxiety disorders on AN development, or of AN on anxiety disorders/worry. Findings also indicated that worry causally influences anxiety disorder development. Multivariable analysis estimates suggested the influence of worry on both AN and anxiety disorders was independent of depressed affect. Conclusions Overall our results provide mixed evidence regarding the causal role of anxiety exposures in AN aetiology. The inconsistency between outcomes of Studies One and Two may be explained by limitations surrounding worry assessment in Study One, confounding of the anxiety disorder and AN association in observational research, and low power in MR analyses probing causal effects of genetic liability to anxiety disorders. The evidence for worry acting as a causal risk factor for anxiety disorders and AN supports targeting worry for prevention of both outcomes. Further research should clarify how a tendency to worry translates into AN risk, and whether anxiety disorder pathology exerts any causal effect on AN.


2021 ◽  
Vol 12 (3) ◽  
pp. 102
Author(s):  
Jaouad Khalfi ◽  
Najib Boumaaz ◽  
Abdallah Soulmani ◽  
El Mehdi Laadissi

The Box–Jenkins model is a polynomial model that uses transfer functions to express relationships between input, output, and noise for a given system. In this article, we present a Box–Jenkins linear model for a lithium-ion battery cell for use in electric vehicles. The model parameter identifications are based on automotive drive-cycle measurements. The proposed model prediction performance is evaluated using the goodness-of-fit criteria and the mean squared error between the Box–Jenkins model and the measured battery cell output. A simulation confirmed that the proposed Box–Jenkins model could adequately capture the battery cell dynamics for different automotive drive cycles and reasonably predict the actual battery cell output. The goodness-of-fit value shows that the Box–Jenkins model matches the battery cell data by 86.85% in the identification phase, and 90.83% in the validation phase for the LA-92 driving cycle. This work demonstrates the potential of using a simple and linear model to predict the battery cell behavior based on a complex identification dataset that represents the actual use of the battery cell in an electric vehicle.


Drones ◽  
2021 ◽  
Vol 5 (2) ◽  
pp. 37
Author(s):  
Bingsheng Wei ◽  
Martin Barczyk

We consider the problem of vision-based detection and ranging of a target UAV using the video feed from a monocular camera onboard a pursuer UAV. Our previously published work in this area employed a cascade classifier algorithm to locate the target UAV, which was found to perform poorly in complex background scenes. We thus study the replacement of the cascade classifier algorithm with newer machine learning-based object detection algorithms. Five candidate algorithms are implemented and quantitatively tested in terms of their efficiency (measured as frames per second processing rate), accuracy (measured as the root mean squared error between ground truth and detected location), and consistency (measured as mean average precision) in a variety of flight patterns, backgrounds, and test conditions. Assigning relative weights of 20%, 40% and 40% to these three criteria, we find that when flying over a white background, the top three performers are YOLO v2 (76.73 out of 100), Faster RCNN v2 (63.65 out of 100), and Tiny YOLO (59.50 out of 100), while over a realistic background, the top three performers are Faster RCNN v2 (54.35 out of 100, SSD MobileNet v1 (51.68 out of 100) and SSD Inception v2 (50.72 out of 100), leading us to recommend Faster RCNN v2 as the recommended solution. We then provide a roadmap for further work in integrating the object detector into our vision-based UAV tracking system.


Sign in / Sign up

Export Citation Format

Share Document