Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects

29 Jun 2017  ·  P. Richard Hahn, Jared S. Murray, Carlos Carvalho ·

This paper develops a semi-parametric Bayesian regression model for estimating heterogeneous treatment effects from observational data. Standard nonlinear regression models, which may work quite well for prediction, can yield badly biased estimates of treatment effects when fit to data with strong confounding. Our Bayesian causal forests model avoids this problem by directly incorporating an estimate of the propensity function in the specification of the response model, implicitly inducing a covariate-dependent prior on the regression function. This new parametrization also allows treatment heterogeneity to be regularized separately from the prognostic effect of control variables, making it possible to informatively "shrink to homogeneity", in contrast to existing Bayesian non- and semi-parametric approaches. We illustrate the benefits of this approach via the reanalysis of an observational study assessing the causal effects of smoking on medical expenditures as well as extensive simulation studies.

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper