Robust estimation of causal effects via a high-dimensional covariate balancing propensity score,Biometrika

当前位置： X-MOL 学术 › Biometrika › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Robust estimation of causal effects via a high-dimensional covariate balancing propensity score
Biometrika ( IF 2.4 ) Pub Date : 2020-06-03 , DOI: 10.1093/biomet/asaa020
Yang Ning ₁ , Peng Sida ₂ , Kosuke Imai ₃

Affiliation

In this paper, we propose a robust method to estimate the average treatment effects in observational studies when the number of potential confounders is possibly much greater than the sample size. We first use a class of penalized M-estimators for the propensity score and outcome models. We then calibrate the initial estimate of the propensity score by balancing a carefully selected subset of covariates that are predictive of the outcome. Finally, the estimated propensity score is used to construct the inverse probability weighting estimator. We prove that the proposed estimator, which has the sample boundedness property, is root-n consistent, asymptotically normal, and semiparametrically efficient when the propensity score model is correctly specified and the outcome model is linear in covariates. More importantly, we show that our estimator remains root-n consistent and asymptotically normal so long as either the propensity score model or the outcome model is correctly specified. We provide valid confidence intervals in both cases and further extend these results to the case where the outcome model is a generalized linear model. In simulation studies, we find that the proposed methodology often estimates the average treatment effect more accurately than the existing methods. We also present an empirical application, in which we estimate the average causal effect of college attendance on adulthood political participation. Open-source software is available for implementing the proposed methodology.

中文翻译：

通过高维协变量平衡倾向得分对因果效应进行稳健估计

在本文中，当潜在混杂因素的数量可能远大于样本量时，我们提出了一种稳健的方法来估计观察性研究中的平均治疗效果。我们首先使用一类受惩罚的 M 估计量来进行倾向评分和结果模型。然后，我们通过平衡精心选择的可预测结果的协变量子集来校准倾向得分的初始估计。最后，估计的倾向得分用于构建逆概率加权估计量。我们证明，当正确指定倾向评分模型并且结果模型在协变量中为线性时，所提出的估计量具有样本有界性，是根-n 一致、渐近正态和半参数有效的。更重要的是，我们表明，只要正确指定了倾向评分模型或结果模型，我们的估计量就保持 root-n 一致和渐近正态。我们在两种情况下都提供了有效的置信区间，并将这些结果进一步扩展到结果模型是广义线性模型的情况。在模拟研究中，我们发现所提出的方法通常比现有方法更准确地估计平均治疗效果。我们还提出了一个实证应用，其中我们估计了大学出勤率对成年政治参与的平均因果效应。开源软件可用于实施建议的方法。我们在两种情况下都提供了有效的置信区间，并将这些结果进一步扩展到结果模型是广义线性模型的情况。在模拟研究中，我们发现所提出的方法通常比现有方法更准确地估计平均治疗效果。我们还提出了一个实证应用，其中我们估计了大学出勤率对成年政治参与的平均因果效应。开源软件可用于实施建议的方法。我们在两种情况下都提供了有效的置信区间，并将这些结果进一步扩展到结果模型是广义线性模型的情况。在模拟研究中，我们发现所提出的方法通常比现有方法更准确地估计平均治疗效果。我们还提出了一个实证应用，其中我们估计了大学出勤率对成年政治参与的平均因果效应。开源软件可用于实施建议的方法。其中我们估计了大学出勤率对成年政治参与的平均因果效应。开源软件可用于实施建议的方法。其中我们估计了大学出勤率对成年政治参与的平均因果效应。开源软件可用于实施建议的方法。

更新日期：2020-06-03

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11