...
首页> 外文期刊>The Annals of applied statistics >ESTIMATING POPULATION AVERAGE CAUSAL EFFECTS IN THE PRESENCE OF NON-OVERLAP: THE EFFECT OF NATURAL GAS COMPRESSOR STATION EXPOSURE ON CANCER MORTALITY
【24h】

ESTIMATING POPULATION AVERAGE CAUSAL EFFECTS IN THE PRESENCE OF NON-OVERLAP: THE EFFECT OF NATURAL GAS COMPRESSOR STATION EXPOSURE ON CANCER MORTALITY

机译:在非重叠存在下估算人口平均因果效应:天然气压缩机站暴露对癌症死亡率的影响

获取原文
获取原文并翻译 | 示例

摘要

Most causal inference studies rely on the assumption of overlap to estimate population or sample average causal effects. When data suffer from non-overlap, estimation of these estimands requires reliance on model specifications due to poor data support. All existing methods to address non-overlap, such as trimming or down-weighting data in regions of poor data support, change the estimand so that inference cannot be made on the sample or the underlying population. In environmental health research settings where study results are often intended to influence policy, population-level inference may be critical and changes in the estimand can diminish the impact of the study results, because estimates may not be representative of effects in the population of interest to policymakers. Researchers may be willing to make additional, minimal modeling assumptions in order to preserve the ability to estimate population average causal effects. We seek to make two contributions on this topic. First, we propose a flexible, data-driven definition of propensity score overlap and non-overlap regions. Second, we develop a novel Bayesian framework to estimate population average causal effects with minor model dependence and appropriately large uncertainties in the presence of non-overlap and causal effect heterogeneity. In this approach the tasks of estimating causal effects in the overlap and non-overlap regions are delegated to two distinct models suited to the degree of data support in each region. Tree ensembles are used to nonparametrically estimate individual causal effects in the overlap region, where the data can speak for themselves. In the non-overlap region where insufficient data support means reliance on model specification is necessary, individual causal effects are estimated by extrapolating trends from the overlap region via a spline model. The promising performance of our method is demonstrated in simulations. Finally, we utilize our method to perform a novel investig
机译:大多数因果推断研究依靠重叠的假设来估计人口或样本平均因果效应。当数据遭受非重叠时,由于数据支持差,这些估算值需要依赖模型规范。所有现有的解决非重叠的方法,例如在数据支持不良区域中修整或下降加权数据,改变了预测,以便不能对样本或基础人群进行推断。在环境健康研究环境中,研究结果往往旨在影响政策,人口级推断可能是至关重要的,并且估计的变化可以减少研究结果的影响,因为估计可能不代表患者人口中的影响政策制定者。研究人员可能愿意进行额外的,最小的建模假设,以保持估计人口平均因果效应的能力。我们寻求对这一主题进行两项贡献。首先,我们提出了一种灵活的数据驱动的倾向分数重叠和非重叠区域的定义。其次,我们开发了一种新的贝叶斯框架,以估计较小的模型依赖性的人口平均因果影响,并在存在非重叠和因果效应异质性时适当的不确定性。在这种方法中,估算重叠和非重叠区域中的因果效应的任务被委派到适用于每个区域中的数据支持程度的两个不同模型。树集合用于非分解地估计重叠区域中的个体因果效果,其中数据可以为自己说话。在不足的数据支持意味着依赖模型规范的非重叠区域中是必要的,通过通过样条模型从重叠区域推断趋势来估计各个因果效应。我们的方法的有希望的性能在模拟中证明。最后,我们利用我们的方法来执行一个小说的Investig

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号