首页> 外文期刊>Statistics in medicine >Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases
【24h】

Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases

机译:使用医疗数据库观测数据估算异质治疗效果的比较方法

获取原文
获取原文并翻译 | 示例
           

摘要

There is growing interest in using routinely collected data from health care databases to study the safety and effectiveness of therapies in “real‐world” conditions, as it can provide complementary evidence to that of randomized controlled trials. Causal inference from health care databases is challenging because the data are typically noisy, high dimensional, and most importantly, observational. It requires methods that can estimate heterogeneous treatment effects while controlling for confounding in high dimensions. Bayesian additive regression trees, causal forests, causal boosting, and causal multivariate adaptive regression splines are off‐the‐shelf methods that have shown good performance for estimation of heterogeneous treatment effects in observational studies of continuous outcomes. However, it is not clear how these methods would perform in health care database studies where outcomes are often binary and rare and data structures are complex. In this study, we evaluate these methods in simulation studies that recapitulate key characteristics of comparative effectiveness studies. We focus on the conditional average effect of a binary treatment on a binary outcome using the conditional risk difference as an estimand. To emulate health care database studies, we propose a simulation design where real covariate and treatment assignment data are used and only outcomes are simulated based on nonparametric models of the real outcomes. We apply this design to 4 published observational studies that used records from 2 major health care databases in the United States. Our results suggest that Bayesian additive regression trees and causal boosting consistently provide low bias in conditional risk difference estimates in the context of health care database studies.
机译:利用来自医疗保健数据库的常规收集的数据越来越感兴趣,以研究“现实世界”条件下疗法的安全性和有效性,因为它可以为随机对照试验提供互补证据。来自医疗保健数据库的因果推断是具有挑战性的,因为数据通常是嘈杂的,高维,最重要的,观察到的。它需要能够估计异质处理效果的方法,同时控制高尺寸的混杂性。贝叶斯添加剂回归树木,因果林,因果升压和因果多变量自适应回归花瓣是现成的方法,对持续结果观察研究中的异质治疗效果估计了良好的性能。然而,目前尚不清楚这些方法如何在医疗保健数据库研究中进行,其中结果通常是二进制和罕见的和数据结构复杂。在这项研究中,我们评估了这些方法在仿真研究中,概括了比较效果研究的关键特征。我们专注于使用条件风险差异作为估计的二元治疗对二元治疗的条件平均效应。为了模拟医疗保健数据库研究,我们提出了一种模拟设计,其中使用真正的协变量和治疗分配数据,并且仅基于真实成果的非参数模型模拟结果。我们将此设计应用于4个发布的观察性研究,该研究使用了美国2重大医疗数据库的记录。我们的研究结果表明,贝叶斯添加剂回归树和因果促进在医疗保健数据库研究中的条件风险差异估算中始终如一地提供低偏差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号