...
首页> 外文期刊>Epidemiology >Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation
【24h】

Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation

机译:使用超学习预测模型来提高高维倾向评分估计

获取原文
获取原文并翻译 | 示例
           

摘要

The high-dimensional propensity score is a semiautomated variable selection algorithm that can supplement expert knowledge to improve confounding control in nonexperimental medical studies utilizing electronic healthcare databases. Although the algorithm can be used to generate hundreds of patient-level variables and rank them by their potential confounding impact, it remains unclear how to select the optimal number of variables for adjustment. We used plasmode simulations based on empirical data to discuss and evaluate data-adaptive approaches for variable selection and prediction modeling that can be combined with the high-dimensional propensity score to improve confounding control in large healthcare databases. We considered approaches that combine the high-dimensional propensity score with Super Learner prediction modeling, a scalable version of collaborative targeted maximum-likelihood estimation, and penalized regression. We evaluated performance using bias and mean squared error (MSE) in effect estimates. Results showed that the high-dimensional propensity score can be sensitive to the number of variables included for adjustment and that severe overfitting of the propensity score model can negatively impact the properties of effect estimates. Combining the high-dimensional propensity score with Super Learner was the most consistent strategy, in terms of reducing bias and MSE in the effect estimates, and may be promising for semiautomated data-adaptive propensity score estimation in high-dimensional covariate datasets.
机译:高维倾向评分是半过滤可变选择算法,可以补充专家知识,以利用电子医疗保健数据库改善非激活医学研究中的混淆控制。虽然该算法可用于生成数百名患者级变量并通过潜在的混淆影响对它们进行排序,但它仍然不清楚如何选择用于调整的最佳变量数量。我们使用基于经验数据的等离子体仿真来讨论和评估可变选择和预测建模的数据自适应方法,这些方法可以与高维倾向评分组合以改善大型医疗数据库中的混淆控制。我们考虑了与超级学习者预测建模的高维倾向评分结合的方法,可缩放版本的协作目标最大似然估计和惩罚回归。我们使用偏差和均值平方误差(MSE)进行评估性能。结果表明,高维倾向评分可以对包括用于调整的变量数,并且倾向评分模型的严重过度接收可能对效果估计的性质产生负面影响。在减少效果估计的偏差和MSE方面,将高维倾向分数与超学习者相结合是最一致的策略,并且可能对高维协变量数据集中的半归类数据自适应倾销评分估计有望。

著录项

  • 来源
    《Epidemiology》 |2018年第1期|共11页
  • 作者单位

    Brigham &

    Womens Hosp Div Pharmacoepidemiol &

    Pharmacoecon 1620 Tremont St Suite 3030 Boston MA;

    Brigham &

    Womens Hosp Div Pharmacoepidemiol &

    Pharmacoecon 1620 Tremont St Suite 3030 Boston MA;

    Univ Calif Berkeley Dept Biostat Berkeley CA 94720 USA;

    Univ Calif Berkeley Dept Biostat Berkeley CA 94720 USA;

    Univ Calif Berkeley Dept Biostat Berkeley CA 94720 USA;

    Brigham &

    Womens Hosp Div Pharmacoepidemiol &

    Pharmacoecon 1620 Tremont St Suite 3030 Boston MA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 流行病学与防疫;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号