Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation

Wyss Richard; Schneeweiss Sebastian; van der Laan Mark; Lendle Samuel D.; Ju Cheng; Franklin Jessica M.

首页> 外文期刊>Epidemiology >Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation

【24h】

Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation

机译：使用超学习预测模型来提高高维倾向评分估计

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The high-dimensional propensity score is a semiautomated variable selection algorithm that can supplement expert knowledge to improve confounding control in nonexperimental medical studies utilizing electronic healthcare databases. Although the algorithm can be used to generate hundreds of patient-level variables and rank them by their potential confounding impact, it remains unclear how to select the optimal number of variables for adjustment. We used plasmode simulations based on empirical data to discuss and evaluate data-adaptive approaches for variable selection and prediction modeling that can be combined with the high-dimensional propensity score to improve confounding control in large healthcare databases. We considered approaches that combine the high-dimensional propensity score with Super Learner prediction modeling, a scalable version of collaborative targeted maximum-likelihood estimation, and penalized regression. We evaluated performance using bias and mean squared error (MSE) in effect estimates. Results showed that the high-dimensional propensity score can be sensitive to the number of variables included for adjustment and that severe overfitting of the propensity score model can negatively impact the properties of effect estimates. Combining the high-dimensional propensity score with Super Learner was the most consistent strategy, in terms of reducing bias and MSE in the effect estimates, and may be promising for semiautomated data-adaptive propensity score estimation in high-dimensional covariate datasets.

机译：高维倾向评分是半过滤可变选择算法，可以补充专家知识，以利用电子医疗保健数据库改善非激活医学研究中的混淆控制。虽然该算法可用于生成数百名患者级变量并通过潜在的混淆影响对它们进行排序，但它仍然不清楚如何选择用于调整的最佳变量数量。我们使用基于经验数据的等离子体仿真来讨论和评估可变选择和预测建模的数据自适应方法，这些方法可以与高维倾向评分组合以改善大型医疗数据库中的混淆控制。我们考虑了与超级学习者预测建模的高维倾向评分结合的方法，可缩放版本的协作目标最大似然估计和惩罚回归。我们使用偏差和均值平方误差（MSE）进行评估性能。结果表明，高维倾向评分可以对包括用于调整的变量数，并且倾向评分模型的严重过度接收可能对效果估计的性质产生负面影响。在减少效果估计的偏差和MSE方面，将高维倾向分数与超学习者相结合是最一致的策略，并且可能对高维协变量数据集中的半归类数据自适应倾销评分估计有望。

著录项

来源
《Epidemiology》 |2018年第1期|共11页
作者
Wyss Richard; Schneeweiss Sebastian; van der Laan Mark; Lendle Samuel D.; Ju Cheng; Franklin Jessica M.;
展开▼
作者单位

Brigham &

Womens Hosp Div Pharmacoepidemiol &

Pharmacoecon 1620 Tremont St Suite 3030 Boston MA;

Brigham &

Womens Hosp Div Pharmacoepidemiol &

Pharmacoecon 1620 Tremont St Suite 3030 Boston MA;

Univ Calif Berkeley Dept Biostat Berkeley CA 94720 USA;

Univ Calif Berkeley Dept Biostat Berkeley CA 94720 USA;

Univ Calif Berkeley Dept Biostat Berkeley CA 94720 USA;

Brigham &

Womens Hosp Div Pharmacoepidemiol &

Pharmacoecon 1620 Tremont St Suite 3030 Boston MA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类流行病学与防疫;
关键词

相似文献

外文文献
中文文献
专利

1. Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation [J] . Wyss Richard, Schneeweiss Sebastian, van der Laan Mark, Epidemiology . 2018,第1期

机译：使用超学习预测模型来提高高维倾向评分估计
2. Propensity score prediction for electronic healthcare databases using super learner and high-dimensional propensity score methods [J] . Ju Cheng, Combs Mary, Lendle Samuel D., Journal of applied statistics . 2019,第9a12期

机译：使用超学习者和高维倾向评分方法对电子医疗数据库的倾向评分预测
3. The Balance Super Learner: A robust adaptation of the Super Learner to improve estimation of the average treatment effect in the treated based on propensity score matching [J] . Pirracchio Romain, Carone Marco Statistical methods in medical research . 2018,第8期

机译：平衡超级学习者：超级学习者的强大适应，以提高基于倾向得分匹配治疗中的平均治疗效果的估计
4. High-dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation [C] . Zhuoran Yang, Krishnakumar Balasubramanian, Han Liu International Conference on Machine Learning . 2018

机译：通过阈值分数函数估计高维非高斯单索索引模型
5. Improving Safety Estimation and Prediction Using Multivariate Regression Models in Observational Road Safety Studies. [D] . Musunuru, Anusha. 2017

机译：在观察性道路安全研究中使用多元回归模型改善安全性估算和预测。
6. Improving Propensity Score Estimators Robustness to Model Misspecification Using Super Learner [O] . Romain Pirracchio, Maya L. Petersen, Mark van der Laan -1

机译：使用Super Learner提高倾向得分估算器对模型错误指定的鲁棒性
7. Propensity score prediction for electronic healthcare databases using super learner and high-dimensional propensity score methods [O] . Cheng Ju, Mary Combs, Samuel D. Lendle, 2019

机译：使用超学习者和高维倾向评分方法对电子医疗数据库的倾向评分预测

Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation

摘要

著录项

相似文献

相关主题

期刊订阅