首页> 美国卫生研究院文献>other >Empirical Bayes Estimation and Prediction Using Summary-Level Information From External Big Data Sources Adjusting for Violations of Transportability
【2h】

Empirical Bayes Estimation and Prediction Using Summary-Level Information From External Big Data Sources Adjusting for Violations of Transportability

机译:使用来自外部大数据源的汇总级别信息的经验贝叶斯估计和预测并针对运输性违规进行了调整

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Large external data sources may be available to augment studies that collect data to address a specific research objective. In this article we consider the problem of building regression models for prediction based on individual-level data from an “internal” study while incorporating summary information from an “external” big data source. We extend the work of by introducing an adaptive empirical Bayes shrinkage estimator that uses the external summary-level information and the internal data to trade bias with variance for protection against departures in the conditional probability distribution of the outcome given a set of covariates between the two populations. We use simulation studies and a real data application using external summary information from the Prostate Cancer Prevention Trial to assess the performance of the proposed methods in contrast to maximum likelihood estimation and the constrained maximum likelihood (CML) method developed by . Our simulation studies show that the CML method can be biased and inefficient when the assumption of a transportable covariate distribution between the external and internal populations is violated, and our empirical Bayes estimator provides protection against bias and loss of efficiency.
机译:大型外部数据源可能可用于扩充收集数据以解决特定研究目标的研究。在本文中,我们考虑了基于“内部”研究中的个人数据同时结合来自“外部”大数据源的摘要信息来构建预测的回归模型的问题。我们通过引入自适应经验贝叶斯收缩估计器来扩展工作,该估计器使用外部摘要级别信息和内部数据来交易具有偏差的偏差,以防止由于给定两者之间的一组协变量而导致结果的条件概率分布偏离人口。我们使用模拟研究和实际数据应用程序,使用来自前列腺癌预防试验的外部摘要信息来评估所提出方法的性能,与之相比,最大似然估计和约束最大似然(CML)方法是由开发的。我们的仿真研究表明,如果违反了外部和内部总体之间可运输的协变量分布的假设,则CML方法可能有偏差且效率低下,而我们的经验贝叶斯估计器可以防止偏差和效率损失。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号