首页> 外文期刊>Model assisted statistics and applications >A new framework for prediction and variable selection for uncommon events in a large prospective cohort study
【24h】

A new framework for prediction and variable selection for uncommon events in a large prospective cohort study

机译:大型前瞻性队列研究中罕见事件预测和变量选择的新框架

获取原文
获取原文并翻译 | 示例
       

摘要

When prediction is a goal, validation utilizing data outside of the prediction effort is desirable. Typically, data is split into two parts: one for a development and one for validation. But this approach becomes less attractive when predicting uncommon events, as it substantially reduces power. When predicting uncommon events within a large prospective cohort study, we propose the use of a nested case-control design, which is an alternative to the full cohort analysis. By including all cases but only a subset of the non-cases, this design is expected to produce a result similar to the full cohort analysis. In our framework, variable selection is conducted and a prediction model is fit on those selected variables in the case-control cohort. Then, the fraction of true negative predictions (specificity) of the fitted prediction model in the case-control cohort is compared to that in the rest of the cohort (non-cases) for validation. In addition, we propose an iterative variable selection using random forest for missing data imputation, as well as a strategy for a valid classification. Our framework is illustrated with an application featuring high-dimensional variable selection in a large prospective cohort study.
机译:当预测是目标时,需要利用预测工作之外的数据进行验证。通常,数据分为两部分:一个用于开发,一个用于验证。但是,这种方法在预测罕见事件时变得不那么吸引人了,因为它会大大降低功耗。当在大型前瞻性队列研究中预测罕见事件时,我们建议使用嵌套病例对照设计,这是完整队列分析的一种替代方法。通过只包括所有病例,但仅包括非病例的一个子集,预期该设计将产生与整个队列分析相似的结果。在我们的框架中,进行变量选择,并在病例对照队列中对那些选择的变量拟合预测模型。然后,将病例对照队列中拟合预测模型的真实阴性预测(特异性)的分数与其余同类队列(非案例)中的分数进行比较,以进行验证。此外,我们提出了一种使用随机森林进行迭代变量选择的方法,用于缺失数据的插补,以及一种有效分类的策略。我们的框架通过在大型前瞻性队列研究中以高维变量选择为特色的应用程序进行了说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号