...
首页> 外文期刊>Genetic epidemiology. >Sample size considerations of prediction-validation methods in high-dimensional data for survival outcomes
【24h】

Sample size considerations of prediction-validation methods in high-dimensional data for survival outcomes

机译:高维数据预测验证方法对生存结果的样本量考虑

获取原文
获取原文并翻译 | 示例
           

摘要

A variety of prediction methods are used to relate high-dimensional genome data with a clinical outcome using a prediction model. Once a prediction model is developed from a data set, it should be validated using a resampling method or an independent data set. Although the existing prediction methods have been intensively evaluated by many investigators, there has not been a comprehensive study investigating the performance of the validation methods, especially with a survival clinical outcome. Understanding the properties of the various validation methods can allow researchers to perform more powerful validations while controlling for type I error. In addition, sample size calculation strategy based on these validation methods is lacking. We conduct extensive simulations to examine the statistical properties of these validation strategies. In both simulations and a real data example, we have found that 10-fold cross-validation with permutation gave the best power while controlling type I error close to the nominal level. Based on this, we have also developed a sample size calculation method that will be used to design a validation study with a user-chosen combination of prediction. Microarray and genome-wide association studies data are used as illustrations. The power calculation method in this presentation can be used for the design of any biomedical studies involving high-dimensional data and survival outcomes.
机译:使用预测模型,可以使用多种预测方法将高维基因组数据与临床结果相关联。从数据集开发出预测模型后,应使用重采样方法或独立的数据集对其进行验证。尽管许多研究人员已经对现有的预测方法进行了深入评估,但还没有一项全面的研究来研究验证方法的性能,尤其是对生存期临床结果的评估。了解各种验证方法的属性可以使研究人员在控制I型错误的同时执行更强大的验证。另外,缺乏基于这些验证方法的样本量计算策略。我们进行了广泛的模拟,以检查这些验证策略的统计属性。在仿真和真实数据示例中,我们都发现带有置换的10倍交叉验证在将I型错误控制在标称水平附近的同时提供了最佳性能。基于此,我们还开发了一种样本量计算方法,该方法将用于设计具有用户选择的预测组合的验证研究。微阵列和全基因组关联研究数据用作例证。此演示文稿中的功效计算方法可用于设计涉及高维数据和生存结果的任何生物医学研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号