首页> 外文期刊>Annals of Human Genetics >The effect of retrospective sampling on estimates of prediction error for multifactor dimensionality reduction.
【24h】

The effect of retrospective sampling on estimates of prediction error for multifactor dimensionality reduction.

机译:回顾性抽样对预测误差估计的影响,以降低多维度维度。

获取原文
获取原文并翻译 | 示例
           

摘要

The standard in genetic association studies of complex diseases is replication and validation of positive results, with an emphasis on assessing the predictive value of associations. In response to this need, a number of analytical approaches have been developed to identify predictive models that account for complex genetic etiologies. Multifactor Dimensionality Reduction (MDR) is a commonly used, highly successful method designed to evaluate potential gene-gene interactions. MDR relies on classification error in a cross-validation framework to rank and evaluate potentially predictive models. Previous work has demonstrated the high power of MDR, but has not considered the accuracy and variance of the MDR prediction error estimate. Currently, we evaluate the bias and variance of the MDR error estimate as both a retrospective and prospective estimator and show that MDR can both underestimate and overestimate error. We argue that a prospective error estimate is necessary if MDR models are used for prediction, and propose a bootstrap resampling estimate, integrating population prevalence, to accurately estimate prospective error. We demonstrate that this bootstrap estimate is preferable for prediction to the error estimate currently produced by MDR. While demonstrated with MDR, the proposed estimation is applicable to all data-mining methods that use similar estimates.
机译:复杂疾病遗传关联研究的标准是复制和验证阳性结果,重点是评估关联的预测价值。为了响应这种需求,已经开发了许多分析方法来识别解释复杂遗传病因的预测模型。多因素降维(MDR)是一种常用的,非常成功的方法,旨在评估潜在的基因-基因相互作用。 MDR依靠交叉验证框架中的分类错误来对潜在的预测模型进行排名和评估。先前的工作已经证明了MDR的强大功能,但并未考虑MDR预测误差估计的准确性和方差。当前,我们将MDR误差估计值的偏倚和方差作为回顾性和前瞻性估计器进行评估,并显示MDR可以低估和高估误差。我们认为,如果将MDR模型用于预测,则有必要进行前瞻性误差估计,并提出一个结合人口患病率的自举重采样估计值,以准确估计前瞻性误差。我们证明此引导程序估计比MDR当前产生的错误估计更可预测。虽然已通过MDR进行了演示,但建议的估计适用于所有使用类似估计的数据挖掘方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号