...
首页> 外文期刊>Journal of Animal Science and Biotechnology >Efficient strategies for leave-one-out cross validation for genomic best linear unbiased prediction
【24h】

Efficient strategies for leave-one-out cross validation for genomic best linear unbiased prediction

机译:用于基因组最佳线性无偏预测的留一法交叉验证的有效策略

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background A random multiple-regression model that simultaneously fit all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Prediction, using whole-genome data. Leave-one-out cross validation can be used to quantify the predictive ability of a statistical model. Methods Naive application of Leave-one-out cross validation is computationally intensive because the training and validation analyses need to be repeated n times, once for each observation. Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis. Results Efficient Leave-one-out cross validation strategies is 786 times faster than the naive application for a simulated dataset with 1,000 observations and 10,000 markers and 99 times faster with 1,000 observations and 100 markers. These efficiencies relative to the naive approach using the same model will increase with increases in the number of observations. Conclusions Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis.
机译:背景技术使用全基因组数据,为最佳线性无偏预测提出了一种随机多重回归模型,该模型同时拟合所有等位基因替代效应的加性标记或单倍型,作为不相关的随机效应。留一法交叉验证可用于量化统计模型的预测能力。方法留一法交叉验证的天真应用需要大量计算,因为训练和验证分析需要重复n次,每次观察一次。本文介绍了高效的留一法交叉验证策略,与单次分析相比,所需的工作量很少。结果对于具有1000个观察值和10,000个标记的模拟数据集,有效的留一法式交叉验证策略比朴素应用程序快786倍,而具有1,000个观察值和100个标记的模拟数据集则要快99倍。与使用相同模型的简单方法相比,这些效率将随着观察次数的增加而增加。结论这里介绍了有效的留一法交叉验证策略,与单次分析相比,所需的工作量很少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号