首页> 外文期刊>BMC proceedings. >Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
【24h】

Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels

机译:使用三种贝叶斯方法的基因组育种价值预测及其在降低密度标记板上的应用

获取原文
           

摘要

Background Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practice. In this study, three approaches were compared for their predictive power in computing GEBV when training at high SNP marker density and predicting at high or low densities: the well- known Bayes-A, a generalization of Bayes-A where scale and degrees of freedom are estimated from the data ( Student-t ) and a Bayesian implementation of the Lasso method. Twelve scenarios were evaluated for predicting GEBV using low-density marker subsets, including selection of SNP based on genome spacing or size of additive effect and the inclusion of unknown genotype information in the form of genotype probabilities from pedigree and genotyped ancestors. Results The GEBV accuracy (calculated as correlation between GEBV and traditional breeding values) was highest for Lasso, followed by Student-t and then Bayes-A. When comparing GEBV to true breeding values, Student-t was most accurate, though differences were small. In general the shrinkage applied by the Lasso approach was less conservative than Bayes-A or Student-t, indicating that Lasso may be more sensitive to QTL with small effects. In the reduced-density marker subsets the ranking of the methods was generally consistent. Overall, low-density, evenly-spaced SNPs did a poor job of predicting GEBV, but SNPs selected based on additive effect size yielded accuracies similar to those at high density, even when coverage was low. The inclusion of genotype probabilities to the evenly-spaced subsets showed promising increases in accuracy and may be more useful in cases where many QTL of small effect are expected. Conclusions In this dataset the Student-t approach slightly outperformed the other methods when predicting GEBV at both high and low density, but the Lasso method may have particular advantages in situations where many small QTL are expected. When markers were selected at low density based on genome spacing, the inclusion of genotype probabilities increased GEBV accuracy which would allow a single low- density marker panel to be used across traits.
机译:已经提出了用于预测基因组育种值(GEBV)的背景贝叶斯方法,该方法允许各个标记物具有不同的方差,从而导致收缩过程,该收缩过程使用先验信息将可忽略的影响强迫为零。这些方法通常假定适用于所有个体的高密度基因型数据,但实际情况并非如此。在这项研究中,比较了三种方法在以高SNP标记密度进行训练以及以高密度或低密度进行预测时在计算GEBV方面的预测能力:著名的Bayes-A,Bayes-A的推广,其中规模和自由度是根据数据(Student-t)和拉索方法的贝叶斯实现估计的。评估了十二种使用低密度标记子集预测GEBV的方案,包括基于基因组间隔或累加效应大小的SNP选择以及以谱系和基因型祖先的基因型概率形式包含未知基因型信息。结果Lasso的GEBV准确性(计算为GEBV与传统育种值之间的相关性)最高,其次是Student-t,然后是Bayes-A。将GEBV与真实育种值进行比较时,Student-t最准确,尽管差异很小。通常,套索方法所应用的收缩比贝叶斯A或Student-t保守程度低,这表明套索对QTL可能更敏感,且影响较小。在密度降低的标记子集中,方法的排名通常是一致的。总体而言,低密度,均匀分布的SNP在预测GEBV方面做得很差,但是,即使覆盖率较低,根据加性效应大小选择的SNP也会产生与高密度相似的准确性。基因型概率包括在均匀分布的子集中,显示准确性有希望提高,并且在预期有许多小影响的QTL的情况下可能更有用。结论在此数据集中,当以高密度和低密度预测GEBV时,Student-t方法略胜于其他方法,但是Lasso方法在预计会有许多小的QTL的情况下可能具有特殊的优势。当根据基因组间隔以低密度选择标记时,包含基因型概率会提高GEBV准确性,这将允许单个低密度标记面板用于性状。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号