...
首页> 外文期刊>Genetics, selection, evolution >Error rate for imputation from the Illumina BovineSNP50 chip to the Illumina BovineHD chip
【24h】

Error rate for imputation from the Illumina BovineSNP50 chip to the Illumina BovineHD chip

机译:从Illumina BovineSNP50芯片到Illumina BovineHD芯片的插补错误率

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background Imputation of genotypes from low-density to higher density chips is a cost-effective method to obtain high-density genotypes for many animals, based on genotypes of only a relatively small subset of animals (reference population) on the high-density chip. Several factors influence the accuracy of imputation and our objective was to investigate the effects of the size of the reference population used for imputation and of the imputation method used and its parameters. Imputation of genotypes was carried out from 50 000 (moderate-density) to 777 000 (high-density) SNPs (single nucleotide polymorphisms). Methods The effect of reference population size was studied in two datasets: one with 548 and one with 1289 Holstein animals, genotyped with the Illumina BovineHD chip (777 k SNPs). A third dataset included the 548 animals genotyped with the 777 k SNP chip and 2200 animals genotyped with the Illumina BovineSNP50 chip. In each dataset, 60 animals were chosen as validation animals, for which all high-density genotypes were masked, except for the Illumina BovineSNP50 markers. Imputation was studied in a subset of six chromosomes, using the imputation software programs Beagle and DAGPHASE. Results Imputation with DAGPHASE and Beagle resulted in 1.91% and 0.87% allelic imputation error rates in the dataset with 548 high-density genotypes, when scale and shift parameters were 2.0 and 0.1, and 1.0 and 0.0, respectively. When Beagle was used alone, the imputation error rate was 0.67%. If the information obtained by Beagle was subsequently used in DAGPHASE, imputation error rates were slightly higher (0.71%). When 2200 moderate-density genotypes were added and Beagle was used alone, imputation error rates were slightly lower (0.64%). The least imputation errors were obtained with Beagle in the reference set with 1289 high-density genotypes (0.41%). Conclusions For imputation of genotypes from the 50?k to the 777?k SNP chip, Beagle gave the lowest allelic imputation error rates. Imputation error rates decreased with increasing size of the reference population. For applications for which computing time is limiting, DAGPHASE using information from Beagle can be considered as an alternative, since it reduces computation time and increases imputation error rates only slightly.
机译:背景技术将基因型从低密度芯片插入到高密度芯片中,是一种基于高密度芯片上仅一小部分动物(参考种群)的基因型而获得许多动物高密度基因型的经济有效的方法。有几种因素影响插补的准确性,我们的目标是研究用于插补的参考群体的大小以及所用插补方法及其参数的影响。基因型的估算从5万(中等密度)到77.7万(高密度)SNP(单核苷酸多态性)进行。方法在两个数据集中研究了参考种群大小的影响:一个数据集548和一个动物1289,分别用Illumina BovineHD芯片(777 k SNP)进行基因分型。第三个数据集包括用777 k SNP芯片进行基因分型的548只动物和用Illumina BovineSNP50芯片进行基因分型的2200只动物。在每个数据集中,选择60只动物作为验证动物,除Illumina BovineSNP50标记外,所有高密度基因型均被屏蔽。使用插补软件程序Beagle和DAGPHASE在六个染色体的子集中研究了插补。结果当缩放和移位参数分别为2.0和0.1、1.0和0.0时,使用DAGPHASE和Beagle进行插补在548个高密度基因型数据集中产生了1.91%和0.87%的等位基因插补错误率。当单独使用Beagle时,插补误差率为0.67%。如果随后将Beagle获得的信息用于DAGPHASE,则插补错误率会稍高(0.71%)。当添加2200种中等密度基因型并单独使用Beagle时,插补错误率略低(0.64%)。使用Beagle在具有1289个高密度基因型(0.41%)的参考集中获得的最小插补误差。结论对于从50?k到777?k SNP芯片的基因型估算,Beagle给出了最低的等位基因估算错误率。插补错误率随着参考人群规模的增加而降低。对于计算时间有限的应用,可以考虑使用来自Beagle的信息进行DAGPHASE,因为它减少了计算时间,并且仅略微增加了插补错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号