...
首页> 外文期刊>BMC Bioinformatics >ICGRM: integrative construction of genomic relationship matrix combining multiple genomic regions for big dataset
【24h】

ICGRM: integrative construction of genomic relationship matrix combining multiple genomic regions for big dataset

机译:ICGRM:基因组关系矩阵的综合构建与大型基因组的多种基因组区相结合

获取原文
           

摘要

BACKGROUND:Genomic prediction is an advanced method for estimating genetic values, which has been widely accepted for genetic evaluation in animal and disease-risk prediction in human. It estimates genetic values with genome-wide distributed SNPs instead of pedigree. The key step of it is to construct genomic relationship matrix (GRM) via genome-wide SNPs; however, usually the calculation of GRM needs huge computer memory especially when the SNP number and sample size are big, so that sometimes it will become computationally prohibitive even for super computer clusters. We herein developed an integrative algorithm to compute GRM. To avoid calculating GRM for the whole genome, ICGRM freely divides the genome-wide SNPs into several segments and computes the summary statistics related to GRM for each segment that requires quite few computer RAM; then it integrates these summary statistics to produce GRM for whole genome.RESULTS:It showed that the computer memory of ICGRM was reduced by 15 times (from 218Gb to 14Gb) after the genome SNPs were split into 5 to 200 parts in terms of the number of SNPs in our simulation dataset, making it computationally feasible for almost all kinds of computer servers. ICGRM is implemented in C/C++ and freely available via https://github.com/mingfang618/CLGRM.CONCLUSIONS:ICGRM is computationally efficient software to build GRM and can be used for big dataset.
机译:背景:基因组预测是估计遗传值的先进方法,该方法已被人类和疾病风险预测中的遗传评估被广泛接受。它估计基因组宽分布式SNP而不是谱系的遗传值。它的关键步骤是通过基因组的SNP构建基因组关系矩阵(GRM);然而,通常,GRM的计算需要巨大的计算机内存,特别是当SNP编号和样本大小很大时,即使对于超级计算机集群,有时它将变得对计算禁止。我们在此开发了一种计算GRM的综合算法。为避免计算整个基因组的GRM,ICGRM自由地将基因组SNP分成几个段,并计算与需要相当少量的计算机RAM的每个段相关的GRM相关的摘要统计数据;然后它集成了这些汇总统计数据来为整个Genome产生GRM。结果:它表明,在基因组SNP在数量方面分为5至200个零件后,ICGRM的计算机内存减少了15次(从218GB至14GB)在我们的模拟数据集中的SNPS,使其对几乎各种计算机服务器进行了计算地可行。 ICGRM在C / C ++中实现,并通过https://github.com/mingfang618/clgrm自由可用.Conclusions:CGRM是计算型GRM的计算有效的软件,可用于大数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号