首页> 外文会议>International Conference on Algorithms and Architectures for Parallel Processing >MrBayes for Phylogenetic Inference Using Protein Data on a GPU Cluster
【24h】

MrBayes for Phylogenetic Inference Using Protein Data on a GPU Cluster

机译:使用蛋白质数据在GPU簇上使用蛋白质发育推论MRBAYES

获取原文

摘要

MrBayes is a widely used software for Bayesian phylogenetic inference: we input biological sequence data from various taxonomic groups, and MrBayes returns its estimate of the phylogenetic tree which gave rise to those taxa. This paper presents ta(MC)~3, based on its pre-decessor a(MC)~3, which, for protein datasets, improves computational efficiency and overcomes major obstacles in analyzing larger datasets on HPCs with multiple Graphics Processing Units (GPUs). The major improvements are (a) a new task mapping strategy, (b) the use of Kahan summation to resolve non-convergence issues, and (c) the introduction of 64-bit variables. We evaluate ta(MC)~3 on real-world protein datasets both on a desktop server and the Tianhe-1A supercomputer. With a single GPU, ta(MC)~3 is nearly 90 times faster compared with the serial version of MrBayes, up to around 9 times faster than MrBayes utilizing a GPU via the BEAGLE library, and up to 2.5 times faster than a(MC)~3. On larger datasets with 64 nodes (GPUs) on Tianhe-1A, ta(MC)~3 is capable of obtaining 1000+ speedup vs. serial MrBayes.
机译:MRBAYES是贝叶斯系统发育推理的广泛使用的软件:我们从各种分类群中输入生物序列数据,MRBAYES返回其对那些分类群的系统发育树的估计。本文介绍了TA(MC)〜3,基于其前解码器A(MC)〜3,即用于蛋白质数据集,提高了计算效率并克服了在具有多个图形处理单元(GPU)的HPC上的较大数据集中的主要障碍。主要的改进是:(a)新任务映射策略,(二)使用Kahan的总和来解决不衔接的问题,以及(c)采用64位变量。我们在桌面服务器和天河1A超级计算机上评估真实蛋白质数据集的TA(MC)〜3。通过单个GPU,与MRBAYES的串行版本相比,TA(MC)〜3比MRBAYES的串行版本速度速度近90倍,比MRBAYES通过BEGGLE库利用GPU的MRBAY,速度快于A(MC )〜3。在Tianhe-1a上具有64个节点(GPU)的较大数据集上,TA(MC)〜3能够获得1000+加速与串行MRBAYES。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号