首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware
【24h】

Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware

机译:在支持CUDA的图形硬件上并行实现MAFFT

获取原文
获取原文并翻译 | 示例
           

摘要

Multiple sequence alignment (MSA) constitutes an extremely powerful tool for many biological applications including phylogenetic tree estimation, secondary structure prediction, and critical residue identification. However, aligning large biological sequences with popular tools such as MAFFT requires long runtimes on sequential architectures. Due to the ever increasing sizes of sequence databases, there is increasing demand to accelerate this task. In this paper, we demonstrate how graphic processing units (GPUs), powered by the compute unified device architecture (CUDA), can be used as an efficient computational platform to accelerate the MAFFT algorithm. To fully exploit the GPU’s capabilities for accelerating MAFFT, we have optimized the sequence data organization to eliminate the bandwidth bottleneck of memory access, designed a memory allocation and reuse strategy to make full use of limited memory of GPUs, proposed a new modified-run-length encoding (MRLE) scheme to reduce memory consumption, and used high-performance shared memory to speed up I/O operations. Our implementation tested in three NVIDIA GPUs achieves speedup up to 11.28 on a Tesla K20m GPU compared to the sequential MAFFT 7.015.
机译:多序列比对(MSA)是许多生物学应用(包括系统发育树估计,二级结构预测和关键残基识别)的强大工具。但是,将大型生物序列与流行的工具(例如MAFFT)进行比对需要序列架构上的长时间运行。由于序列数据库的大小不断增加,因此对加速此任务的需求不断增加。在本文中,我们演示了如何使用由统一计算设备体系结构(CUDA)驱动的图形处理单元(GPU)作为有效的计算平台来加速MAFFT算法。为了充分利用GPU的加速MAFFT功能,我们优化了序列数据组织以消除内存访问的带宽瓶颈,设计了一种内存分配和重用策略,以充分利用GPU的有限内存,提出了一种新的运行长度编码(MRLE)方案可减少内存消耗,并使用高性能共享内存来加速I / O操作。与顺序MAFFT 7.015相比,我们在三个NVIDIA GPU上进行测试的实现在Tesla K20m GPU上实现了高达11.28的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号