...
首页> 外文期刊>International journal of parallel programming >ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture
【24h】

ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture

机译:ASW:在耦合的CPU-GPU架构上加速Smith-Waterman算法

获取原文
获取原文并翻译 | 示例

摘要

Smith-Waterman algorithm (SW) is a popular dynamic programming algorithm widely used in bioinformatics for local biological sequence alignment. Due to the O(n2) high time and space complexity of SW and growing size of biological data, it is crucial to accelerate SW for high performance. In view of the GPU high efficiency in science computation, many existing studies (e.g., CUDAlign, CUDASW++) speedup SW with GPU. However, the strong data dependency makes SW communication intensive, and the previous works fail to fully leverage the heterogeneous capabilities of the GPU machine for either the neglect of the CPU ability or the low bandwidth of PCI-e. In this paper, we propose ASW, which aims at accelerating SW algorithm with accelerated processing unit (APU), a heterogeneous processor integrates CPU and GPU in a single die and share the same memory. This coupled CPU-GPU architecture is more suitable for frequent data exchanging due to the elimination of PCI-e bus. For the full utilization of both CPU and GPU in APU system, ASW partitions the whole SW matrix into blocks and dynamically dispatches each block to CPU and GPU for the concurrent execution. A DAG-based dynamic scheduling method is presented to dispatch the workload automatically. Moreover, we also design a time cost model to determine the partition granularity in the matrix division phase. We have evaluated ASW on AMD A12 platform and our results show that ASW achieves a good performance of 7.2 GCUPS (gigacells update per second).
机译:Smith-Waterman算法(SW)是一种流行的动态编程算法,广泛用于生物信息学中以进行局部生物序列比对。由于软件的O(n2)高时空复杂性和生物数据的不断增长,因此加速软件实现高性能至关重要。鉴于GPU在科学计算中的高效率,许多现有研究(例如CUDAlign,CUDASW ++)都加快了使用GPU的软件速度。但是,强大的数据依存关系使软件之间的通信更加密集,并且先前的工作未能完全利用GPU机器的异构功能来忽略CPU能力或PCI-e的低带宽。在本文中,我们提出了ASW,其目的是通过加速处理单元(APU)来加速SW算法,一种异构处理器将CPU和GPU集成在单个芯片中,并共享相同的内存。由于消除了PCI-e总线,因此这种耦合的CPU-GPU架构更适合于频繁的数据交换。为了充分利用APU系统中的CPU和GPU,ASW将整个SW矩阵划分为多个块,并动态地将每个块分配给CPU和GPU并发执行。提出了一种基于DAG的动态调度方法来自动分配工作量。此外,我们还设计了一个时间成本模型来确定矩阵划分阶段的分区粒度。我们在AMD A12平台上评估了ASW,结果表明ASW达到了7.2 GCUPS(每秒千兆字节更新)的良好性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号