...
首页> 外文期刊>BMC Bioinformatics >CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions
【24h】

CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions

机译:CUDASW ++ 3.0:通过耦合CPU和GPU SIMD指令来加速Smith-Waterman蛋白质数据库搜索

获取原文

摘要

Background The maximal sensitivity for local alignments makes the Smith-Waterman algorithm a popular choice for protein sequence database search based on pairwise alignment. However, the algorithm is compute-intensive due to a quadratic time complexity. Corresponding runtimes are further compounded by the rapid growth of sequence databases. Results We present CUDASW++ 3.0, a fast Smith-Waterman protein database search algorithm, which couples CPU and GPU SIMD instructions and carries out concurrent CPU and GPU computations. For the CPU computation, this algorithm employs SSE-based vector execution units as accelerators. For the GPU computation, we have investigated for the first time a GPU SIMD parallelization, which employs CUDA PTX SIMD video instructions to gain more data parallelism beyond the SIMT execution model. Moreover, sequence alignment workloads are automatically distributed over CPUs and GPUs based on their respective compute capabilities. Evaluation on the Swiss-Prot database shows that CUDASW++ 3.0 gains a performance improvement over CUDASW++ 2.0 up to 2.9 and 3.2, with a maximum performance of 119.0 and 185.6 GCUPS, on a single-GPU GeForce GTX 680 and a dual-GPU GeForce GTX 690 graphics card, respectively. In addition, our algorithm has demonstrated significant speedups over other top-performing tools: SWIPE and BLAST+. Conclusions CUDASW++ 3.0 is written in CUDA C++ and PTX assembly languages, targeting GPUs based on the Kepler architecture. This algorithm obtains significant speedups over its predecessor: CUDASW++ 2.0, by benefiting from the use of CPU and GPU SIMD instructions as well as the concurrent execution on CPUs and GPUs. The source code and the simulated data are available at http://cudasw.sourceforge.net webcite .
机译:背景技术局部比对的最大灵敏度使得Smith-Waterman算法成为基于成对比对的蛋白质序列数据库搜索的流行选择。然而,由于二次时间复杂度,该算法是计算密集型的。序列数据库的快速增长进一步增加了相应的运行时。结果我们展示了CUDASW ++ 3.0,这是一种快速的Smith-Waterman蛋白质数据库搜索算法,该算法耦合CPU和GPU SIMD指令,并执行并发的CPU和GPU计算。对于CPU计算,此算法采用基于SSE的向量执行单元作为加速器。对于GPU计算,我们首次研究了GPU SIMD并行化,该并行处理使用CUDA PTX SIMD视频指令来获得超越SIMT执行模型的更多数据并行性。此外,序列比对工作负载会根据它们各自的计算能力自动分布在CPU和GPU上。在Swiss-Prot数据库上进行的评估表明,在单GPU GeForce GTX 680和双GPU GeForce GTX 690上,CUDASW ++ 3.0在2.9和3.2之上的性能均优于CUDASW ++ 2.0,最高性能为119.0和185.6 GCUPS。图形卡。此外,我们的算法已证明比其他性能最高的工具:SWIPE和BLAST +显着提高了速度。结论CUDASW ++ 3.0是用CUDA C ++和PTX汇编语言编写的,目标是基于Kepler架构的GPU。通过使用CPU和GPU SIMD指令以及在CPU和GPU上并发执行,该算法比其前身CUDASW ++ 2.0获得了显着的加速。源代码和模拟数据可从http://cudasw.sourceforge.net webcite获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号