首页> 外文会议>IEEE International Symposium on Parallel and Distributed Processing >Improving CUDASW++, a Parallelization of Smith-Waterman for CUDA Enabled Devices
【24h】

Improving CUDASW++, a Parallelization of Smith-Waterman for CUDA Enabled Devices

机译:改善CUDASW ++,SMITH-WATERMAN for CUDA的设备并行化

获取原文

摘要

CUDASW++ is a parallelization of the Smith-Waterman algorithm for CUDA graphical processing units that computes the similarity scores of a query sequence paired with each sequence in a database. The algorithm uses one of two kernel functions to compute the score between a given pair of sequences: the inter-task kernel or the intra-task kernel. We have identified the intra-task kernel as a major bottleneck in the CUDASW++ algorithm. We have developed a new intra-task kernel that is faster than the original intra-task kernel used in CUDASW++. We describe the development of our kernel as a series of incremental changes that provide insight into a number of issues that must be considered when developing any algorithm for the CUDA architecture. We analyze the performance of our kernel compared to the original and show that the use of our intra-task kernel substantially improves the overall performance of CUDASW++ on the order of three to four giga-cell updates per second on various benchmark databases.
机译:Cudasw ++是CUDA图形处理单元的Smith-Waterman算法的并行化,该算法计算与数据库中的每个序列配对的查询序列的相似度分数。该算法使用两个内核函数中的一个来计算给定对序列之间的分数:任务间内核或帧内内核。我们已将任务内核中的内核识别为CUDASW ++算法中的主要瓶颈。我们开发了一个新的任务内核,比Cudasw ++中使用的原始任务内核更快。我们描述了我们内核的开发作为一系列增量变化,可以在为CUDA架构开发任何算法时,提供了许多必须考虑的问题。我们分析了与原始内核相比的核心的性能,并表明我们的任务内内核的使用大大提高了Cudasw ++的整体性能,大约在各种基准数据库上每秒三到四个Giga-Cell更新。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号