Scalable and highly parallel implementation of Smith-Waterman on graphics processing unit using CUDA

Ali Akoglu; Gregory M. Striemer

首页> 外文期刊>Cluster Computing >Scalable and highly parallel implementation of Smith-Waterman on graphics processing unit using CUDA

【24h】

Scalable and highly parallel implementation of Smith-Waterman on graphics processing unit using CUDA

机译：使用CUDA在图形处理单元上可扩展和高度并行地实现Smith-Waterman

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Program development environments have enabled graphics processing units (GPUs) to become an attractive high performance computing platform for the scientific community. A commonly posed problem in computational biology is protein database searching for functional similarities. The most accurate algorithm for sequence alignments is Smith-Waterman (SW). However, due to its computational complexity and rapidly increasing database sizes, the process becomes more and more time consuming making cluster based systems more desirable. Therefore, scalable and highly parallel methods are necessary to make SW a viable solution for life science researchers. In this paper we evaluate how SW fits onto the target GPU architecture by exploring ways to map the program architecture on the processor architecture. We develop new techniques to reduce the memory footprint of the application while exploiting the memory hierarchy of the GPU. With this implementation, GSW, we overcome the on chip memory size constraint, achieving 23× speedup compared to a serial implementation. Results show that as the query length increases our speedup almost stays stable indicating the solid scalability of our approach. Additionally this is a first of a kind implementation which purely runs on the GPU instead of a CPU-GPU integrated environment, making our design suitable for porting onto a cluster of GPUs.

机译：程序开发环境使图形处理单元（GPU）成为科学界有吸引力的高性能计算平台。在计算生物学中普遍提出的问题是蛋白质数据库搜索功能相似性。用于序列比对的最准确算法是Smith-Waterman（SW）。然而，由于其计算复杂性和数据库大小的迅速增加，该过程变得越来越耗时，使得基于集群的系统更加可取。因此，使SW成为生命科学研究人员可行的解决方案，必须采用可扩展且高度并行的方法。在本文中，我们通过探索将程序架构映射到处理器架构上的方法，来评估SW如何适合目标GPU架构。我们开发新技术来减少应用程序的内存占用，同时利用GPU的内存层次结构。通过GSW的这种实现，我们克服了片上存储器大小的限制，与串行实现相比，实现了23倍的加速。结果表明，随着查询长度的增加，我们的加速几乎保持稳定，这表明我们的方法具有可靠的可扩展性。此外，这是第一个完全在GPU上而不是在CPU-GPU集成环境中运行的实现，这使我们的设计适合于移植到GPU集群上。

著录项

来源
《Cluster Computing》 |2009年第3期|341-352|共12页
作者
Ali Akoglu; Gregory M. Striemer;
展开▼
作者单位

Department of Electrical and Computer Engineering University of Arizona 1230 E. Speedway Blvd Tucson Arizona 85721 USA;

Department of Electrical and Computer Engineering University of Arizona 1230 E. Speedway Blvd Tucson Arizona 85721 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Graphics processing unit; Scalable; Parallel; Alignment; Smith-Waterman; CUDA;

机译：图形处理单元;可扩展;并行;对齐;史密斯·沃特曼;CUDA;

相似文献

外文文献
中文文献
专利

1. Scalable and highly parallel implementation of Smith-Watermanon graphics processing unit using CUDA [J] . Ali Akoglu, Gregory M. Striemer Cluster computing . 2009,第3期

机译：使用CUDA的Smith-Watermanon图形处理单元的可扩展且高度并行的实现
2. GICUDA: A parallel program for 3D correlation imaging of large scale gravity and gravity gradiometry data on graphics processing units with CUDA [J] . Zhaoxi Chen, Xiaohong Meng, Lianghui Guo, Computers & geosciences . 2012,第期

机译：GICUDA：并行程序，用于使用CUDA在图形处理单元上进行大规模重力和重力梯度数据的3D相关成像
3. Parallelized CCHE2D flow model with CUDA Fortran on Graphics Processing Units [J] . Yaoxin Zhang, Yafei Jia Computers & Fluids . 2013,第Null期

机译：图形处理单元上具有CUDA Fortran的并行CCHE2D流程模型
4. Parallelized computation for Edge Histogram Descriptor using CUDA on the Graphics Processing Units (GPU) [C] . Mohammadabadi Alireza Ahmadi, Chalechale Abdolah, Heidari Hadis CSI International Symposium on Computer Architecture and Digital Systems . 2013

机译：在图形处理单元（GPU）上使用CUDA对边缘直方图描述符进行并行计算
5. Parallel Implementation of Facial Detection Using Graphics Processing Units [D] . Marineau, Russell L. 2019

机译：使用图形处理单元并行实现面部检测
6. CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units [O] . Yongchao Liu, Douglas L Maskell, Bertil Schmidt 2009

机译：CUDASW ++：优化Smith-Waterman序列数据库搜索以启用CUDA的图形处理单元
7. CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units [O] . Maskell Douglas L, Liu Yongchao, Schmidt Bertil 2009

机译：CUDASW ++：优化Smith-Waterman序列数据库搜索以启用CUDA的图形处理单元

Scalable and highly parallel implementation of Smith-Waterman on graphics processing unit using CUDA

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅