HPMA: High-performance metagenomic alignment tool, on a large-scale GPU cluster


Abstract

In this paper, we present HPMA, a graphics processing unit (GPU) accelerated metagenome sequence alignment algorithm for a collection of DNA sequences. This algorithm supports all-to-all pairwise local alignment on NVIDIA GPUs. HPMA builds on a GPU alignment algorithm that we developed earlier, with the addition of a filter module. We designed and developed this new kernel function based on the suffix array data structure. The filter module improves performance by identifying the subset of sequences that meet a user-defined similarity threshold and should therefore be considered for alignment. HPMA can balance the workload between CPU and GPU. HPMA allows us to preprocess very large metagenomes in a reasonable amount of time, in response to the increasing speed of NGS sequencers. The performance of HPMA has been evaluated on a cluster of Kepler-based Tesla K20 GPUs using a variety of short DNA sequence datasets. We evaluate HPMA thoroughly with four test sets. The first two test sets comprise 10 simulated datasets in which read length varies from 72 to 750 base pairs. The third test set is designed to allow a comparison with published results for GSWABE, a competing GPU alignment tool. The fourth test set is an actual metagenome of over 2 million sequences with an average length of 270 bp. We utilized a cluster of NVIDIA K20 GPUs in the Stampede supercomputer at the Texas Advanced Computing Center (Austin, TX, USA). When running on a cluster of 10 NVIDIA K20 GPUs, HPMA is able to align 2 million simulated metagenome sequences of length 300 bp in 160 seconds. In the case of real metagenomic data, HPMA is able to align 2,038,516 sequences with an average length of 270 bp in 60 seconds.
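The filter-then-align idea described in the abstract can be illustrated with a small CPU-only sketch. This is not the authors' GPU kernel: the function names (`candidate_pairs`, `smith_waterman_score`, `filter_then_align`), the exact-match criterion used as the "similarity threshold", the naive sort-based suffix array, and the scoring parameters are all assumptions made for illustration. The sketch sorts every suffix of every sequence, treats runs of adjacent suffixes that share a prefix of at least `min_match` characters as evidence of similarity, and runs a Smith-Waterman local alignment only on the sequence pairs that survive the filter.

```python
from itertools import combinations


def candidate_pairs(seqs, min_match):
    """Return pairs (i, j), i < j, of sequences sharing an exact substring
    of length >= min_match. Uses a naive generalized suffix array built by
    sorting; this is an illustrative stand-in, not an efficient construction."""
    # One entry per suffix that is at least min_match characters long.
    suffixes = [(i, o) for i, s in enumerate(seqs)
                for o in range(len(s) - min_match + 1)]
    suffixes.sort(key=lambda t: seqs[t[0]][t[1]:])

    pairs = set()
    run_ids = set()      # sequence ids in the current run of equal prefixes
    prev_prefix = None
    for i, o in suffixes:
        prefix = seqs[i][o:o + min_match]
        if prefix != prev_prefix:
            # The run ended: every pair of distinct sequences in it is a candidate.
            pairs.update(combinations(sorted(run_ids), 2))
            run_ids = set()
            prev_prefix = prefix
        run_ids.add(i)
    pairs.update(combinations(sorted(run_ids), 2))
    return pairs


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score with a linear gap penalty (assumed scores)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


def filter_then_align(seqs, min_match):
    """Align only the pairs that pass the filter; return {(i, j): score}."""
    return {(i, j): smith_waterman_score(seqs[i], seqs[j])
            for i, j in sorted(candidate_pairs(seqs, min_match))}


if __name__ == "__main__":
    reads = ["ACGTACGTAA", "TTACGTACGG", "GGGGGCCCCC"]
    print(filter_then_align(reads, min_match=6))
```

The point of the filter is the size of the all-to-all problem: n sequences give n(n-1)/2 pairs, which for the 2,038,516 real reads mentioned above is roughly 2 × 10^12 pairs, so discarding dissimilar pairs before alignment removes most of the work.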

Bibliographic details

Source
IEEE International Conference on Bioinformatics and Biomedicine | 2015 | pp. 629-634 | 6 pages
Authors
Savran Ibrahim; Rose John R.
Original format: PDF
Keywords
Bioinformatics; DNA; Genomics; Graphics processing units; GPGPU computing; Metagenomic; high-performance computing; match length; metagenomic filter

Similar literature

1. Large-Scale Pairwise Sequence Alignments on a Large-Scale GPU Cluster [J]. Design & Test, IEEE. 2014, No. 1
2. Large-Scale Pairwise Alignments on GPU Clusters: Exploring the Implementation Space [J]. Huan Truong, Da Li, Kittisak Sajjapongse. Journal of signal processing systems for signal, image, and video technology. 2014, No. 1a2
3. Evaluating the computing efficiencies (specificity and sensitivity) of graphics processing unit (GPU)-accelerated DNA sequence alignment tools against central processing unit (CPU) alignment tool [J]. Shrikant Pawar, Aditya Stanam, Ying Zhu. Journal of Bioinformatics and Sequence Analysis. 2018, No. 2
4. HPMA: High-performance metagenomic alignment tool, on a large-scale GPU cluster [C]. Savran Ibrahim, Rose John R. IEEE International Conference on Bioinformatics and Biomedicine. 2015
5. Efficient Sequence Clustering and Embedding Algorithms for Large-scale Metagenomics Data [D]. Zheng, Wei. 2019
6. METAREP: JCVI metagenomics reports—an open source tool for high-performance comparative metagenomics [O]. Johannes Goll, Douglas B. Rusch, David M. Tanenbaum, -1
7. Large-Scale Pairwise Sequence Alignments on a Large-Scale GPU Cluster [O]. Ibrahim Savran, Yang Gao, Jason D. Bakos. 2014
