首页> 外文会议>2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum >An Ultrafast Scalable Many-Core Motif Discovery Algorithm for Multiple GPUs
【24h】

An Ultrafast Scalable Many-Core Motif Discovery Algorithm for Multiple GPUs

机译:适用于多个GPU的超快速可扩展多核主题发现算法

获取原文

摘要

The identification of genome-wide transcription factor binding sites is a fundamental and crucial problem to fully understand the transcriptional regulatory processes. However, the high computational cost of many motif discovery algorithms heavily constraints their application for large-scale datasets. The rapid growth of genomic sequences and gene transcription data further deteriorates the situation and establishes a strong requirement for time-efficient scalable motif discovery algorithms. The emergence of many-core architectures, typically CUDA-enabled GPUs, provides an opportunity to reduce the execution time by an order of magnitude without the loss of accuracy. In this paper, we present mCUDA-MEME, an ultrafast scalable many-core motif discovery algorithm for multiple GPUs based on the MEME algorithm. Our algorithm is implemented using a hybrid combination of the CUDA, OpenMP and MPI parallel programming models in order to harness the powerful compute capability of modern GPU clusters. At present, our algorithm supports OOPS and ZOOPS models, which are sufficient for most motif discovery applications. mCUDAMEME achieves significant speedups for the starting point search stage (and the overall execution) when benchmarked, using real datasets, against parallel MEME running on 32 CPU cores. Speedups of up to 1.4 (1.1) on a single GPU of a Fermi-based Tesla S2050 quad-GPU computing system and up to 10.8 (8.3) on the eight GPUs of a two Tesla S2050 system were observed. Furthermore, our algorithm shows good scalability with respect to dataset size and the number of GPUs (availability:https://sites.google.com/site/yongchaosoftware/mc uda-meme).
机译:全基因组转录因子结合位点的鉴定是充分理解转录调控过程的基本和关键问题。但是,许多主题发现算法的高计算成本严重限制了它们在大规模数据集中的应用。基因组序列和基因转录数据的快速增长进一步恶化了这种情况,并提出了对时间有效的可扩展基序发现算法的强烈要求。许多核心架构(通常是支持CUDA的GPU)的出现提供了将执行时间减少一个数量级而又不降低准确性的机会。在本文中,我们提出了mCUDA-MEME,这是一种基于MEME算法的针对多个GPU的超快速可扩展多核主题发现算法。我们的算法是使用CUDA,OpenMP和MPI并行编程模型的混合组合来实现的,以利用现代GPU集群的强大计算能力。目前,我们的算法支持OOPS和ZOOPS模型,足以满足大多数主题发现应用的需求。当使用实际数据集对32个CPU内核上运行的并行MEME进行基准测试时,mCUDAMEME可以大大提高起点搜索阶段(以及整体执行)的速度。在基于Fermi的Tesla S2050四GPU计算系统的单个GPU上,加速比达到1.4(1.1),而在两个Tesla S2050系统的八个GPU上,加速比达到10.8(8.3)。此外,我们的算法在数据集大小和GPU数量方面显示出良好的可扩展性(可用性:https://sites.google.com/site/yongchaosoftware/mc uda-meme)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号