International Conference on High Performance Computing

GAGM: Genome assembly on GPU using mate pairs

Abstract

Genome fragment assembly has long been a time- and computation-intensive problem in bioinformatics. Many parallel assemblers have been proposed to accelerate the process, but no effective approach has been proposed for GPUs. Meanwhile, as GPUs grow more powerful, applications from various research fields are being parallelized to take advantage of the massive number of “cores” they provide. In this paper we present the design and development of a GPU-based assembler (GAGM) for sequence assembly using Nvidia's GPUs with the CUDA programming model. Our assembler uses the mate-pair reads produced by current NGS technologies to build a paired de Bruijn graph. Every paired read is broken into paired k-mers and l-mers. Every paired k-mer represents a vertex, and paired l-mers are mapped as edges. Contigs are formed by grouping the regions of the graph that can be unambiguously connected. We present parallel algorithms for k-mer extraction, paired de Bruijn graph construction, and grouping of edges. We have benchmarked GAGM on four bacterial genomes. Our results show that the GPU design is effective in terms of both running time and the quality of the assembly produced.
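
To make the graph construction described in the abstract concrete, here is a minimal sequential Python sketch. It is not the GAGM implementation and does not reflect its CUDA kernels. It rests on several assumptions that the abstract does not state: l = k + 1, both mates of a pair have the same length and orientation, paired mers are taken at the same offset in each mate, and reads are error-free. The function names `paired_mers`, `build_paired_graph`, and `unambiguous_contigs` are hypothetical.

```python
from collections import defaultdict


def paired_mers(read_a, read_b, size):
    """Return paired substrings of length `size` taken at the same offset in both mates."""
    n = min(len(read_a), len(read_b))
    return [(read_a[i:i + size], read_b[i:i + size]) for i in range(n - size + 1)]


def build_paired_graph(read_pairs, k):
    """Build a paired de Bruijn graph (assumption: l = k + 1).

    Each paired (k+1)-mer becomes an edge from its prefix paired k-mer
    to its suffix paired k-mer.
    """
    out_edges = defaultdict(set)   # vertex -> set of successor vertices
    in_degree = defaultdict(int)   # vertex -> number of distinct incoming edges
    for read_a, read_b in read_pairs:
        for a, b in paired_mers(read_a, read_b, k + 1):
            src = (a[:-1], b[:-1])
            dst = (a[1:], b[1:])
            if dst not in out_edges[src]:
                out_edges[src].add(dst)
                in_degree[dst] += 1
    return out_edges, in_degree


def unambiguous_contigs(out_edges, in_degree):
    """Group edges into contigs by following paths with no branching.

    A vertex is "simple" when it has exactly one incoming and one outgoing
    edge. Cycles made only of simple vertices are skipped in this sketch.
    """
    def is_simple(v):
        return in_degree.get(v, 0) == 1 and len(out_edges.get(v, ())) == 1

    contigs = []
    for src in list(out_edges):
        if is_simple(src):
            continue  # interior vertex; it is reached from the start of its path
        for dst in sorted(out_edges[src]):
            left, right = src
            v = dst
            while True:
                # Each step along the path adds one base to both strands.
                left += v[0][-1]
                right += v[1][-1]
                if not is_simple(v):
                    break
                v = next(iter(out_edges[v]))
            contigs.append((left, right))
    return contigs


if __name__ == "__main__":
    pairs = [("ACGTAC", "TTGCAA")]
    graph, indeg = build_paired_graph(pairs, k=3)
    print(unambiguous_contigs(graph, indeg))
```

In this sketch every read pair and every offset can be processed independently during mer extraction, which is the kind of data parallelism a GPU can exploit; how GAGM actually distributes that work across threads is not described in the abstract.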
