...
首页> 外文期刊>Scientific reports. >Scalable Genome Assembly through Parallel de Bruijn Graph Construction for Multiple k-mers
【24h】

Scalable Genome Assembly through Parallel de Bruijn Graph Construction for Multiple k-mers

机译:通过平行DE BRUIJN图形结构进行可扩展的基因组组件,用于多个K-MERS

获取原文
           

摘要

Remarkable advancements in high-throughput gene sequencing technologies have led to an exponential growth in the number of sequenced genomes. However, unavailability of highly parallel and scalable de novo assembly algorithms have hindered biologists attempting to swiftly assemble high-quality complex genomes. Popular de Bruijn graph assemblers, such as IDBA-UD, generate high-quality assemblies by iterating over a set of k-values used in the construction of de Bruijn graphs (DBG). However, this process of sequentially iterating from small to large k-values slows down the process of assembly. In this paper, we propose ScalaDBG, which metamorphoses this sequential process, building DBGs for each distinct k-value in parallel. We develop an innovative mechanism to "patch" a higher k-valued graph with contigs generated from a lower k-valued graph. Moreover, ScalaDBG leverages multi-level parallelism, by both scaling up on all cores of a node, and scaling out to multiple nodes simultaneously. We demonstrate that ScalaDBG completes assembling the genome faster than IDBA-UD, but with similar accuracy on a variety of datasets (6.8X faster for one of the most complex genome in our dataset).
机译:高通量基因测序技术的显着进步导致了序列基因组数量的指数增长。然而,高度平行和可扩展的de Novo组装算法的不可用性阻碍了生物学家试图迅速组装高质量的复杂基因组。流行的de Bruijn图形汇编程序,如IDBA-UD,通过迭代在建造De Bruijn图表(DBG)的一组K值上来产生高质量的组件。然而,这种顺序迭代到大于大的k值的过程减慢了组装过程。在本文中,我们提出了ScaladBG,它使这个顺序过程变色,并行构建每个不同k值的DBG。我们开发了一种创新机制,以“修补”一种较高的K值图,其中具有从较低k值图生成的contig。此外,ScaladBG通过缩放到节点的所有核心并同时缩放到多个节点来利用多级并行性。我们展示ScaladBG完成比IDBA-UD更快地组装基因组,但在各种数据集中具有类似的准确性(对于我们数据集中最复杂的基因组的一个最复杂的基因组)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号