首页> 外文会议>Research in computational molecular biology >Paired de Bruijn Graphs: A Novel Approach for Incorporating Mate Pair Information into Genome Assemblers
【24h】

Paired de Bruijn Graphs: A Novel Approach for Incorporating Mate Pair Information into Genome Assemblers

机译:Paired de Bruijn图:将配对对信息纳入基因组组装者的新方法

获取原文
获取原文并翻译 | 示例

摘要

The recent proliferation of next generation sequencing with short reads has enabled many new experimental opportunities but, at the same time, has raised formidable computational challenges in genome assembly. One of the key advances that has led to an improvement in contig lengths has been mate pairs, which facilitate the assembly of repeating regions. Mate pairs have been algorithmically incorporated into most next generation assemblers as various heuristic post-processing steps to correct the assembly graph or to link contigs into scaffolds. Such methods have allowed the identification of longer contigs than would be possible with single reads; however, they can still fail to resolve complex repeats. Thus, improved methods for incorporating mate pairs will have a strong effect on contig length in the future. Here, we introduce the paired de Bruijn graph, a generalization of the de Bruijn graph that incorporates mate pair information into the graph structure itself instead of analyzing mate pairs at a post-processing step. This graph has the potential to be used in place of the de Bruijn graph in any de Bruijn graph based assembler, maintaining all other assembly steps such as error-correction and repeat resolution. Through assembly results on simulated error-free data, we argue that this can effectively improve the contig sizes in assembly.
机译:短测序的新一代测序技术最近的兴起为许多新的实验机会带来了机遇,但同时也给基因组组装带来了巨大的计算挑战。导致重叠群长度改善的关键进步之一是配对,这有助于重复区域的组装。配合对已通过算法结合到大多数下一代组装机中,作为各种启发式后处理步骤,以纠正组装图或将重叠群连接到支架上。与单次读取相比,这种方法可以识别更长的重叠群。但是,它们仍然可能无法解决复杂的重复。因此,将来结合配偶对的改进方法将对重叠群长度产生很大影响。在这里,我们介绍了成对的de Bruijn图,它是de Bruijn图的一般化,它将配对对信息合并到图结构本身中,而不是在后期处理步骤中分析配对。此图有可能在任何基于de Bruijn图的汇编器中代替de Bruijn图,并保持所有其他装配步骤,例如纠错和重复分辨率。通过模拟无错误数据的组装结果,我们认为这可以有效地提高重叠群的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号