首页> 外文会议>2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops. >Libgapmis: An ultrafast library for short-read single-gap alignment
【24h】

Libgapmis: An ultrafast library for short-read single-gap alignment

机译:Libgapmis:超快速库,用于短读单间隙比对

获取原文
获取原文并翻译 | 示例

摘要

A broad variety of short-read alignment programmes has been released recently to address the task of mapping tens of millions of short reads to a reference genome, placing emphasis on various aspects of the problem. Although all programmes allow for a small number of alignment mismatches, some of them either perform poorly when allowing gap insertions or they do not allow for gap insertions at all. The seed-and-extend strategy is applied in most of these programmes: after a fast alignment between a fragment of the reference sequence and a high-quality fragment of a short read — the seed — an important problem is to extend the alignment between a relatively short succeeding fragment of the reference sequence and the remaining low-quality fragment of the read allowing a number of mismatches and the insertion of gaps in the alignment. However, the length of the short reads in combination with the gap occurrence frequency observed in various applications suggest that the single-gap alignment of (parts of) those reads is desirable. In this article, we present libgapmis, an ultrafast library for pairwise short-read single-gap alignment including accelerated SSE-based and GPU-based versions. It implements an algorithm, which computes a modified version of the traditional dynamic programming matrix for sequence alignment to solve the above alignment problem. We show that the library functions of the CPU-based version are up to 20x faster compared to competing programmes, while the respective SSE-based and GPU-based versions are up to 6x and llx faster than our CPU-based implementation, respectively. The functions made available via our library can be seamlessly integrated into any short-read alignment pipeline.
机译:最近发布了各种各样的短阅读比对程序,以解决将数千万个短阅读映射到参考基因组的任务,重点放在问题的各个方面。尽管所有程序都允许少量的对齐不匹配,但是其中一些在允许插入间隙时要么表现不佳,要么根本不允许插入间隙。种子扩展策略在大多数程序中都得到了应用:在参考序列片段与短读高质量片段(种子)之间进行快速比对后,一个重要的问题是如何延长序列之间的比对。参考序列的相对短的后续片段和读取的剩余低质量片段,允许许多错配并在比对中插入缺口。但是,短读段的长度与在各种应用中观察到的空位出现频率相结合,表明那些读段(部分)的单空位比对是理想的。在本文中,我们介绍了libgapmis,这是一种用于成对短读单间隙对齐的超快速库,其中包括基于SSE和GPU的加速版本。它实现了一种算法,该算法计算传统动态规划矩阵的修改版本以进行序列比对,以解决上述比对问题。我们证明,与竞争程序相比,基于CPU的版本的库功能快20倍,而基于SSE和GPU的相应版本分别比基于CPU的实现快6倍和11倍。通过我们的库提供的功能可以无缝集成到任何短读对齐管道中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号