首页> 外文期刊>Bioinformatics >PASS: a program to align short sequences
【24h】

PASS: a program to align short sequences

机译:PASS:对齐短序列的程序

获取原文
获取原文并翻译 | 示例
           

摘要

Summary: Standard DNA alignment programs are inadequate to manage the data produced by new generation DNA sequencers. To answer this problem, we developed PASS with the objective of improving execution time and sensitivity when compared with other available programs. PASS performs fast gapped and ungapped alignments of short DNA sequences onto a reference DNA, typically a genomic sequence. It is designed to handle a huge amount of reads such as those generated by Solexa, SOLiD or 454 technologies. The algorithm is based on a data structure that holds in RAM the index of the genomic positions of ‘seed’ words (typically 11 and 12 bases) as well as an index of the precomputed scores of short words (typically seven and eight bases) aligned against each other. After building the genomic index, the program scans every query sequence performing three steps: (1) it finds matching seed words in the genome; (2) for every match checks the precomputed alignment of the short flanking regions; (3) if passes step 2, then it performs an exact dynamic alignment of a narrow region around the match. The performance of the program is very striking both for sensitivity and speed. For instance, gap alignment is achieved hundreds of times faster than BLAST and several times faster than SOAP, especially when gaps are allowed. Furthermore, PASS has a higher sensitivity when compared with the other available programs.
机译:简介:标准DNA比对程序不足以管理新一代DNA测序仪产生的数据。为了解决这个问题,我们开发了PASS,目的是与其他可用程序相比,可以缩短执行时间和提高灵敏度。 PASS将短DNA序列与参考DNA(通常是基因组序列)进行快速缺口和非缺口比对。它旨在处理大量读取,例如Solexa,SOLiD或454技术生成的读取。该算法基于一种数据结构,该数据结构在RAM中保存“种子”字的基因组位置索引(通常为11和12个碱基)以及预先计算的短字得分(通常为7和8个碱基)的索引互相对抗。建立基因组索引后,程序将扫描每个查询序列,执行三个步骤:(1)在基因组中找到匹配的种子词; (2)每次比赛都要检查短侧翼区域的预先计算的路线; (3)如果通过了步骤2,那么它将对比赛周围的狭窄区域进行精确的动态对齐。该程序的性能在灵敏度和速度上都非常惊人。例如,间隙对齐比BLAST快数百倍,比SOAP快几倍,特别是在允许间隙的情况下。此外,与其他可用程序相比,PASS具有更高的灵敏度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号