首页> 外国专利> DIVIDE-AND-CONQUER GLOBAL ALIGNMENT ALGORITHM FOR FINDING HIGHLY SIMILAR CANDIDATES OF A SEQUENCE IN DATABASE

DIVIDE-AND-CONQUER GLOBAL ALIGNMENT ALGORITHM FOR FINDING HIGHLY SIMILAR CANDIDATES OF A SEQUENCE IN DATABASE

机译:查找数据库中序列的高度相似候选条件的分而治之全局对齐算法

摘要

A divide-and-conquer global alignment algorithm for finding highly similar candidates of a sequence in database is disclosed. The invention gives a divide-and-conquer algorithm called Kart, that separates the given sequence into smaller pieces whose alignment can be carried out independently, and their concatenated alignment constitutes the global alignment of the entire sequence. Kart could be viewed as aligning multiple seeds simultaneously in parallel. We illustrate the idea using the read mapping of Next-generation sequencing (NGS) as an example. NGS provides a great opportunity to investigate genome-wide variation at nucleotide resolution. Due to the huge amount of data, NGS applications require very fast alignment algorithms. The invention can process long reads as fast as short reads. Furthermore, it can tolerate much higher error rates. The experiments show that Kart spends much less time on longer reads than most aligners and still produce reliable alignments.
机译:公开了一种分治法全局比对算法,用于在数据库中查找高度相似的候选序列。本发明给出了一种称为Kart的分而治之算法,该算法将给定的序列分成较小的片段,它们的比对可以独立进行,并且它们的串联比对构成整个序列的整体比对。卡丁车可以看作是同时并行对齐多个种子。我们以下一代测序(NGS)的读取映射为例来说明这一想法。 NGS提供了一个以核苷酸分辨率研究全基因组变异的绝好机会。由于海量数据,NGS应用程序需要非常快速的对齐算法。本发明可以像短读一样快地处理长读。此外,它可以容忍更高的错误率。实验表明,与大多数比对仪相比,Kart在更长的读取上花费的时间要少得多,并且仍能产生可靠的比对。

著录项

  • 公开/公告号US2018067992A1

    专利类型

  • 公开/公告日2018-03-08

    原文格式PDF

  • 申请/专利权人 ACADEMIA SINICA;

    申请/专利号US201715694365

  • 发明设计人 WEN-LIAN HSU;HSIN-NAN LIN;

    申请日2017-09-01

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 12:59:11

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号