Conference: IEEE International Conference on Bioinformatics and Biomedicine

Practical fast on-line exact pattern matching algorithms for highly similar sequences

Abstract

With the advent of high-throughput sequencing technologies, more and more genomic sequences of individuals of the same species are becoming available. These sequences differ from one another by only a very small number of variations. There is thus a strong need for efficient algorithms that perform fast pattern matching in such sets of sequences. In this paper we propose efficient, practical algorithms that solve the on-line exact pattern matching problem in a set of highly similar DNA sequences. We first present a method for exact single pattern matching when k variations are allowed in a window whose size is equal to the pattern length. We then propose an algorithm for exact multiple pattern matching when only one variation is allowed in a window whose size is equal to the length of the longest pattern. Experimental results show that our algorithms, though not optimal in the worst case, perform well in practice.
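The problem setting in the abstract can be illustrated with a small brute-force sketch. This is not the paper's algorithm, which the abstract does not detail; it is a naive baseline written only to make the problem statement concrete. It assumes the set of similar sequences is represented as one reference string plus single-character substitutions, that variations at different positions combine freely, and that an occurrence may use at most k variations within its window. The names `find_occurrences`, `reference`, and `variants` are hypothetical.

```python
def find_occurrences(reference, variants, pattern, k):
    """Naive baseline (assumption: not the method from the paper).

    reference: str, the shared reference sequence.
    variants:  dict mapping a position to the set of alternative
               characters seen at that position (substitutions only).
    pattern:   str, the pattern to match exactly.
    k:         maximum number of variations usable inside one window
               of length len(pattern).
    Returns the start positions where the pattern occurs in at least
    one sequence obtainable from the reference and the variations.
    """
    m = len(pattern)
    hits = []
    for start in range(len(reference) - m + 1):
        used = 0  # variations consumed in the current window
        for offset, symbol in enumerate(pattern):
            pos = start + offset
            if symbol == reference[pos]:
                continue  # agrees with the reference, no variation needed
            if symbol in variants.get(pos, ()) and used < k:
                used += 1  # agrees with a known variation at this position
                continue
            break  # mismatch, or variation budget exhausted
        else:
            hits.append(start)  # inner loop finished without a break
    return hits


if __name__ == "__main__":
    reference = "ACGTACGT"
    variants = {2: {"T"}, 5: {"G"}}
    print(find_occurrences(reference, variants, "ACTT", k=1))
    print(find_occurrences(reference, variants, "TTAG", k=2))
```

This sketch scans every window and costs O(nm) character comparisons in the worst case for a reference of length n and a pattern of length m, so it serves only as a correctness baseline against which a faster matcher could be checked.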
