【24h】

Fast Matching Method for DNA Sequences

机译:DNA序列的快速匹配方法

获取原文
获取原文并翻译 | 示例

摘要

DNA sequences are the fundamental information for each species and a comparison between DNA sequences of different species is an important task. Since DNA sequences are very long and there exist many species, not only fast matching but also efficient storage is an important factor for DNA sequences. Thus, a fast string matching method suitable for encoded DNA sequences is needed. In this paper, we present a fast string matching method for encoded DNA sequences which does not decode DNA sequences while matching. We use four-characters-to-one-byte encoding and combine a suffix approach and a multi-pattern matching approach. Experimental results show that our method is about 5 times faster than AGREP and the fastest among known algorithms.
机译:DNA序列是每个物种的基本信息,不同物种的DNA序列之间的比较是一项重要的任务。由于DNA序列很长,并且存在许多种,因此快速匹配和有效存储都是DNA序列的重要因素。因此,需要一种适合于编码的DNA序列的快速字符串匹配方法。在本文中,我们提出了一种用于编码DNA序列的快速字符串匹配方法,该方法在匹配时不会解码DNA序列。我们使用四字符到一字节的编码,并结合了后缀方法和多模式匹配方法。实验结果表明,我们的方法比AGREP快5倍,是已知算法中最快的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号