首页> 外文会议>Advances in Knowledge Discovery and Data Mining; Lecture Notes in Artificial Intelligence; 4426 >A Practical Method for Approximate Subsequence Search in DNA Databases
【24h】

A Practical Method for Approximate Subsequence Search in DNA Databases

机译:DNA数据库中近似子序列搜索的实用方法

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose an accurate and efficient method for approximate subsequence search in large DNA databases. The proposed method basically adopts a binary trie as its primary structure and stores all the window subsequences extracted from a DNA sequence. For approximate subsequence search, it traverses the binary trie in a breadth-first fashion and retrieves all the matched subsequences from the traversed path within the trie by a dynamic programming technique. However, the proposed method stores only window subsequences of the pre-determined length, and thus suffers from large post-processing time in case of long query sequences. To overcome this problem, we divide a query sequence into shorter pieces, perform searching for those subsequences, and then merge their results.
机译:在本文中,我们为大型DNA数据库中的近似子序列搜索提出了一种准确而有效的方法。所提出的方法基本上采用二进制特里结构作为其主要结构,并存储从DNA序列中提取的所有窗口子序列。对于近似子序列搜索,它以广度优先的方式遍历二进制trie,并通过动态编程技术从遍历路径中的trie中检索所有匹配的子序列。然而,所提出的方法仅存储预定长度的窗口子序列,因此在长查询序列的情况下遭受较大的后处理时间。为了解决这个问题,我们将查询序列分为多个较短的部分,搜索这些子序列,然后合并它们的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号