首页> 外文会议>International Conference on String Processing and Information Retrieval >Dotted Suffix Trees A Structure for Approximate Text Indexing
【24h】

Dotted Suffix Trees A Structure for Approximate Text Indexing

机译:虚线后缀树是近似文本索引的结构

获取原文

摘要

In this work, the problem we address is text indexing for approximate matching. Given a text Τ which undergoes some preprocessing to generate an index, we can later query this index to identify the places where a string occurs up to a certain number of errors κ(edition distance). The indexing structure occupies space O(n log~k n) in the average case, independent of alphabet size. This structure can be used to report the existence of a match with k errors in O(3~km~(k+1)) and to report the occurrences in O(3~km~(k+1) + ed) time, where m is the length of the pattern and ed and the number of matching edit scripts. The construction of the structure has time bound by O(kN|∑|), where N is the number of nodes in the index and |∑| the alphabet size.
机译:在这项工作中,我们地址的问题是用于近似匹配的文本索引。给定经历一些预处理以生成索引的文本τ,我们稍后可以查询此索引以识别字符串发生的位置κ(版距离)。索引结构在平均案例中占用空间O(n log〜k n),与字母大小无关。该结构可用于报告与o(3〜km〜(k + 1)中的k错误的匹配的存在,并报告O(3〜km〜(k + 1)+ ed)时间的发生,其中m是模式和ed的长度以及匹配编辑脚本的数量。结构的结构具有由O(kn |σ|)的时间,其中n是指数中的节点数量和|σ|字母大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号