Dotted Suffix Trees A Structure for Approximate Text Indexing

机译：点缀后缀树近似文本索引的结构

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, the problem we address is text indexing for approximate matching. Given a text Τ which undergoes some preprocessing to generate an index, we can later query this index to identify the places where a string occurs up to a certain number of errors κ(edition distance). The indexing structure occupies space O(n log~k n) in the average case, independent of alphabet size. This structure can be used to report the existence of a match with k errors in O(3~km~(k+1)) and to report the occurrences in O(3~km~(k+1) + ed) time, where m is the length of the pattern and ed and the number of matching edit scripts. The construction of the structure has time bound by O(kN|∑|), where N is the number of nodes in the index and |∑| the alphabet size.

机译：在这项工作中，我们要解决的问题是文本索引以进行近似匹配。给定文本Τ进行了一些预处理以生成索引，我们稍后可以查询该索引以识别出现一定误差κ（编辑距离）的字符串出现的位置。在平均情况下，索引结构占用空间O（n log〜k n），与字母大小无关。该结构可用于报告O（3〜km〜（k + 1））中存在k个错误的匹配项的存在，并报告O（3〜km〜（k + 1）+ ed）时间中的匹配项，其中m是样式和ed的长度以及匹配的编辑脚本的数量。结构的构造受O（kN | ∑ |）的时间限制，其中N是索引中的节点数，| ∑ |字母大小。

著录项

来源
《String Processing and Information Retrieval; Lecture Notes in Computer Science; 4209》|2006年|329-336|共8页
会议地点 Glasgow(GB)
作者
Luis Pedro Coelho; Arlindo L. Oliveira;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类数据备份与恢复;
关键词
string algorithms; suffix trees; approximate text matching; text indexing;

机译：字符串算法；后缀树;近似文字匹配；文字索引;

相似文献

外文文献
中文文献
专利

1. Compressed suffix arrays and suffix trees with applications to text indexing and string matching [J] . Grossi R, Vitter JS SIAM Journal on Computing . 2005,第2期

机译：压缩后缀数组和后缀树及其在文本索引和字符串匹配中的应用
2. Suffix Trays and Suffix Trists: Structures for Faster Text Indexing [J] . Cole Richard, Kopelowitz Tsvi, Lewenstein Moshe Algorithmica . 2015,第2期

机译：后缀纸盘和后缀字词：更快的文本索引结构
3. Geometric Suffix Tree: Indexing Protein 3-D Structures [J] . TETSUO SHIBUYA Journal of the Association for Computing Machinery . 2010,第3期

机译：几何后缀树：索引蛋白质3-D结构
4. Dotted Suffix Trees A Structure for Approximate Text Indexing [C] . Luis Pedro Coelho, Arlindo L. Oliveira International Conference on String Processing and Information Retrieval . 2006

机译：虚线后缀树是近似文本索引的结构
5. Structure and hydraulic function of xylem in two tree species with contrasting amounts of sapwood, Pseudotsuga menziesii and Pinus ponderosa. [D] . Domec, Jean-Christophe. 2002

机译：边材数量相对不同的两种树的木质部的木质部的结构和水力功能，即Pseudotsuga menziesii和美国黄松。
6. Neural Tree Indexers for Text Understanding [O] . Tsendsuren Munkhdalai, Hong Yu -1

机译：用于文本理解的神经树索引器
7. Contracted Suffix Trees: A Simple and Dynamic Text Indexing Data Structure [O] . Andrzej Ehrenfeucht, Ross M. Mcconnell, Sung-whan Woo 2010

机译：约定的后缀树：简单动态的文本索引数据结构
8. Tree Matching Problems with Applications to Structured Text Databases [R] . Kilpelaeinen, P. 1992

机译：树形匹配问题及其在结构化文本数据库中的应用

Dotted Suffix Trees A Structure for Approximate Text Indexing

摘要

著录项

相似文献

相关主题

期刊订阅