Dotted Suffix Trees A Structure for Approximate Text Indexing

机译：虚线后缀树是近似文本索引的结构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, the problem we address is text indexing for approximate matching. Given a text Τ which undergoes some preprocessing to generate an index, we can later query this index to identify the places where a string occurs up to a certain number of errors κ(edition distance). The indexing structure occupies space O(n log~k n) in the average case, independent of alphabet size. This structure can be used to report the existence of a match with k errors in O(3~km~(k+1)) and to report the occurrences in O(3~km~(k+1) + ed) time, where m is the length of the pattern and ed and the number of matching edit scripts. The construction of the structure has time bound by O(kN|∑|), where N is the number of nodes in the index and |∑| the alphabet size.

机译：在这项工作中，我们地址的问题是用于近似匹配的文本索引。给定经历一些预处理以生成索引的文本τ，我们稍后可以查询此索引以识别字符串发生的位置κ（版距离）。索引结构在平均案例中占用空间O（n log〜k n），与字母大小无关。该结构可用于报告与o（3〜km〜（k + 1）中的k错误的匹配的存在，并报告O（3〜km〜（k + 1）+ ed）时间的发生，其中m是模式和ed的长度以及匹配编辑脚本的数量。结构的结构具有由O（kn |σ|）的时间，其中n是指数中的节点数量和|σ|字母大小。

著录项

来源
《International Conference on String Processing and Information Retrieval》|2006年||共8页
会议地点
作者
Luis Pedro Coelho; Arlindo L. Oliveira;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据备份与恢复;
关键词
string algorithms; suffix trees; approximate text matching; text indexing;

机译：字符串算法;后缀树;近似文本匹配;文本索引;

相似文献

外文文献
中文文献
专利

1. Compressed suffix arrays and suffix trees with applications to text indexing and string matching [J] . Grossi R, Vitter JS SIAM Journal on Computing . 2005,第2期

机译：压缩后缀数组和后缀树及其在文本索引和字符串匹配中的应用
2. Suffix Trays and Suffix Trists: Structures for Faster Text Indexing [J] . Cole Richard, Kopelowitz Tsvi, Lewenstein Moshe Algorithmica . 2015,第2期

机译：后缀纸盘和后缀字词：更快的文本索引结构
3. Geometric Suffix Tree: Indexing Protein 3-D Structures [J] . TETSUO SHIBUYA Journal of the Association for Computing Machinery . 2010,第3期

机译：几何后缀树：索引蛋白质3-D结构
4. Dotted Suffix Trees A Structure for Approximate Text Indexing [C] . Luis Pedro Coelho, Arlindo L. Oliveira String Processing and Information Retrieval; Lecture Notes in Computer Science; 4209 . 2006

机译：点缀后缀树近似文本索引的结构
5. Structure and hydraulic function of xylem in two tree species with contrasting amounts of sapwood, Pseudotsuga menziesii and Pinus ponderosa. [D] . Domec, Jean-Christophe. 2002

机译：边材数量相对不同的两种树的木质部的木质部的结构和水力功能，即Pseudotsuga menziesii和美国黄松。
6. Neural Tree Indexers for Text Understanding [O] . Tsendsuren Munkhdalai, Hong Yu -1

机译：用于文本理解的神经树索引器
7. Contracted Suffix Trees: A Simple and Dynamic Text Indexing Data Structure [O] . Andrzej Ehrenfeucht, Ross M. Mcconnell, Sung-whan Woo 2010

机译：约定的后缀树：简单动态的文本索引数据结构
8. Tree Matching Problems with Applications to Structured Text Databases [R] . Kilpelaeinen, P. 1992

机译：树形匹配问题及其在结构化文本数据库中的应用

Dotted Suffix Trees A Structure for Approximate Text Indexing

摘要

著录项

相似文献

相关主题

期刊订阅