Approximate String Matching Using Compressed Suffix Arrays

机译：使用压缩后缀数组进行近似字符串匹配

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabet A. The k-difference (k-mismatch, respectively) problem is to find all occurrences of P in T that have edit distance (Hamming distance, respectively) at most k from P. In this paper we investigate a well-studied case in which k = 1 and T is fixed and preprocessed into an indexing data structure so that any pattern query can be answered faster. This paper gives a solution using O(n) bits indexing data structure with O(m log~2 n) query time. To the best of our knowledge, this is the first result which requires linear indexing space. The results can be extended for the k-difference problem with k ≥ 1.

机译：令T为长度为n的文本，P为长度为m的模式，两个字符串都位于固定的有限字母A上。k差异（分别为k不匹配）问题是查找T中所有出现的具有编辑的P距P的最大距离（分别为汉明距离）。在本文中，我们研究了一种经过充分研究的情况，其中k = 1并且T是固定的，并且预处理为索引数据结构，以便可以更快地回答任何模式查询。本文提出了一种使用O（n）位索引数据结构和O（m log〜2 n）查询时间的解决方案。据我们所知，这是需要线性索引空间的第一个结果。可以将结果推广到k≥1的k差问题。

著录项

来源
《Combinatorial Pattern Matching》|2004年|P.434-444|共11页
会议地点
作者
Trinh N.D. Huynh; Wing-Kai Hon; Tak-Wah Lam; Wing-Kin Sung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Approximate string matching using compressed suffix arrays [J] . Trinh N. D. Huynh, Wing-Kai Hon, Tak-Wah Lam, Theoretical computer science . 2006,第1a3期

机译：使用压缩后缀数组进行近似字符串匹配
2. Compressed suffix arrays and suffix trees with applications to text indexing and string matching [J] . Grossi R, Vitter JS SIAM Journal on Computing . 2005,第2期

机译：压缩后缀数组和后缀树及其在文本索引和字符串匹配中的应用
3. Improved Approximate String Matching Using Compressed Suffix Data Structures [J] . Tak-Wah Lam, Wing-Kin Sung, Swee-Seong Wong Algorithmica . 2008,第3期

机译：使用压缩后缀数据结构的改进的近似字符串匹配
4. Approximate String Matching Using Compressed Suffix Arrays [C] . Trinh N. D. Huynh, Wing-Kai Hon, Tak-Wah Lam, Annual Symposium on Combinatorial Pattern Matching . 2004

机译：使用压缩后缀数组匹配的近似字符串
5. Suffix trees and suffix arrays in primary and secondary storage [D] . Ko, Pang 2007

机译：主存储和辅助存储中的后缀树和后缀数组
6. gsufsort: constructing suffix arrays LCP arrays and BWTs for string collections [O] . Felipe A. Louza, Guilherme P. Telles, Simon Gog, 2020

机译：gsufsort：构造后缀阵列LCP阵列和BWTS for String Collections
7. Approximate string matching using compressed suffix arrays [O] . Huynh, Trinh N. D. 2012

机译：使用压缩后缀数组进行近似字符串匹配

Approximate String Matching Using Compressed Suffix Arrays

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅