Compressed Indexes for Approximate String Matching

Ho-Leung Chan; Tak-Wah Lam; Wing-Kin Sung; Siu-Lung Tam; Swee-Seong Wong

首页> 外文期刊>Algorithmica >Compressed Indexes for Approximate String Matching

【24h】

Compressed Indexes for Approximate String Matching

机译：用于近似字符串匹配的压缩索引

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that match a given pattern P[1..m] with at most k errors. Previous solutions either require an index of size exponential in k or need Ω(m k ) time for searching. Motivated by the indexing of DNA, we investigate space efficient indexes that occupy only O(n) space. For k=1, we give an index to support matching in O(m+occ+log nlog log n) time. The previously best solution achieving this time complexity requires an index of O(nlog n) space. This new index can also be used to improve existing indexes for k≥2 errors. Among others, it can support 2-error matching in O(mlog nlog log n+occ) time, and k-error matching, for any k>2, in O(m k−1log nlog log n+occ) time.

机译：我们将重新讨论索引字符串S [1..n]的问题，以支持查找S中与给定模式P [1..m]匹配的所有子字符串，最多不超过k个错误。以前的解决方案要么需要以k为单位的大小指数索引，要么需要Ω（m k ）时间进行搜索。受DNA标引的影响，我们研究了仅占用O（n）空间的空间有效指标。对于k = 1，我们给出一个索引来支持O（m + occ + log nlog log n）时间的匹配。实现此时间复杂度的先前最佳解决方案需要O（nlog n）空间的索引。此新索引还可以用于改善k≥2错误的现有索引。其中，它可以支持在O（mlog nlog log n + occ）时间中进行2次错误匹配，并且在O（m k-1 log nlog中对任意k> 2进行k错误匹配。 log n + occ）时间。

著录项

来源
《Algorithmica》 |2010年第2期|p.263-281|共19页
作者
Ho-Leung Chan; Tak-Wah Lam; Wing-Kin Sung; Siu-Lung Tam; Swee-Seong Wong;
展开▼
作者单位

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Compressed index; Approximate string matching;

机译：压缩指数;近似字符串匹配;

相似文献

外文文献
中文文献
专利

1. Approximate String Matching with Compressed Indexes [J] . Arlindo L. Oliveira, Gonzalo Navarro, Lu#xED, Algorithms . 2009,第3期

机译：具有压缩索引的近似字符串匹配
2. Improved approximate string matching and regular expression matching on Ziv-Lempel compressed texts [J] . Bille P., Fagerberg R., G?rtz I.L. ACM transactions on algorithms . 2010,第1期

机译：在Ziv-Lempel压缩文本上改进了近似字符串匹配和正则表达式匹配
3. New algorithms for fixed-length approximate string matching and approximate circular string matching under the Hamming distance [J] . Ho ThienLuan, Oh Seung-Rohk, Kim HyunJin Journal of supercomputing . 2018,第5期

机译：海明距离下定长近似字符串匹配和近似圆字符串匹配的新算法
4. Approximate String Matching with Lempel-Ziv Compressed Indexes [C] . Luis M.S. Russo, Gonzalo Navarro, Arlindo L. Oliveira International Conference on String Processing and Information Retrieval(SPIRE 2007); 20071029-31; Santiago(CL) . 2007

机译：Lempel-Ziv压缩索引的近似字符串匹配
5. CONTRIBUTIONS TO APPROXIMATE COMPUTATION OF POWER GENERATING SYSTEM RELIABILITY INDEXES (ENERGY, POWER ENGINEERING, ELECTRIC SYSTEM). [D] . YIN, CHI KANG. 1986

机译：近似计算发电系统可靠性指标（能源，电力工程，电力系统）的贡献。
6. libFLASM: a software library for fixed-length approximate string matching [O] . Lorraine A. K. Ayad, Solon P. Pissis, Ahmad Retha 2016

机译：libFLASM：用于固定长度的近似字符串匹配的软件库
7. Approximate String Matching with Compressed Indexes [O] . Pedro Morales, Arlindo L. Oliveira, Luís M. S. Russo, 2009

机译：与压缩索引的近似字符串匹配

Compressed Indexes for Approximate String Matching

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅