Metric Indexes for Approximate String Matching in a Dictionary

Abstract

We consider the problem of finding all approximate occurrences of a given string q, with at most k differences, in a finite database or dictionary of strings. The strings can be, for example, natural language words, such as the vocabulary of a document or set of documents. This problem has many important applications in both off-line (indexed) and on-line string matching. More precisely, we have a universe U of strings and a non-negative distance function d: U × U → N. The distance function is a metric if it satisfies (i) d(x, y) = 0 ⇔ x = y; (ii) d(x, y) = d(y, x); (iii) d(x, y) ≤ d(x, z) + d(z, y). The last property is called the triangle inequality and is the most important one in our case. Many useful distance functions are known to be metrics; in particular, edit (Levenshtein) distance, which we use for d, is a metric.
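To make the role of the triangle inequality concrete, the following is a minimal sketch of how a precomputed distance to one reference string (a "pivot") can rule out dictionary words without computing their edit distance to q. It is an illustration under stated assumptions, not the index structure of the paper: the function names `edit_distance`, `build_pivot_table`, and `range_query`, the single-pivot scheme, and the sample word list are all hypothetical.

```python
from typing import Dict, List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via the standard two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                 # delete ca
                           cur[j - 1] + 1,              # insert cb
                           prev[j - 1] + (ca != cb)))   # substitute or match
        prev = cur
    return prev[-1]


def build_pivot_table(words: List[str], pivot: str) -> Dict[str, int]:
    """Precompute d(pivot, w) for every dictionary word w (hypothetical helper)."""
    return {w: edit_distance(pivot, w) for w in words}


def range_query(q: str, k: int, pivot: str, table: Dict[str, int]) -> List[str]:
    """Return all words w with d(q, w) <= k, skipping words the pivot rules out."""
    dq = edit_distance(q, pivot)
    hits = []
    for w, dw in table.items():
        # Triangle inequality gives |d(q, pivot) - d(pivot, w)| <= d(q, w),
        # so a gap larger than k proves that w cannot be within k of q.
        if abs(dq - dw) > k:
            continue
        if edit_distance(q, w) <= k:
            hits.append(w)
    return hits


if __name__ == "__main__":
    words = ["metric", "matrix", "index", "string", "strong", "dictionary"]
    table = build_pivot_table(words, pivot="metric")
    print(range_query("strinq", 1, "metric", table))
```

The filter is only sound because d is a metric: with a distance that violates property (iii), the pruning step could discard true matches.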


Bibliographic details

Source: International Conference on String Processing and Information Retrieval, 2004, 2 pages
Author: Kimmo Fredriksson
Original format: PDF
CLC classification: Data backup and recovery

Similar literature

1. Compressed Indexes for Approximate String Matching [J]. Ho-Leung Chan, Tak-Wah Lam, Wing-Kin Sung. Algorithmica. 2010, No. 2
2. Approximate String Matching with Compressed Indexes [J]. Arlindo L. Oliveira, Gonzalo Navarro, Luí. Algorithms. 2009, No. 3
3. New algorithms for fixed-length approximate string matching and approximate circular string matching under the Hamming distance [J]. Ho ThienLuan, Oh Seung-Rohk, Kim HyunJin. Journal of Supercomputing. 2018, No. 5
4. Metric Indexes for Approximate String Matching in a Dictionary [C]. Kimmo Fredriksson. International Conference on String Processing and Information Retrieval (SPIRE 2004); 5-8 October 2004; Padova (IT). 2004
5. CONTRIBUTIONS TO APPROXIMATE COMPUTATION OF POWER GENERATING SYSTEM RELIABILITY INDEXES (ENERGY, POWER ENGINEERING, ELECTRIC SYSTEM) [D]. YIN, CHI KANG. 1986
6. libFLASM: a software library for fixed-length approximate string matching [O]. Lorraine A. K. Ayad, Solon P. Pissis, Ahmad Retha. 2016
7. Approximate String Matching with Compressed Indexes [O]. Pedro Morales, Arlindo L. Oliveira, Luís M. S. Russo. 2009
