Fast Algorithms for Top-k Approximate String Matching

Abstract

Top-k approximate querying on string collections is an important data analysis tool for many applications and has been studied extensively. However, the scale of the problem has increased dramatically with the prevalence of the Web. In this paper, we study how to answer top-k similar string queries efficiently. We introduce several strategies, such as length-aware and adaptive q-gram selection, present a general q-gram-based framework, and propose two efficient algorithms built on these strategies. We evaluate our techniques experimentally on three real data sets, where they show superior performance.
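To make the q-gram idea concrete, here is a minimal Python sketch of top-k matching under edit distance. It is not the paper's two algorithms, which the abstract does not specify. It assumes the textbook length filter and q-gram count filter, with a threshold that grows until k answers are found. The names `qgrams`, `edit_distance`, and `top_k`, and the default `q=2`, are hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def qgrams(s, q):
    """Multiset of the overlapping substrings of length q in s."""
    return Counter(s[i:i + q] for i in range(len(s) - q + 1))


def edit_distance(a, b):
    """Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute
        prev = cur
    return prev[-1]


def top_k(query, strings, k, q=2):
    """Return the k (distance, string) pairs closest to query."""
    # Inverted index: q-gram -> {string id: occurrences in that string}.
    index = defaultdict(dict)
    for sid, s in enumerate(strings):
        for gram, count in qgrams(s, q).items():
            index[gram][sid] = count

    # Number of q-grams each string shares with the query (multiset overlap).
    overlap = Counter()
    for gram, count in qgrams(query, q).items():
        for sid, other in index.get(gram, {}).items():
            overlap[sid] += min(count, other)

    k = min(k, len(strings))
    dist = {}  # verified edit distances, keyed by string id
    tau = 0    # current edit-distance threshold
    while True:
        for sid, s in enumerate(strings):
            # Length filter: distance <= tau implies length gap <= tau.
            if sid in dist or abs(len(s) - len(query)) > tau:
                continue
            # Count filter: each edit destroys at most q q-grams.
            bound = max(len(s), len(query)) - q + 1 - q * tau
            if overlap[sid] < bound:
                continue
            dist[sid] = edit_distance(query, s)
        hits = sorted((d, strings[sid]) for sid, d in dist.items() if d <= tau)
        if len(hits) >= k:
            return hits[:k]
        tau += 1


words = ["kitten", "sitting", "mitten", "bitten", "kitchen", "written"]
print(top_k("kitten", words, 3))
# [(0, 'kitten'), (1, 'bitten'), (1, 'mitten')]
```

Both filters are sound, so every string within distance tau is verified before the loop stops. Ties are broken alphabetically, which is an arbitrary choice in this sketch.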

Bibliographic record

Source: Innovative Applications of Artificial Intelligence Conference; AAAI Conference on Artificial Intelligence; IAAI-10; Symposium on Educational Advances in Artificial Intelligence; AAAI-10; EAAI-10 | 2011 | pp. 1467-1473 | 7 pages
Authors: Zhenglu Yang; Jianjun Yu; Masaru Kitsuregawa
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
