Top-k Substring Matching for Auto-Completion

Abstract

Given a user's input as a query, auto-completion selects, from the dictionary strings that match the query, the top-k strings with the highest scores. A recent study [14] proposed space-efficient data structures for top-k prefix matching for auto-completion. In practical applications, however, top-k substring matching is required for many purposes. In this paper, we present a novel approach to the top-k substring matching problem. We combine two trie-based data structures, derived from the same dictionary, for prefix and key search, and we search them alternately, leveraging the implicit tree structure they share. Experimental results show that our algorithm can suggest top-k completions sufficiently fast while taking much less space than a compressed full-text index of the dictionary.
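
To make the problem statement concrete, the following is a minimal brute-force sketch of top-k prefix matching versus top-k substring matching. It is not the trie-based method of the paper: it scans the whole dictionary on every query and serves only as a reference for what a correct answer looks like. The function name `top_k_matches`, the `mode` parameter, and the sample dictionary are hypothetical and chosen for illustration.

```python
import heapq


def top_k_matches(dictionary, query, k, mode="substring"):
    """Brute-force baseline (an assumption for illustration, not the paper's algorithm).

    dictionary maps each string to its score; the result is a list of
    (string, score) pairs sorted by descending score.
    """
    if mode == "prefix":
        def matches(s):
            return s.startswith(query)
    else:
        def matches(s):
            return query in s

    # Put the score first so that heapq.nlargest ranks candidates by score.
    candidates = ((score, s) for s, score in dictionary.items() if matches(s))
    best = heapq.nlargest(k, candidates)
    return [(s, score) for score, s in best]


# Hypothetical dictionary of completion strings with scores.
dictionary = {
    "new york": 90,
    "york": 40,
    "newark": 60,
    "yorkshire": 55,
    "old york road": 10,
}

print(top_k_matches(dictionary, "york", 2, mode="prefix"))
# [('yorkshire', 55), ('york', 40)]
print(top_k_matches(dictionary, "york", 2, mode="substring"))
# [('new york', 90), ('yorkshire', 55)]
```

With prefix matching, the highest-scoring entry "new york" is never suggested for the query "york", which is the kind of case that motivates substring matching. The linear scan costs time proportional to the total dictionary size per query, which is the cost an index-based approach aims to avoid.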

Bibliographic details

Source
Workshop on Algorithm Engineering and Experiments | 2014 | 165 p. | 8 pages
Author
Yuzuru Okajima
Original format: PDF
Chinese Library Classification: TP301.6-53

Similar literature

1. Generalized pattern matching and periodicity under substring consistent equivalence relations [J]. Matsuoka Yoshiaki, Aoki Takahiro, Inenaga Shunsuke. Theoretical computer science. 2016, issue Pta2
2. Robust and Reverse-Engineering Resilient PUF Authentication and Key-Exchange by Substring Matching [J]. Rostami Mohamad, Majzoobi Mehrdad, Koushanfar Farinaz. Emerging Topics in Computing, IEEE Transactions on. 2014, issue 1
3. Bit-Parallel Algorithms for Finding All Substrings Matching a Regular Expression [J]. Hiroaki YAMAMOTO, Takashi MIYAZAKI. 電子情報通信学会技術研究報告. 2012, issue 199
4. Top-k Substring Matching for Auto-Completion [C]. Yuzuru Okajima. Workshop on Algorithm Engineering and Experiments. 2014
5. Discovering motifs in DNA and protein sequences: The approximate common substring problem [D]. Bailey, Timothy Lawrence. 1995
6. A New Algorithm for Fast All-Against-All Substring Matching [O]. Marina Barsky, Ulrike Stege, Alex Thomo
7. Top-k String Auto-Completion with Synonyms [O]. Xu, Pengfei, Lu, Jiaheng. 2016
