首页> 外文会议>Workshop on Algorithm Engineering and Experiments >Top-k Substring Matching for Auto-Completion
【24h】

Top-k Substring Matching for Auto-Completion

机译:用于自动完成的Top-K子字符串匹配

获取原文

摘要

Given the user's input as a query, auto-completion selects the top-k strings with the highest scores from the strings matching the query in a dictionary. A recent study [14] proposed space-efficient data structures for top-k prefix matching for auto-completion. In practical applications, however, top-k substring matching is required for many purposes. In this paper, we present a novel approach to solve the top-k substring matching problem. We combined two trie-based data structures derived from the same dictionary for prefix and key search, and we search them alternately leveraging the implicit tree structure shared by them. Experimental results show that our algorithm can suggest top-k completion sufficiently fast, while taking much less space than a compressed full-text index of the dictionary.
机译:鉴于用户的输入作为查询,自动完成从字符串中选择具有最高分数的Top-k字符串,匹配字典中的查询。最近的一项研究[14]提出了用于自动完成的Top-K前缀的空节空节空节数据结构。然而,在实际应用中,许多目的需要Top-K子串匹配。在本文中,我们提出了一种解决顶级匹配问题的新方法。我们组合了两个基于TRIE的数据结构,用于前缀和键搜索的相同字典,并且我们在交替地利用它们共享的隐式树结构中搜索它们。实验结果表明,我们的算法可以提出足够快的Top-K完成,同时比字典的压缩全文索引取得更少的空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号