Journal: Knowledge and Data Engineering, IEEE Transactions on

Efficient Fuzzy Type-Ahead Search in XML Data

Abstract

In a traditional keyword-search system over XML data, a user composes a keyword query, submits it to the system, and retrieves relevant answers. When the user has limited knowledge about the data, the user often feels “left in the dark” when issuing queries and has to use a try-and-see approach to find information. In this paper, we study fuzzy type-ahead search in XML data, a new information-access paradigm in which the system searches XML data on the fly as the user types in query keywords. It allows users to explore data as they type, even in the presence of minor errors in their keywords. Our proposed method has the following features: 1) Search as you type: It extends Autocomplete by supporting queries with multiple keywords in XML data. 2) Fuzzy: It can find high-quality answers that have keywords matching query keywords approximately. 3) Efficient: Our effective index structures and searching algorithms can achieve a very high interactive speed. We study research challenges in this new search framework. We propose effective index structures and top-k algorithms to achieve a high interactive speed. We examine effective ranking functions and early termination techniques to progressively identify the top-k relevant answers. We have implemented our method on real data sets, and the experimental results show that our method achieves high search efficiency and result quality.
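
To make the "search as you type" and "fuzzy" features concrete, here is a minimal Python sketch of approximate prefix matching over multiple keywords. It is an illustration only and not the paper's method: the abstract does not describe the index structures, ranking functions, or top-k algorithms, so this sketch substitutes a plain linear scan over an in-memory keyword index, treats every typed keyword as a possibly incomplete prefix, and returns an unranked set. The names `prefix_edit_distance`, `fuzzy_type_ahead`, the `tau` threshold, and the keyword-to-element-id index layout are all assumptions introduced for this example.

```python
from typing import Dict, Optional, Set


def prefix_edit_distance(query: str, word: str) -> int:
    """Smallest edit distance between `query` and any prefix of `word`."""
    # prev[j] holds the edit distance between query[:i] and word[:j].
    prev = list(range(len(word) + 1))
    for i, qc in enumerate(query, start=1):
        cur = [i]
        for j, wc in enumerate(word, start=1):
            cost = 0 if qc == wc else 1
            cur.append(min(prev[j] + 1,          # drop a typed character
                           cur[j - 1] + 1,       # add a missing character
                           prev[j - 1] + cost))  # match or substitute
        prev = cur
    # The typed text may stop at any prefix of `word`, so take the minimum.
    return min(prev)


def fuzzy_type_ahead(index: Dict[str, Set[str]], query: str, tau: int = 1) -> Set[str]:
    """Return ids of elements that approximately match every typed keyword.

    `index` maps an indexed keyword to the ids of XML elements containing it
    (a hypothetical layout, not the paper's index). A linear scan stands in
    for the efficient index structures the abstract mentions.
    """
    result: Optional[Set[str]] = None
    for typed in query.lower().split():
        matched: Set[str] = set()
        for word, element_ids in index.items():
            if prefix_edit_distance(typed, word) <= tau:
                matched |= element_ids
        # Every typed keyword must be matched, so intersect the candidates.
        result = matched if result is None else result & matched
    return result if result is not None else set()


if __name__ == "__main__":
    # Toy index with made-up element ids.
    toy_index = {
        "database": {"paper1", "paper2"},
        "data": {"paper3"},
        "xml": {"paper1"},
        "search": {"paper2", "paper3"},
    }
    # "databse" is a misspelling of "database"; "xm" is a partial "xml".
    print(fuzzy_type_ahead(toy_index, "databse xm", tau=1))
```

Re-running `fuzzy_type_ahead` after each keystroke mimics the interactive behavior described above. A real system would need ranking and early termination to return only the top-k answers at interactive speed, which this sketch does not attempt.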
