【24h】

Breaking out of the MisMatch trap

机译:脱离了不匹配的陷阱

获取原文

摘要

When users issue a query to a database, they have expectations about the results. If what they search for is unavailable in the database, the system will return an empty result or, worse, erroneous mismatch results.We call this problem the MisMatch Problem. In this paper, we solve the MisMatch problem in the context of XML keyword search. Our solution is based on two novel concepts that we introduce: Target Node Type and Distinguishability. Using these concepts, we develop a low-cost post-processing algorithm on the results of query evaluation to detect the MisMatch problem and generate helpful suggestions to users. Our approach has three noteworthy features: (1) for queries with the MisMatch problem, it generates the explanation, suggested queries and their sample results as the output to users, helping users judge whether the MisMatch problem is solved without reading all query results; (2) it is portable as it can work with any LCA-based matching semantics and is orthogonal to the choice of result retrieval method adopted; (3) it is lightweight in the way that it occupies a very small proportion of the whole query evaluation time. Extensive experiments on three real datasets verify the effectiveness, efficiency and scalability of our approach. A search engine called XClear has been built and is available at http://xclear.comp.nus.edu.sg.
机译:当用户向数据库发出查询时,他们对结果有期望。如果他们搜索的是在数据库中不可用,系统将返回一个空结果,或者更差,错误的不匹配结果。我们称之为不匹配问题。在本文中,我们在XML关键字搜索的上下文中解决了不匹配问题。我们的解决方案基于我们介绍的两种新颖概念:目标节点类型和可区分性。使用这些概念,我们在查询评估结果中开发出低成本的后处理算法,以检测不匹配问题并为用户生成有用的建议。我们的方法有三个值得注意的功能:(1)对于具有不匹配问题的查询,它会产生解释,建议的查询和他们的样本结果作为向用户的输出,帮助用户判断是否解决了不读数的不匹配问题而不读取所有查询结果; (2)它可以与任何基于LCA的匹配语义合作,并且与所采用的结果检索方法正交; (3)它是轻量级的,即它占据整个查询评估时间的非常小的比例。三个真实数据集的广泛实验验证了我们方法的有效性,效率和可扩展性。已建立一个名为Xcart的搜索引擎,可在http://xclear.comp.nus.edu.sg中获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号