Conference: International Conference on Web-Age Information Management

Improving the Effectiveness of Keyword Search in Databases Using Query Logs

Abstract

Using query logs to enhance user experience has been extensively studied in the Web IR literature. However, in the area of keyword search on structured data (relational databases in particular), most existing work has focused on improving search result quality through designing better scoring functions, without giving explicit consideration to query logs. Our work presented in this paper taps into the wealth of information contained in query logs, and aims to enhance the search effectiveness by explicitly taking into account the log information when ranking the query results. To concretize our discussion, we focus on schema-graph-based approaches to keyword search (using the seminal work DISCOVER as an example), which usually proceed in two stages, candidate network (CN) generation and CN evaluation. We propose a query-log-aware ranking strategy that uses the frequent patterns mined from query logs to help rank the CNs generated during the first stage. Given the frequent patterns, we show how to compute the maximal score of a CN using a dynamic programming algorithm. We prove that the problem of finding the maximal score is NP-hard. User studies on a real dataset validate the effectiveness of the proposed ranking strategy.
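
The ranking idea in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: it assumes, purely for illustration, that a CN is a simple path of relation names, that each frequent pattern is a contiguous sequence of relation names with a weight taken from its frequency in the log, and that the score of a CN is the largest total weight of non-overlapping pattern occurrences on it. Under these simplifying assumptions the maximum can be found with a short dynamic program; the general problem described in the abstract is stated to be NP-hard. The names `max_cn_score` and `rank_cns`, the example relations, and the weights are all hypothetical.

```python
from typing import Dict, List, Tuple

Pattern = Tuple[str, ...]


def max_cn_score(cn: List[str], patterns: Dict[Pattern, float]) -> float:
    """Maximum total weight of non-overlapping pattern occurrences on a
    path-shaped CN (illustrative simplification, not the paper's method)."""
    n = len(cn)
    # best[i] is the best score achievable using only the first i relations.
    best = [0.0] * (n + 1)
    for i in range(1, n + 1):
        # Option 1: leave relation cn[i - 1] uncovered by any pattern.
        best[i] = best[i - 1]
        # Option 2: end a pattern occurrence exactly at position i.
        for pattern, weight in patterns.items():
            k = len(pattern)
            if 0 < k <= i and tuple(cn[i - k:i]) == pattern:
                best[i] = max(best[i], best[i - k] + weight)
    return best[n]


def rank_cns(cns: List[List[str]], patterns: Dict[Pattern, float]) -> List[List[str]]:
    """Order candidate networks by descending log-based score."""
    return sorted(cns, key=lambda cn: max_cn_score(cn, patterns), reverse=True)


# Hypothetical frequent patterns and weights, as if mined from a query log.
example_patterns: Dict[Pattern, float] = {
    ("Author", "Writes"): 0.5,
    ("Writes", "Paper"): 0.75,
    ("Paper", "Cites", "Paper"): 0.25,
}

example_cn = ["Author", "Writes", "Paper", "Cites", "Paper"]
example_score = max_cn_score(example_cn, example_patterns)
```

In this toy setting, CNs that contain heavily used join patterns rise in the ranking before the CN evaluation stage runs. CNs that are trees rather than paths, or patterns that may overlap, would need a different formulation.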
