【24h】

Exploiting the Use of Similar Past Search Results Through a Dynamic Cache

机译:通过动态缓存利用过去相似的搜索结果

获取原文

摘要

Search engines are prone to the computation of redundant queries. Without any optimization this can be a costly procedure; limited processing power is used to compute new responses for queries that have already been answered. Despite the latter, little work exists on leveraging similar past queries to reduce the engine's workload while improving its precision. This paper describes a dynamic cache based on an on-line algorithm, which takes advantage of similar past queries to reduce the engine's response times, while simultaneously improving precision. Roughly, the algorithm uses a priority queue to build a dynamic index of past queries, alongside their responses, where each query's priority is linked to its overall popularity (the top-K queries along with their results): similar queries that have appeared multiple times are indexed with a high priority value. This ensures that their results will be easily recoverable from the queue without recomputation. Notably, the algorithm also improves response times for queries that only share a subset of their search terms. Several experiments were performed over a dataset of simulated queries and documents. Empirical results show that the proposed dynamic cache improves on both precision and response times.
机译:搜索引擎易于计算冗余查询。如果没有任何优化,这可能是一个昂贵的过程;有限的处理能力用于为已经回答的查询计算新的响应。尽管有后者,但在利用类似的过去查询来减少引擎的工作量并提高引擎精度方面,几乎没有任何工作。本文介绍了一种基于在线算法的动态缓存,该缓存利用了过去的类似查询来减少引擎的响应时间,同时提高了精度。粗略地讲,该算法使用优先级队列来构建过去查询及其响应的动态索引,其中每个查询的优先级都与其整体知名度(前K个查询及其结果)相关联:多次出现的类似查询索引具有较高的优先级值。这样可确保无需重新计算即可轻松从队列中恢复其结果。值得注意的是,该算法还缩短了仅共享搜索项子集的查询的响应时间。在模拟查询和文档的数据集上进行了几次实验。实验结果表明,所提出的动态缓存在精度和响应时间上均得到了改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号