Conference: International World Wide Web Conference; WWW 09

Improved Techniques for Result Caching in Web Search Engines

Abstract

Query processing is a major cost factor in operating large web search engines. In this paper, we study query result caching, one of the main techniques used to optimize query processing performance. Our first contribution is a study of result caching as a weighted caching problem. Most previous work has focused on optimizing cache hit ratios, but given that the processing costs of queries can vary significantly, we argue that total cost savings also need to be considered. We describe and evaluate several algorithms for weighted result caching, and study the impact of Zipf-based query distributions on result caching. Our second and main contribution is a new set of feature-based cache eviction policies that achieve significant improvements over all previous methods, substantially narrowing the existing performance gap to the theoretically optimal (clairvoyant) method. Finally, using the same approach, we also obtain performance gains for the related problem of inverted list caching.
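To make the distinction between hit ratio and total cost savings concrete, the following is a minimal sketch rather than the paper's own method. It compares plain LRU with GreedyDual, a well-known cost-aware eviction policy for equal-sized items; the abstract does not say which weighted algorithms the paper evaluates, so the choice of GreedyDual, the function names, and the toy query costs are all assumptions made for illustration.

```python
from collections import OrderedDict


def run_lru(queries, cost, capacity):
    """Replay a query trace through an LRU result cache.

    Returns (hits, saved_cost), where saved_cost sums the processing
    cost of every query answered from the cache.
    """
    cache = OrderedDict()
    hits, saved = 0, 0.0
    for q in queries:
        if q in cache:
            cache.move_to_end(q)  # mark as most recently used
            hits += 1
            saved += cost[q]
        else:
            if len(cache) >= capacity:
                cache.popitem(last=False)  # evict least recently used
            cache[q] = True
    return hits, saved


def run_greedy_dual(queries, cost, capacity):
    """Replay the same trace through a GreedyDual cache (illustrative choice).

    Each cached query holds a credit equal to the current inflation value
    plus its processing cost. The lowest-credit entry is evicted, and its
    credit becomes the new inflation value, so cheap or stale entries age out.
    """
    credit = {}
    inflation = 0.0
    hits, saved = 0, 0.0
    for q in queries:
        if q in credit:
            hits += 1
            saved += cost[q]
        elif len(credit) >= capacity:
            victim = min(credit, key=credit.get)  # linear scan, fine for a sketch
            inflation = credit.pop(victim)
        credit[q] = inflation + cost[q]  # set on insert, refresh on hit
    return hits, saved


if __name__ == "__main__":
    # Hypothetical trace: query "a" is ten times as expensive as "b" and "c".
    cost = {"a": 10, "b": 1, "c": 1}
    trace = ["a", "b", "c", "a", "b", "c", "a"]
    print("LRU:", run_lru(trace, cost, capacity=2))
    print("GreedyDual:", run_greedy_dual(trace, cost, capacity=2))
```

On this toy trace with room for two results, LRU never scores a hit because the cyclic access pattern always evicts the entry needed next, while the cost-aware policy keeps the expensive query resident and serves both of its repeats from the cache. A production cache would replace the linear scan for the eviction victim with a priority queue.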