首页> 外文期刊>Information Processing & Management >A machine learning approach for result caching in web search engines
【24h】

A machine learning approach for result caching in web search engines

机译:一种用于Web搜索引擎中结果缓存的机器学习方法

获取原文
获取原文并翻译 | 示例
           

摘要

A commonly used technique for improving search engine performance is result caching. In result caching, precomputed results (e.g., URLs and snippets of best matching pages) of certain queries are stored in a fast-access storage. The future occurrences of a query whose results are already stored in the cache can be directly served by the result cache, eliminating the need to process the query using costly computing resources. Although other performance metrics are possible, the main performance metric for evaluating the success of a result cache is hit rate. In this work, we present a machine learning approach to improve the hit rate of a result cache by facilitating a large number of features extracted from search engine query logs. We then apply the proposed machine learning approach to static, dynamic, and static-dynamic caching. Compared to the previous methods in the literature, the proposed approach improves the hit rate of the result cache up to 0.66%, which corresponds to 9.60% of the potential room for improvement.
机译:一种提高搜索引擎性能的常用技术是结果缓存。在结果缓存中,某些查询的预先计算的结果(例如,URL和最佳匹配页面的片段)存储在快速访问存储器中。其结果已存储在高速缓存中的查询的将来出现可以由结果高速缓存直接提供服务,从而无需使用昂贵的计算资源来处理查询。尽管其他性能指标也是可行的,但是评估结果缓存是否成功的主要性能指标是命中率。在这项工作中,我们提出一种机器学习方法,通过促进从搜索引擎查询日志中提取的大量功能来提高结果缓存的命中率。然后,我们将提出的机器学习方法应用于静态,动态和静态-动态缓存。与文献中的先前方法相比,所提出的方法将结果缓存的命中率提高了0.66%,相当于潜在改进空间的9.60%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号