首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Using Evolutive Summary Counters for Efficient Cooperative Caching in Search Engines
【24h】

Using Evolutive Summary Counters for Efficient Cooperative Caching in Search Engines

机译:使用渐进式摘要计数器在搜索引擎中进行有效的协作缓存

获取原文
获取原文并翻译 | 示例

摘要

We propose and analyze a distributed cooperative caching strategy based on the Evolutive Summary Counters (ESC), a new data structure that stores an approximated record of the data accesses in each computing node of a search engine. The ESC capture the frequency of accesses to the elements of a data collection, and the evolution of the access patterns for each node in a network of computers. The ESC can be efficiently summarized into what we call ESC-summaries to obtain approximate statistics of the document entries accessed by each computing node. We use the ESC-summaries to introduce two algorithms that manage our distributed caching strategy, one for the distribution of the cache contents, ESC-placement, and another one for the search of documents in the distributed cache, ESC-search. While the former improves the hit rate of the system and keeps a large ratio of data accesses local, the latter reduces the network traffic by restricting the number of nodes queried to find a document. We show that our cooperative caching approach outperforms state-of-the-art models in both hit rate, throughput, and location recall for multiple scenarios, i.e., different query distributions and systems with varying degrees of complexity.
机译:我们提出并分析了基于渐进式摘要计数器(ESC)的分布式协作式缓存策略,这是一种新的数据结构,用于存储搜索引擎每个计算节点中数据访问的近似记录。 ESC捕获对数据集合元素的访问频率,以及计算机网络中每个节点的访问模式的演变。可以将ESC有效地概括为所谓的ESC摘要,以获得每个计算节点访问的文档条目的近似统计信息。我们使用ESC摘要来介绍两种管理分布式缓存策略的算法,一种算法用于分配缓存内容(ESC放置),另一种算法用于搜索分布式缓存中的文档ESC搜索。前者可以提高系统的命中率,并保持很大比例的本地数据访问,而后者则通过限制查询文档的节点数来减少网络流量。我们展示了我们的协作式缓存方法在多种情况下(即不同的查询分布和具有不同程度的复杂度的系统)的命中率,吞吐量和位置召回率均优于最新模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号