EDS: An Efficient Data Selection policy for search engine storage architectures

Abstract

Caching is an effective optimization in search engine storage architectures, and many caching algorithms have been proposed to improve retrieval performance. The data selection policy plays an important role in search engine cache management: it decides which data to place in memory or in other storage, such as solid state disks (SSDs). Because the historical query log is a useful guide to future queries, we present an Efficient Data Selection (EDS) policy for search engine cache management that views the cache medium as a knapsack and views results and posting lists as items. The best benefit of EDS can be computed by greedy algorithms. We carry out a series of experiments to study the essential factors of data selection in different architectures, including hard disk drive (HDD), SSD, and SSD-based hybrid storage architectures. The hybrid storage architecture is a two-level cache architecture that uses the SSD as a secondary cache for memory. Our main goal is to improve search engine performance and reduce server cost on the two-level cache architecture. The experimental results demonstrate that the proposed policy improves the hit ratio by 20.04% and improves retrieval performance on the HDD, SSD, and hybrid architectures by 31.98%, 28.72%, and 23.24%, respectively.
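
The knapsack view described in the abstract can be illustrated with a short sketch. The abstract does not give the benefit function or the exact selection procedure, so everything below is an assumption for illustration: the `CacheItem` type, its `benefit` field (imagined here as time saved, estimated from a historical query log), and the `greedy_select` function are hypothetical names, and the benefit-per-byte ordering is the textbook greedy heuristic for the knapsack problem rather than the authors' published algorithm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CacheItem:
    key: str        # query string (for a result) or term (for a posting list)
    kind: str       # "result" or "posting_list"
    size: int       # bytes the item would occupy in the cache
    benefit: float  # ASSUMPTION: estimated saving derived from a query log


def greedy_select(items, capacity):
    """Fill a cache of `capacity` bytes, taking items in descending
    order of benefit per byte and skipping any item that no longer fits."""
    # Ignore items with a non-positive size so the density is well defined.
    candidates = [it for it in items if it.size > 0]
    ranked = sorted(candidates, key=lambda it: it.benefit / it.size, reverse=True)

    chosen = []
    remaining = capacity
    for item in ranked:
        if item.size <= remaining:
            chosen.append(item)
            remaining -= item.size
    return chosen
```

Calling `greedy_select(items, capacity)` with a mixed list of results and posting lists returns the subset to cache. For the 0/1 knapsack problem this heuristic is fast but not guaranteed to be optimal, which is one reason to treat it as a sketch only.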

Bibliographic record

  • Source
    Future Generation Computer Systems | 2017, Issue 9 | pp. 220-231 | 12 pages
  • Author affiliations

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China;

    School of Software Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China;

    Department of Computer Science, Pace University, New York, NY 10038, USA;

    Department of Computer Science, State University of New York, New York, NY 12561, USA;

  • Indexed in: Science Citation Index (SCI); Engineering Index (EI)
  • Original format: PDF
  • Language: English
  • Chinese Library Classification
  • Keywords

    Search engine; Data selection; Solid state disk; Hybrid storage architecture; Cache;

