Improving cache performance by combining cost-sensitivity and locality principles in cache replacement algorithms

Abstract

Because the performance gap between the processor and main memory keeps widening, it is crucial to bridge that gap with an efficient memory hierarchy that reduces the average memory access time. The cache replacement algorithm plays a central role in such a hierarchy. Many recent studies of cache replacement have focused on improving L2 replacement algorithms by minimizing the miss count. However, depending on the dependency chain, cache miss bursts, and other factors, a processor's ability to partially hide the cost of an L2 cache miss varies; that is, cache miss costs are not uniform. A better solution would therefore also account for the aggregate miss cost when designing cache replacement algorithms. Our proposed solution combines the two principles of locality and cost-sensitivity into a single algorithm, which we call LACS (Locality-Aware Cost-Sensitive cache replacement algorithm). LACS estimates a cache block's cost from the number of instructions the processor manages to issue during a cache miss on that block, and then victimizes cache blocks with low cost and poor locality in order to maximize overall cache performance. When evaluated on a uniprocessor architecture model, LACS speeds up 10 L2 cache performance-constrained SPEC CPU2000 benchmarks by up to 85% and by 15% on average, while not slowing down any of the 20 SPEC CPU2000 benchmarks evaluated. When evaluated on a dual-core CMP architecture model, LACS speeds up 6 SPEC CPU2000 benchmark pairs by up to 44% and by 11% on average.
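
The victim-selection idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract gives no thresholds, counters, or hardware details, so the names `Block`, `pick_victim`, `issue_threshold`, and `idle_threshold` are all hypothetical. The sketch also assumes that a miss during which many instructions were issued had most of its latency hidden, and so counts as low cost, and it uses time since last access as a stand-in for locality.

```python
from dataclasses import dataclass


@dataclass
class Block:
    tag: int
    issued_during_miss: int  # instructions issued while this block's miss was outstanding
    last_use: int            # logical time of the most recent access (locality proxy)


def pick_victim(blocks, now, issue_threshold=16, idle_threshold=4):
    """Return the index of the block to evict from one cache set.

    Assumption (not from the paper): many instructions issued during a
    miss means the latency was mostly hidden, so the block is cheap to
    miss on again. Both thresholds are illustrative values.
    """
    def is_low_cost(b):
        return b.issued_during_miss >= issue_threshold

    def has_poor_locality(b):
        return now - b.last_use >= idle_threshold

    # Prefer blocks that are both cheap to re-fetch and not recently used.
    preferred = [i for i, b in enumerate(blocks)
                 if is_low_cost(b) and has_poor_locality(b)]
    # Fall back to plain LRU over the whole set if no block qualifies.
    pool = preferred or range(len(blocks))
    return min(pool, key=lambda i: blocks[i].last_use)
```

In a set where the least recently used block had a costly miss (few instructions issued), this sketch skips it and evicts the oldest block whose miss was largely hidden, whereas plain LRU would evict the costly block.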

Bibliographic details

Source
IEEE International Conference on Computer Design | 2010 | 8 pages
Authors
Sheikh Rami; Kharbutli Mazen
Original format PDF
Chinese Library Classification TP3-53
