首页> 外文会议>IEEE International Symposium on Parallel Distributed Processing >Adapting cache partitioning algorithms to pseudo-LRU replacement policies
【24h】

Adapting cache partitioning algorithms to pseudo-LRU replacement policies

机译:将缓存分区算法调整为伪LRU替换策略

获取原文

摘要

Recent studies have shown that cache partitioning is an efficient technique to improve throughput, fairness and Quality of Service (QoS) in CMP processors. The cache partitioning algorithms proposed so far assume Least Recently Used (LRU) as the underlying replacement policy. However, it has been shown that the true LRU imposes extraordinary complexity and area overheads when implemented on high associativity caches, such as last level caches. As a consequence, current processors available on the market use pseudo-LRU replacement policies, which provide similar behavior as LRU, while reducing the hardware complexity. Thus, the presented so far LRU-based cache partitioning solutions cannot be applied to real CMP architectures. This paper proposes a complete partitioning system for caches using the pseudo-LRU replacement policy. In particular, the paper focuses on the pseudo-LRU implementations proposed by Sun Microsystems and IBM, called Not Recently Used (NRU) and Binary Tree (BT), respectively. We propose a high accuracy profiling logic and a cache partitioning hardware for both schemes. We evaluate our proposals' hardware costs in terms of area and power, and compare them against the LRU partitioning algorithm. Overall, this paper presents two hardware techniques to adapt the existing cache partitioning algorithms to real replacement policies. The results show that our solutions impose negligible performance degradation with respect to the LRU.
机译:最近的研究表明,缓存分区是一种有效的技术,可以提高CMP处理器中的吞吐量,公平和服务质量(QoS)。到目前为止,所提出的高速缓存分区算法是占据最近使用的(LRU)作为底层替换策略。然而,已经表明,当在高分子缓存中实现时,真正的LRU在高级别高速缓存时实现了非凡的复杂性和面积开销。因此,市场上可用的当前处理器使用伪LRU替换政策,它提供类似的行为作为LRU,同时降低了硬件复杂性。因此,所呈现的基于LRU的高速缓存分区解决方案不能应用于真实的CMP架构。本文提出了使用伪LRU替换策略的缓存完整的分区系统。特别是,本文侧重于Sun Microsystems和IBM提出的伪LRU实现,即不再使用(NRU)和二叉树(BT)。我们为这两种方案提出了高精度的分析逻辑和高速缓存分区硬件。我们在区域和电源方面评估我们的建议硬件成本,并将它们与LRU分区算法进行比较。总的来说,本文介绍了两个硬件技术,可使现有的高速缓存分区算法适应真实替换策略。结果表明,我们的解决方案对LRU的性能下降略有忽略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号