首页> 外文会议>2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) >Adapting cache partitioning algorithms to pseudo-LRU replacement policies
【24h】

Adapting cache partitioning algorithms to pseudo-LRU replacement policies

机译:使高速缓存分区算法适应伪LRU替换策略

获取原文
获取原文并翻译 | 示例

摘要

Recent studies have shown that cache partitioning is an efficient technique to improve throughput, fairness and Quality of Service (QoS) in CMP processors. The cache partitioning algorithms proposed so far assume Least Recently Used (LRU) as the underlying replacement policy. However, it has been shown that the true LRU imposes extraordinary complexity and area overheads when implemented on high associativity caches, such as last level caches. As a consequence, current processors available on the market use pseudo-LRU replacement policies, which provide similar behavior as LRU, while reducing the hardware complexity. Thus, the presented so far LRU-based cache partitioning solutions cannot be applied to real CMP architectures. This paper proposes a complete partitioning system for caches using the pseudo-LRU replacement policy. In particular, the paper focuses on the pseudo-LRU implementations proposed by Sun Microsystems and IBM, called Not Recently Used (NRU) and Binary Tree (BT), respectively. We propose a high accuracy profiling logic and a cache partitioning hardware for both schemes. We evaluate our proposals' hardware costs in terms of area and power, and compare them against the LRU partitioning algorithm. Overall, this paper presents two hardware techniques to adapt the existing cache partitioning algorithms to real replacement policies. The results show that our solutions impose negligible performance degradation with respect to the LRU.
机译:最近的研究表明,缓存分区是一种提高CMP处理器的吞吐量,公平性和服务质量(QoS)的有效技术。到目前为止,提出的缓存分区算法假定最近最少使用(LRU)作为基础替换策略。但是,已经表明,当在高关联性缓存(例如末级缓存)上实现时,真正的LRU会带来非凡的复杂性和区域开销。结果,市场上可用的当前处理器使用伪LRU替换策略,该策略提供与LRU类似的行为,同时降低了硬件复杂性。因此,目前为止提出的基于LRU的缓存分区解决方案不能应用于实际的CMP体系结构。本文提出了一种使用伪LRU替换策略的完整的缓存分区系统。特别是,本文重点介绍了Sun Microsystems和IBM提出的伪LRU实现,分别称为“最近不使用(NRU)”和“二进制树(BT)”。对于这两种方案,我们提出了一种高精度的分析逻辑和一个缓存分区硬件。我们根据面积和功耗评估提案的硬件成本,并将其与LRU分区算法进行比较。总体而言,本文提出了两种硬件技术,以使现有的缓存分区算法适应实际的替换策略。结果表明,相对于LRU,我们的解决方案对性能的影响可忽略不计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号