Adapting cache partitioning algorithms to pseudo-LRU replacement policies

机译：使高速缓存分区算法适应伪LRU替换策略

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Recent studies have shown that cache partitioning is an efficient technique to improve throughput, fairness and Quality of Service (QoS) in CMP processors. The cache partitioning algorithms proposed so far assume Least Recently Used (LRU) as the underlying replacement policy. However, it has been shown that the true LRU imposes extraordinary complexity and area overheads when implemented on high associativity caches, such as last level caches. As a consequence, current processors available on the market use pseudo-LRU replacement policies, which provide similar behavior as LRU, while reducing the hardware complexity. Thus, the presented so far LRU-based cache partitioning solutions cannot be applied to real CMP architectures. This paper proposes a complete partitioning system for caches using the pseudo-LRU replacement policy. In particular, the paper focuses on the pseudo-LRU implementations proposed by Sun Microsystems and IBM, called Not Recently Used (NRU) and Binary Tree (BT), respectively. We propose a high accuracy profiling logic and a cache partitioning hardware for both schemes. We evaluate our proposals' hardware costs in terms of area and power, and compare them against the LRU partitioning algorithm. Overall, this paper presents two hardware techniques to adapt the existing cache partitioning algorithms to real replacement policies. The results show that our solutions impose negligible performance degradation with respect to the LRU.

机译：最近的研究表明，缓存分区是一种提高CMP处理器的吞吐量，公平性和服务质量（QoS）的有效技术。到目前为止，提出的缓存分区算法假定最近最少使用（LRU）作为基础替换策略。但是，已经表明，当在高关联性缓存（例如末级缓存）上实现时，真正的LRU会带来非凡的复杂性和区域开销。结果，市场上可用的当前处理器使用伪LRU替换策略，该策略提供与LRU类似的行为，同时降低了硬件复杂性。因此，目前为止提出的基于LRU的缓存分区解决方案不能应用于实际的CMP体系结构。本文提出了一种使用伪LRU替换策略的完整的缓存分区系统。特别是，本文重点介绍了Sun Microsystems和IBM提出的伪LRU实现，分别称为“最近不使用（NRU）”和“二进制树（BT）”。对于这两种方案，我们提出了一种高精度的分析逻辑和一个缓存分区硬件。我们根据面积和功耗评估提案的硬件成本，并将其与LRU分区算法进行比较。总体而言，本文提出了两种硬件技术，以使现有的缓存分区算法适应实际的替换策略。结果表明，相对于LRU，我们的解决方案对性能的影响可忽略不计。

著录项

来源
《2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS)》|2010年|p.1-12|共12页
会议地点 Atlanta GA(US)
作者
Kedzierski K.; Moreto M.; Cazorla F.J.; Valero M.;
展开▼
作者单位

Barcelona Supercomput. Center (BSC), Tech. Univ. of Catalonia (UPC), Barcelona, Spain;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.133;
关键词
CMP; Pseudo-LRU; Shared last level cache;

机译：CMP;伪LRU;共享的最后一级缓存;

相似文献

外文文献
中文文献
专利

1. Improving Cache Partitioning Algorithms for Pseudo-LRU Policies [J] . Xi ZHANG, Chuanyi LIU, Zhenyu LIU, IEICE transactions on information and systems . 2013,第12期

机译：改进用于伪LRU策略的缓存分区算法
2. Improving Cache Partitioning Algorithms for Pseudo-LRU Policies [J] . Xi ZHANG, Chuanyi LIU, Zhenyu LIU, IEICE Transactions on Information and Systems . 2013,第12期

机译：改进用于伪LRU策略的缓存分区算法
3. ENHANCEMENT OF CLOUD PERFORMANCE AND STORAGE CONSUMPTION USING ADAPTIVE REPLACEMENT CACHE AND PROBABILISTIC CONTENT PLACEMENT ALGORITHMS [J] . AHMED SALIH MAHDI, RRAVIE CHANDREN MUNIYANDIH Journal of Theoretical and Applied Information Technology . 2016,第3期

机译：使用自适应替换缓存和概率内容定位算法增强云性能和存储消耗
4. Adapting cache partitioning algorithms to pseudo-LRU replacement policies [C] . Kedzierski Kamil, Moreto Miquel, Cazorla Francisco J., 2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) . 2010

机译：使高速缓存分区算法适应伪LRU替换策略
5. Set-associative history-aided adaptive replacement for on-chip caches. [D] . Simons, Brad. 2016

机译：集关联历史辅助片上缓存的自适应替换。
6. A surrogate-primary replacement algorithm for response-adaptive randomization in stroke clinical trials [O] . Amy S Nowacki, Wenle Zhao, Yuko Y Palesch -1

机译：脑卒中临床试验中适应性随机反应的替代主要替代算法
7. Adapting cache partitioning algorithms to pseudo-LRU replacement policies [O] . Kedzierski, Kamil, Moreto Planas, Miquel, Cazorla, Francisco, 2010

机译：使缓存分区算法适应伪LRU替换策略

Adapting cache partitioning algorithms to pseudo-LRU replacement policies

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅