Exploiting set-level non-uniformity of capacity demand to enhance CMP cooperative caching


Abstract

As the Memory Wall remains a bottleneck for Chip Multiprocessors (CMPs), effective management of CMP last-level caches is of paramount importance in minimizing expensive off-chip memory accesses. For CMPs with private last-level caches, Cooperative Caching (CC) has been proposed to enable capacity sharing among private caches by spilling an evicted block from one cache to another. But this eviction-driven CC does not necessarily improve cache performance, since it implicitly favors applications with many block evictions regardless of their real capacity demand. The recent Dynamic Spill-Receive (DSR) paradigm improves CC by giving priority in spilling blocks to applications that benefit more from extra capacity. However, DSR only exploits the coarse-grained application-level difference in capacity demand, making it less effective because the non-uniformity exists at a much finer level. This paper (i) highlights the observation of cache set-level non-uniformity of capacity demand, and (ii) presents a novel L2 cache design, named SNUG (Set-level Non-Uniformity identifier and Grouper), that exploits this fine-grained non-uniformity to further enhance the effectiveness of cooperative caching. By utilizing a per-set shadow tag array and saturating counter, SNUG can identify whether a set should spill or receive blocks; by using an index-bit flipping scheme, SNUG can group peer sets for spilling and receiving in a flexible way, capturing more opportunities for cooperative caching. We evaluate our design through extensive execution-driven simulations on quad-core CMP systems. Our results show that, for 6 classes of workload combinations, our SNUG cache improves CMP throughput by up to 22.3%, with an average of 13.9%, over the baseline configuration, while the state-of-the-art DSR scheme achieves an improvement of at most 14.5%, and 8.4% on average.
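The abstract names two mechanisms without giving their details: a per-set shadow tag array with a saturating counter, and index-bit flipping to group peer sets. The Python sketch below is one possible reading, not the authors' design. Everything in it is an assumption made for illustration: the names `SetMonitor`, `peer_set_candidates`, and `pick_receiver`, the LRU replacement, the counter width, the update rule (increment on a shadow-tag hit, decrement on a real hit), the midpoint threshold, and the choice of flipping the lowest index bit.

```python
from collections import OrderedDict


class SetMonitor:
    """Hypothetical per-set monitor: shadow tags plus a saturating counter."""

    def __init__(self, ways=4, counter_bits=3):
        self.ways = ways
        self.real = OrderedDict()    # tags resident in the set, LRU first
        self.shadow = OrderedDict()  # tags of recently evicted blocks, LRU first
        self.max_count = (1 << counter_bits) - 1
        self.counter = self.max_count // 2  # assumed start: the midpoint

    def access(self, tag):
        if tag in self.real:
            # Real hit: refresh LRU position; assumed to lower the demand estimate.
            self.real.move_to_end(tag)
            self.counter = max(0, self.counter - 1)
            return "hit"
        if tag in self.shadow:
            # Miss that extra capacity would have turned into a hit.
            del self.shadow[tag]
            self.counter = min(self.max_count, self.counter + 1)
        self._fill(tag)
        return "miss"

    def _fill(self, tag):
        if len(self.real) >= self.ways:
            victim, _ = self.real.popitem(last=False)  # evict the LRU block
            self.shadow[victim] = True                 # remember its tag only
            if len(self.shadow) > self.ways:
                self.shadow.popitem(last=False)
        self.real[tag] = True

    def role(self):
        # Assumed threshold: above the midpoint the set wants to spill blocks.
        return "spill" if self.counter > self.max_count // 2 else "receive"


def peer_set_candidates(set_index, flip_bit=0):
    """Set indices to try in a peer cache: the same index and the bit-flipped one."""
    return (set_index, set_index ^ (1 << flip_bit))


def pick_receiver(set_index, peer_cache):
    """Return the index of a receiving set in peer_cache (a list of SetMonitor), or None."""
    for idx in peer_set_candidates(set_index):
        if peer_cache[idx].role() == "receive":
            return idx
    return None
```

Under these assumptions, a set that cycles through more blocks than it has ways keeps hitting in its shadow tags and drifts toward "spill", while a set whose working set fits drifts toward "receive". Allowing the bit-flipped index as a second candidate gives a spilling set one more chance to find a receiving peer set when the same-index set in the peer cache is itself short of capacity.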


Bibliographic details

Source: 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) | 2010 | pp. 1-10 | 10 pages
Conference location: Atlanta, GA (US)
Authors: Zhan Dongyuan; Jiang Hong; Seth Sharad C.
Author affiliation: Department of Computer Science Engineering, University of Nebraska - Lincoln, Lincoln, NE 68588
Original format: PDF
Language: English
Chinese Library Classification: TP311.133
Keywords: Chip Multiprocessors; Cooperative Caching; Last Level Cache Management; Set-Level Non-Uniformity of Capacity Demand

Similar literature

1. Exploiting Replicated Cache Blocks to Reduce L2 Cache Leakage in CMPs [J]. Kim, H., Ahn, IEEE Transactions on Very Large Scale Integration (VLSI) Systems. 2013, no. 10
2. Wuhan Optoelectronics Forum 74: Architecting STT-RAM caches for enhanced performance in CMPs [J]. Chita R. Das. Frontiers of Optoelectronics in China. 2014, no. 1
3. Wuhan Optoelectronics Forum 74: Architecting STT-RAM caches for enhanced performance in CMPs [J]. Chita R. Das. Frontiers of Optoelectronics in China (English edition). 2014, no. 1
4. Exploiting set-level non-uniformity of capacity demand to enhance CMP cooperative caching [C]. Dongyuan Zhan, Hong Jiang, Seth S.C. 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS). 2010
5. Exploiting properties of CMP cache traffic in designing hybrid packet/circuit switched NoCs [D]. Abousamra, Ahmed. 2013
6. Cooperative interactions within the family enhance the capacity for evolutionary change in body size [O]. Benjamin JM Jarrett, Matthew Schrader, Darren Rebar
7. Exploiting Set-Level Non-Uniformity of Capacity Demand to Enhance CMP Cooperative Caching [O]. Zhan, Dongyuan, Jiang, Hong, Seth, Sharad C. 2009
