首页> 外文会议>Annual IEEE/ACM International Symposium on Microarchitecture >Decoupled compressed cache: Exploiting spatial locality for energy-optimized compressed caching
【24h】

Decoupled compressed cache: Exploiting spatial locality for energy-optimized compressed caching

机译:解耦的压缩缓存:利用空间局部性进行能源优化的压缩缓存

获取原文

摘要

In multicore processor systems, last-level caches (LLCs) play a crucial role in reducing system energy by (i) filtering out expensive accesses to main memory and (ii) reducing the time spent executing in high-power states. Cache compression can increase effective cache capacity and reduce misses, improve performance, and potentially reduce system energy. However, previous compressed cache designs have demonstrated only limited benefits due to internal fragmentation and limited tags. In this paper, we propose the Decoupled Compressed Cache (DCC), which exploits spatial locality to improve both the performance and energy-efficiency of cache compression. DCC uses decoupled super-blocks and non-contiguous sub-block allocation to decrease tag overhead without increasing internal fragmentation. Non-contiguous sub-blocks also eliminate the need for energy-expensive re-compaction when a block's size changes. Compared to earlier compressed caches, DCC increases normalized effective capacity to a maximum of 4 and an average of 2.2 for a wide range of workloads. A further optimized Co-DCC (Co-Compacted DCC) design improves the average normalized effective capacity to 2.6 by co-compacting the compressed blocks in a super-block. Our simulations show that DCC nearly doubles the benefits of previous compressed caches with similar area overhead. We also demonstrate a practical DCC design based on a recent commercial LLC design.
机译:在多核处理器系统中,最后一级缓存(LLC)通过(i)过滤掉对主内存的昂贵访问,以及(ii)减少在高功率状态下执行所花费的时间,在降低系统能耗方面发挥着至关重要的作用。高速缓存压缩可以增加有效的高速缓存容量并减少未命中率,提高性能,并可能减少系统能耗。但是,由于内部碎片和标签的限制,以前的压缩缓存设计仅显示出有限的好处。在本文中,我们提出了去耦压缩缓存(DCC),它利用空间局部性来提高缓存压缩的性能和能效。 DCC使用解耦的超级块和不连续的子块分配来减少标签开销,而不会增加内部碎片。当块的大小改变时,不连续的子块也消除了对能量消耗昂贵的重新压缩的需求。与早期的压缩缓存相比,DCC可以将各种工作负载的标准化有效容量最大增加到4,平均增加到2.2。进一步优化的Co-DCC(Co-Compacted DCC)设计通过在超级块中共同压缩压缩块,将平均归一化有效容量提高到2.6。我们的仿真表明,DCC的效率几乎是以前的压缩缓存的两倍,而面积压缩却类似。我们还演示了基于最近的商业LLC设计的实用DCC设计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号