首页> 外文期刊>Computer architecture news >Die-Stacked DRAM Caches for Servers Hit Ratio, Latency, or Bandwidth? Have It All with Footprint Cache
【24h】

Die-Stacked DRAM Caches for Servers Hit Ratio, Latency, or Bandwidth? Have It All with Footprint Cache

机译:芯片堆叠式DRAM缓存是针对服务器的命中率,延迟还是带宽?拥有足迹缓存

获取原文
获取原文并翻译 | 示例

摘要

Recent research advocates using large die-stacked DRAM caches to break the memory bandwidth wall. Existing DRAM cache designs fall into one of two categories — block-based and page-based. The former organize data in conventional blocks (e.g., 64B), ensuring low off-chip bandwidth utilization, but co-locate tags and data in the stacked DRAM, incurring high lookup latency. Furthermore, such designs suffer from low hit ratios due to poor temporal locality. In contrast, page-based caches, which manage data at larger granularity (e.g., 4KB pages), allow for reduced tag array overhead and fast lookup, and leverage high spatial locality at the cost of moving large amounts of data on and off the chip. This paper introduces Footprint Cache, an efficient die-stacked DRAM cache design for server processors. Footprint Cache allocates data at the granularity of pages, but identifies and fetches only those blocks within a page that will be touched during the page's residency in the cache — i.e., the page's footprint. In doing so, Footprint Cache eliminates the excessive off-chip traffic associated with page-based designs, while preserving their high hit ratio, small tag array overhead, and low lookup latency. Cycle-accurate simulation results of a 16-core server with up to 512MB Footprint Cache indicate a 57% performance improvement over a baseline chip without a die-stacked cache. Compared to a state-of-the-art block-based design, our design improves performance by 13% while reducing dynamic energy of stacked DRAM by 24%.
机译:最近的研究提倡者使用大型裸片堆叠DRAM缓存来打破内存带宽壁垒。现有的DRAM缓存设计属于两类之一-基于块和基于页面。前者将数据组织在常规块(例如64B)中,确保了较低的片外带宽利用率,但将标签和数据共置在堆叠的DRAM中,从而导致高查找延迟。此外,由于不良的时间局部性,这样的设计遭受低的命中率。相反,基于页面的高速缓存可以以较大的粒度(例如4KB页面)管理数据,从而可以减少标签阵列的开销和快速查找,并以将大量数据移入和移出芯片为代价来利用较高的空间局部性。 。本文介绍了Footprint Cache,这是一种用于服务器处理器的高效管芯堆叠DRAM缓存设计。足迹缓存按页面的粒度分配数据,但仅识别和获取页面中在页面驻留在缓存中时将被触摸的那些块,即页面的足迹。这样,Footprint Cache消除了与基于页面的设计相关的过多片外流量,同时保留了它们的高命中率,小标签阵列开销和低查找延迟。具有多达512MB足迹缓存的16核服务器的精确周期仿真结果表明,与没有裸片堆栈缓存的基准芯片相比,性能提高了57%。与最新的基于块的设计相比,我们的设计将性能提高了13%,同时将堆叠DRAM的动态能量降低了24%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号