首页> 外文期刊>Multi-Scale Computing Systems, IEEE Transactions on >FusedCache: A Naturally Inclusive, Racetrack Memory, Dual-Level Private Cache
【24h】

FusedCache: A Naturally Inclusive, Racetrack Memory, Dual-Level Private Cache

机译:FusedCache:自然包含的赛道内存,双层私有缓存

获取原文
获取原文并翻译 | 示例

摘要

We propose FusedCache, a two-level set-associative Racetrack memory (RM) cache design that utilizes RM's high density for providing fast uniform access at one level, and non-uniform access at the next. FusedCache is well suited for private L1/L2 caches enforcing alignment of L1 data with the RM access points with the remaining non-aligned data serving as L2. It uses traditional LRU eviction for L1 misses. Promotion and demotion between L1 and L2 are performed through shifts and, when necessary, background swap operations. These swap operations do not require physical stores or loads, making accesses both faster and more energy efficient. Further, unlike a traditional inclusive cache hierarchy, fused L1 cache lines naturally exist in L2 avoiding duplicated storage and tag structures, promotions, and evictions. L1 status on each track is strictly enforced by track LRU maintenance and background swapping. Our results demonstrate that compared to an iso-area L1 SRAM cache replacement, FusedCache improves application performance by 7 percent while reducing cache energy by 33 percent. Compared to an iso-capacity two level (L1/L2) SRAM cache replacement, FusedCache provides similar performance with a dramatic 69 percent cache energy reduction. Compared to a TapeCache L1 scheme, FusedCache gains a 7 percent performance improvement with a 6 percent cache energy saving which translates to a 13 percent improvement in energy-delay product.
机译:我们建议使用FusedCache,这是一种两级的集关联赛马场内存(RM)缓存设计,该设计利用RM的高密度在一个级别提供快速的统一访问,而在下一个级别提供非统一的访问。 FusedCache非常适合于私有L1 / L2高速缓存,该高速缓存要求将L1数据与RM访问点对齐,并将其余未对齐的数据用作L2。它对L1未命中使用传统的LRU逐出。 L1和L2之间的提升和降级是通过轮班以及必要时进行后台交换操作进行的。这些交换操作不需要物理存储或负载,从而使访问更快,更节能。此外,与传统的包含性缓存层次结构不同,融合的L1缓存行自然存在于L2中,避免了重复的存储和标签结构,升级和逐出。每个磁道的L1状态均由磁道LRU维护和后台交换严格执行。我们的结果表明,与等面积L1 SRAM缓存替换相比,FusedCache将应用程序性能提高了7%,同时将缓存能量降低了33%。与等容量的两级(L1 / L2)SRAM缓存替换相比,FusedCache提供了相似的性能,并显着降低了69%的缓存能量。与TapeCache L1方案相比,FusedCache的性能提高了7%,高速缓存的能耗降低了6%,这意味着能源延迟产品提高了13%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号