首页> 外文期刊>Journal of supercomputing >Reducing the second-level cache conflict misses using a set folding technique
【24h】

Reducing the second-level cache conflict misses using a set folding technique

机译:使用集合折叠技术减少第二级缓存冲突未命中

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The cache memory has a direct effect on the performance of a computer system. Instructions and data are fetched from a fast cache instead of a slow memory to save hundreds of cycles. Reducing the cache miss ratio will definitely improve the execution time of an application. In this work, we propose cache memory designs that reduce the number of conflict misses significantly. The proposed designs reduce the conflict misses in the last level multi-way set associative cache. Each set is divided into a group of subsets: the first is referred to as the exclusive subset, and the rest are the shared subsets. The exclusive is configured as a traditional cache where each block is mapped to the set whose index matches the block index. In addition to their standard cache indexing role, the shared subsets are configured to host blocks with different indices. A memory block can be mapped to one subset from the exclusive type or one of multiple subsets from the shared type. Since the proposed technique is based on combining multiple sets of the shared part to form a larger set, that is shared between memory blocks with different indices, we have chosen the name "set folding." The decision as to where to map a memory block depends on the number of misses encountered at each of the potential target sets. To evaluate the proposed design based on the overall hit rate, twenty-three benchmarks from SPEC CPU 2006 were simulated using the SuperESCalar simulator. The proposed designs require a few extra storage bits which adds a small overhead on the hardware complexity in comparison with the conventional cache. However, the proposed designs achieve lower miss rates for most of the benchmarks.
机译:高速缓存存储器直接影响计算机系统的性能。指令和数据是从快速缓存而不是慢速存储器中获取的,以节省数百个周期。降低缓存未命中率肯定会改善应用程序的执行时间。在这项工作中,我们提出了可以显着减少冲突遗漏数量的高速缓存存储器设计。提出的设计减少了最后一级多路集关联缓存中的冲突遗漏。每个集合分为一组子集:第一个子集称为互斥子集,其余子集为共享子集。独占配置为传统缓存,其中每个块都映射到索引与块索引匹配的集合。除了它们的标准高速缓存索引功能外,共享子集还配置为托管具有不同索引的块。可以将存储块映射到互斥类型的一个子集或共享类型的多个子集之一。由于所提出的技术是基于将共享部分的多组组合以形成更大的组(该组在具有不同索引的存储块之间共享),因此我们选择了“集合折叠”这个名称。关于在何处映射存储块的决定取决于在每个潜在目标集中遇到的未命中次数。为了基于总体命中率评估建议的设计,使用SuperESCalar模拟器对SPEC CPU 2006的23个基准进行了仿真。所提出的设计需要一些额外的存储位,与传统的高速缓存相比,这增加了硬件复杂性的小开销。但是,对于大多数基准测试,建议的设计实现了较低的未命中率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号