首页> 外文期刊>Computers & Digital Techniques, IET >Tree-based scheme for reducing shared cache miss rate leveraging regional, statistical and temporal similarities
【24h】

Tree-based scheme for reducing shared cache miss rate leveraging regional, statistical and temporal similarities

机译:基于树的方案可利用区域,统计和时间相似性来降低共享缓存未命中率

获取原文
获取原文并翻译 | 示例
       

摘要

Cache miss can have a major impact on overall performance of many-core systems. A miss may result in extra traffic and delay because of coherency messages. This has been reduced in coarse-grain coherency protocols where only shared misses require a coherency message. Conventional off-chip methods manage the shared miss rate by relying on reuse histories. However the pertinent memory overhead that comes with reuse histories makes them impractical for on-chip multi-processor systems. In this study, a new scheme has been proposed to reduce shared cache miss rate in multi-processor system-on-chips that benefits from novel prefetching techniques to L2 caches from off-chip memories or other remote L2 caches located on-chip. In the proposed scheme, the previously proposed Virtual Tree Coherence (VTC) method has been extended to limit block forwarding messages to true sharers within each region. Instead of relying on exact reuse histories, shared regions are searched for regional, temporal and statistical similarities. These similarities are exploited for determining the sharers that should receive the forwarded blocks. The proposed method has been evaluated with Splash-2 workloads. Simulation results indicate that the proposed method has reduced shared miss count by up to 75%, and improved interconnect traffic by up to 47% compared with VTC.
机译:高速缓存未命中可能会对多核系统的整体性能产生重大影响。由于一致性消息,未命中可能会导致额外的流量和延迟。在仅共享未命中需要一致性消息的粗粒度一致性协议中,这一点已得到减少。常规的片外方法依靠重用历史记录来管理共享丢失率。但是,重用历史记录附带的相关内存开销使它们对于片上多处理器系统不切实际。在这项研究中,已经提出了一种新的方案来减少多处理器片上系统中的共享高速缓存未命中率,该方案得益于新颖的预取技术,可以从片外存储器或其他位于芯片上的远程L2高速缓存中获取L2高速缓存。在提出的方案中,先前提出的虚拟树一致性(VTC)方法已扩展为将块转发消息限制为每个区域内的真实共享者。不再依赖确切的重用历史记录,而是在共享区域中搜索区域,时间和统计上的相似性。利用这些相似性来确定应接收转发块的共享者。提议的方法已在Splash-2工作负载下进行了评估。仿真结果表明,与VTC相比,该方法最多可将共享未命中次数减少75%,并将互连流量提高多达47%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号