首页> 外文会议>IEEE International Symposium on High Performance Computer Architecture >Design and Implementation of the Blue Gene/P Snoop Filter
【24h】

Design and Implementation of the Blue Gene/P Snoop Filter

机译:蓝色基因/ P Snoop滤波器的设计与实现

获取原文

摘要

As multi-core processors evolve, coherence traffic between cores is becoming problematic, both in terms of performance and power. The negative effects of coherence (snoop) traffic can be significantly mitigated through snoop filtering. Shielding each cache with a device that can squash snoop requests for addresses known not to be in cache improves performance significantly for caches that cannot perform normal load and snoop lookups simultaneously. In addition, reducing snoop lookups yields power savings. This paper describes the design of the Blue Gene/P snoop filters, and presents hardware measurements to demonstrate their effectiveness. The Blue Gene/P snoop filters combine stream registers and snoop caches to capture both the locality of snoop addresses and their streaming behavior. Simulations of SPLASH-2 benchmarks illustrate tradeoffs and strengths of these two techniques. Their combination is shown to be most effective, eliminating 94-99% of all snoop requests using very few stream registers and snoop cache lines. This translates into an average performance improvement of almost 20% for the NAS benchmarks running on an actual Blue Gene/P system.
机译:随着多核处理器的发展,核心之间的相干流量在性能和功率方面都变得有问题。通过窥探滤波可以显着减轻一致性(窥探)流量的负面影响。使用可以使用可以同时执行正常负载和SNOOP查找的缓存来屏蔽任何可在高速缓存中屏蔽不在缓存的地址的侦听请求的设备对每个缓存提高性能。此外,减少Snoop查找会产生电量。本文介绍了蓝色基因/ P窥探滤波器的设计,并呈现硬件测量以证明其有效性。 Blue Gene / P Snoop滤波器组合流寄存器和Snoop缓存以捕获Snoop地址的局部性及其流式行为。 Splash-2基准的模拟说明了这两种技术的权衡和优势。它们的组合显示最有效,消除了使用很少的流寄存器和Snoop高速缓存行的所有Snoop请求中的94-99%。这转化为在实际蓝色基因/ P系统上运行的NAS基准的平均性能提高近20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号