Journal: IEEE Transactions on Computers

Adaptive Cache Coherence Mechanisms with Producer–Consumer Sharing Optimization for Chip Multiprocessors

Abstract

In chip multiprocessors (CMPs), maintaining cache coherence can account for a major share of performance overhead. Write-invalidate protocols adopted by most CMPs generate a high number of cache-to-cache misses under producer–consumer sharing patterns. Accordingly, this paper presents three cache coherence mechanisms optimized for CMPs. First, to reduce the coherence misses observed in write-invalidate-based protocols, we propose a dynamic write-update mechanism layered on top of a write-invalidate protocol. This mechanism is triggered specifically when a producer–consumer sharing pattern is detected. Second, we extend this adaptive protocol with a bandwidth-adaptive mechanism to eliminate performance degradation from write-updates under limited bandwidth. Finally, a proximity-aware mechanism is proposed to extend the base adaptive protocol with latency-based optimizations. Experimental analysis is conducted on a set of scientific applications from the SPLASH-2 and NAS parallel benchmark suites. The proposed mechanisms were shown to reduce coherence misses by up to 48% and, in turn, to speed up application performance by up to 30%. The bandwidth-adaptive mechanism is shown to perform well under varying levels of available bandwidth. Results from our proposed proximity-aware extension demonstrated up to 6% performance gain over the base adaptive protocol for 64-core tiled CMP runs. In addition, the analytical model provided good estimates for performance gains from our adaptive protocols.
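
The abstract does not say how the producer–consumer pattern is detected or how the switch to write-update is made, so the following is only a toy sketch of the general idea, not the paper's protocol. It assumes a single cache line, a hypothetical per-line saturating counter (`pc_count`) kept at a directory, and a hypothetical threshold: once the same core has written the line repeatedly with other cores reading in between, writes push the new data to those readers instead of invalidating them. All class, field, and parameter names are invented for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class LineState:
    """Directory-side state for one cache line (all fields are illustrative)."""
    sharers: Set[int] = field(default_factory=set)    # cores holding a valid copy
    consumers: Set[int] = field(default_factory=set)  # readers seen since the last write
    last_writer: Optional[int] = None
    pc_count: int = 0      # saturating producer-consumer counter (assumed)
    c2c_misses: int = 0    # read misses that must be served by the writer's cache


class AdaptiveDirectory:
    """Write-invalidate by default; write-update once a pattern is detected."""

    def __init__(self, threshold: int = 2, max_count: int = 3, adaptive: bool = True):
        self.threshold = threshold   # assumed trigger point, not from the paper
        self.max_count = max_count
        self.adaptive = adaptive
        self.line = LineState()

    def update_mode(self) -> bool:
        return self.adaptive and self.line.pc_count >= self.threshold

    def read(self, core: int) -> None:
        line = self.line
        if core not in line.sharers:
            # Miss: if another core wrote the line, this is a cache-to-cache miss.
            if line.last_writer is not None and core != line.last_writer:
                line.c2c_misses += 1
            line.sharers.add(core)
        if core != line.last_writer:
            line.consumers.add(core)

    def write(self, core: int) -> None:
        line = self.line
        if core == line.last_writer and line.consumers:
            # Same producer again, with readers in between: strengthen the pattern.
            line.pc_count = min(line.pc_count + 1, self.max_count)
        elif core != line.last_writer:
            line.pc_count = 0        # a different writer breaks the pattern
        else:
            line.pc_count = max(line.pc_count - 1, 0)  # nobody read the last value

        if self.update_mode():
            # Write-update: recent consumers receive the new data and stay valid.
            line.sharers = {core} | line.consumers
        else:
            # Write-invalidate: every other copy is dropped.
            line.sharers = {core}
        line.last_writer = core
        line.consumers = set()


def run(adaptive: bool, iterations: int = 10) -> int:
    """Core 0 produces; cores 1 and 2 consume. Returns cache-to-cache misses."""
    directory = AdaptiveDirectory(adaptive=adaptive)
    for _ in range(iterations):
        directory.write(0)
        directory.read(1)
        directory.read(2)
    return directory.line.c2c_misses


if __name__ == "__main__":
    print("write-invalidate only:", run(adaptive=False))
    print("adaptive write-update:", run(adaptive=True))
```

In this toy trace the invalidate-only run incurs 20 cache-to-cache misses and the adaptive run incurs 4, because the consumers stop missing once the counter reaches the threshold. These figures describe the sketch only and say nothing about the results reported in the paper; the sketch also ignores the extra update traffic that the bandwidth-adaptive mechanism is meant to control.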
