首页> 外文期刊>IEEE Transactions on Computers >A quantitative analysis of the performance and scalability of distributed shared memory cache coherence protocols
【24h】

A quantitative analysis of the performance and scalability of distributed shared memory cache coherence protocols

机译:分布式共享内存缓存一致性协议的性能和可伸缩性的定量分析

获取原文
获取原文并翻译 | 示例

摘要

Scalable cache coherence protocols have become the key technology for creating moderate to large-scale shared-memory multiprocessors. Although the performance of such multiprocessors depends critically on the performance of the cache coherence protocol, little comparative performance data is available. Existing commercial implementations use a variety of different protocols, including bit-vector/coarse-vector protocols, SCI-based protocols, and COMA protocols. Using the programmable protocol processor of the Stanford FLASH multiprocessor, we provide a detailed, implementation-oriented evaluation of four popular cache coherence protocols. In addition to measurements of the characteristics of protocol execution (e.g., memory overhead, protocol execution time, and message count) and of overall performance, we examine the effects of scaling the processor count from 1 to 128 processors. Surprisingly, the optimal protocol changes for different applications and can change with processor count even within the same application. These results help identify the strengths of specific protocols and illustrate the benefits of providing flexibility in the choice of cache coherence protocol.
机译:可扩展的高速缓存一致性协议已成为创建中型到大型共享内存多处理器的关键技术。尽管此类多处理器的性能主要取决于高速缓存一致性协议的性能,但几乎没有可用的比较性能数据。现有的商业实现使用各种不同的协议,包括位向量/粗向量协议,基于SCI的协议和COMA协议。使用Stanford FLASH多处理器的可编程协议处理器,我们对四种流行的缓存一致性协议进行了详细的,面向实现的评估。除了测量协议执行的特征(例如内存开销,协议执行时间和消息计数)以及整体性能外,我们还检查了将处理器数量从1个扩展到128个处理器的影响。令人惊讶的是,最佳协议针对不同的应用程序而变化,并且即使在同一应用程序内,也可能随处理器数量的变化而变化。这些结果有助于确定特定协议的优势,并说明在选择高速缓存一致性协议时提供灵活性的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号