
The impact of shared-cache clustering in small-scale shared-memory multiprocessors


Abstract

As processor performance continues to increase, greater demands are placed on the bus and memory systems of small-scale shared-memory multiprocessors. In this paper, we investigate how to reduce these demands by organizing groups of processors into clusters that are then connected by a shared global bus. We take advantage of the high-bandwidth, low-latency interconnections available from multichip module (MCM) technology to build clusters in which multiple high-performance processors share an L2 cache. MCM technology allows significantly lower shared-cache access times, and higher shared-cache-to-processor bandwidth, than is possible with printed circuit board (PCB) designs. Our results show that for an eight-processor bus-based system, bus contention can be a large portion of the overall execution time, and that clustering can eliminate much or all of it. Clustering also tends to reduce read stall times, owing to shared working-set effects and a reduction in the effect of communication misses. The same is true for two- and four-processor systems, although to a lesser extent. Overall, we find that clustering can result in significant performance gains for applications that heavily utilize the memory system.
