Conference: ACM/IEEE Conference on Supercomputing

Multi-protocol active messages on a cluster of SMP's

Abstract

Clusters of multiprocessors, or Clumps, promise to be the supercomputers of the future, but obtaining high performance on these architectures requires an understanding of interactions between the multiple levels of interconnection. In this paper, we present the first multi-protocol implementation of a lightweight message layer: a version of Active Messages-II running on a cluster of Sun Enterprise 5000 servers connected with Myrinet. This research brings together several pieces of high-performance interconnection technology: bus backplanes for symmetric multiprocessors, low-latency networks for connections between machines, and simple, user-level primitives for communication. The paper describes the shared memory message-passing protocol and analyzes the multi-protocol implementation with both microbenchmarks and Split-C applications. Three aspects of the communication layer are critical to performance: the overhead of cache-coherence mechanisms, the method of managing concurrent access, and the cost of accessing state with the slower protocol. Through the use of an adaptive polling strategy, the multi-protocol implementation limits performance interactions between the protocols, delivering up to 160 MB/s of bandwidth with 3.6 microsecond end-to-end latency. Applications within an SMP benefit from this fast communication, running up to 75% faster than on a network of uniprocessor workstations (NOW). Applications running on the entire Clump are limited by the balance of NICs to processors in our system, and are typically slower than on the NOW. These results illustrate several potential pitfalls for the Clumps architecture.
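
The abstract names an adaptive polling strategy but does not spell out its policy. As a rough illustration only, the sketch below assumes one plausible shape: the cheap shared-memory queue is checked on every poll, while the expensive network queue is checked only every `interval` polls, with the interval doubling after an empty network poll and resetting when network traffic arrives. The class `AdaptivePoller`, its fields, and the doubling rule are all hypothetical and are not taken from the paper; Python deques stand in for what would be shared-memory and NIC receive queues in a real message layer.

```python
from collections import deque


class AdaptivePoller:
    """Hypothetical two-protocol poller (illustrative, not the paper's code).

    The shared-memory queue is drained on every poll. The network queue is
    checked only once every `interval` polls, and `interval` adapts to
    observed network traffic.
    """

    def __init__(self, min_interval=1, max_interval=64):
        self.shm_queue = deque()  # stand-in for the intra-SMP message queue
        self.net_queue = deque()  # stand-in for the NIC receive queue
        self.min_interval = min_interval
        self.max_interval = max_interval
        self.interval = min_interval
        self.calls_since_net_poll = 0
        self.net_polls = 0  # counts how often the slow protocol was touched

    def poll(self):
        """Return a list of (protocol, message) pairs delivered by this poll."""
        delivered = []

        # Fast protocol: always drain the shared-memory queue.
        while self.shm_queue:
            delivered.append(("shm", self.shm_queue.popleft()))

        # Slow protocol: touch network state only when the interval expires.
        self.calls_since_net_poll += 1
        if self.calls_since_net_poll >= self.interval:
            self.calls_since_net_poll = 0
            self.net_polls += 1
            if self.net_queue:
                while self.net_queue:
                    delivered.append(("net", self.net_queue.popleft()))
                # Traffic seen: return to the most frequent polling rate.
                self.interval = self.min_interval
            else:
                # Nothing arrived: back off, up to the configured cap.
                self.interval = min(self.interval * 2, self.max_interval)
        return delivered
```

Under this assumed policy, a process that communicates only within its SMP touches the slower protocol's state at most once every `max_interval` polls, which is one way the cost of the slow protocol could be kept from dominating the fast one. The trade-off is added latency for the first network message after an idle period.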