
Optimizing user-level communication patterns on the Fujitsu AP3000


Abstract

We present techniques and algorithms to improve the performance of various communication patterns on message passing platforms where, for reasons of safety, user-level communications must be buffered in (special) memory on both the sending and the receiving side. These algorithms not only minimize message copying but also overlap the copying to and from the special memory with the actual transfer, enabling full bandwidth to be achieved. The patterns include tree broadcasts and reductions, (ring-based) multiple broadcasts and reductions, pipelined broadcasts, and buffered point-to-point sends. In each case, the messages have a simple stride. All of these patterns are used in dense linear algebra applications, although they are also used in many other contexts. The algorithms are implemented and their performance is evaluated on the Fujitsu AP3000, a message passing multicomputer having many characteristics of the cluster model. Some aspects, such as the performance characteristics of the special memory, are specific to the AP3000; however, the algorithms still apply to any platform using a similar mode of user-level communications. Worthwhile performance increases are obtained, especially for patterns involving moderate to large numbers of processors.
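The benefit of overlapping buffer copies with the transfer can be illustrated with a small timing model. The following is only a sketch under stated assumptions, not the paper's algorithm or any AP3000 API: the function names, the fixed chunk size, the three-stage view (copy into the send buffer, network transfer, copy out of the receive buffer), and the bandwidth parameters are all hypothetical, and per-chunk startup latency is ignored.

```python
import math


def serial_time(n_bytes, copy_bw, net_bw):
    """Time when copy-in, transfer, and copy-out run one after another.

    Assumed model: each stage moves all n_bytes at a constant bandwidth
    (bytes per second) and no two stages overlap.
    """
    return n_bytes / copy_bw + n_bytes / net_bw + n_bytes / copy_bw


def pipelined_time(n_bytes, chunk_bytes, copy_bw, net_bw):
    """Time when the message is split into chunks that flow through a
    three-stage pipeline: copy-in -> transfer -> copy-out.

    A chunk enters a stage once it has left the previous stage and the
    stage has finished the previous chunk (hypothetical model).
    """
    n_chunks = math.ceil(n_bytes / chunk_bytes)
    finish = [0.0, 0.0, 0.0]  # time at which each stage last became free
    remaining = n_bytes
    for _ in range(n_chunks):
        size = min(chunk_bytes, remaining)  # the last chunk may be shorter
        remaining -= size
        durations = (size / copy_bw, size / net_bw, size / copy_bw)
        ready = 0.0  # time at which this chunk is available to the next stage
        for stage, duration in enumerate(durations):
            start = max(ready, finish[stage])
            finish[stage] = start + duration
            ready = finish[stage]
    return finish[2]
```

In this model, calling `pipelined_time` with many small chunks gives a total close to the time of the slowest stage alone, whereas `serial_time` pays for all three stages in full. With a single chunk covering the whole message, the two functions agree.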