首页> 外文会议>International Parallel Processing Symposium >Software techniques for improving MPP bulk-transfer performance
【24h】

Software techniques for improving MPP bulk-transfer performance

机译:用于提高MPP散装传输性能的软件技术

获取原文
获取外文期刊封面目录资料

摘要

Brewer & Kuszmaul (1994) demonstrated how barriers and traffic interleaving can alleviate the problem of bulk-transfer performance degradation on the Thinking Machines CM-5 massively parallel processor (MPP) by exploiting the observation that one-on-one communication avoids network congestion. We apply and extend these techniques on the Intel Paragon and MIT Alewife machines. Because these machines lack the CM-5's fast hardware support for barriers, we introduce a token-passing scheme that avoids barriers while maintaining one-on-one communication. We also introduce a new algorithm-distributed dynamic scheduling-that brings Brewer & Kuszmaul's observations to bear on irregular traffic patterns by massaging traffic into a sequence of near-permutations at runtime, without requiring any preprocessing or global state. The measured performance of our algorithm exceeds that of traffic interleaving (the most effective technique proposed by Brewer & Kuszmaul) on all three platforms, and is comparable to the performance of static scheduling, which requires preprocessing and global state.
机译:Brewer&Kuszmaul(1994)通过利用一对一通信避免网络拥塞,展示障碍和交通交织在思维机器CM-5大型平行处理器(MPP)上的批量转移性能下降问题。我们申请并扩展了英特尔Paragon和MIT ALEWIFE机器的这些技术。由于这些机器缺乏CM-5对障碍的快速硬件支持,因此我们介绍了一种令牌传递方案,避免了障碍,同时保持一对一的通信。我们还介绍了一种新的算法分布式动态调度 - 将Brewer&Kuszmaul的观察结果带入不规则的交通模式,通过将流量按摩到运行时的近偏移序列,而无需任何预处理或全局状态。我们的算法的测量性能超过了所有三个平台上的交通交织(Brewer&Kuszmaul提出的最有效的技术),并且与静态调度的性能相当,这需要预处理和全局状态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号