IEEE Transactions on Parallel and Distributed Systems
Implementing multidestination worms in switch-based parallel systems: architectural alternatives and their impact

Abstract

Multidestination message passing has been proposed as an attractive mechanism for efficiently implementing multicast and other collective operations on direct networks. However, applying this mechanism to switch-based parallel systems is nontrivial. In this paper, we propose alternative switch architectures with differing buffer organizations to implement multidestination worms on switch-based parallel systems. First, we discuss issues related to such implementation (deadlock-freedom, replication mechanisms, header encoding, and routing). Next, we demonstrate how an existing central-buffer-based switch architecture supporting unicast message passing can be enhanced to accommodate multidestination message passing. Similarly, implementing multidestination worms on an input-buffer-based switch architecture is discussed, and two architectural alternatives are presented that reduce the wiring complexity in a practical switch implementation. The central-buffer-based and input-buffer-based implementations are evaluated against each other, as well as against the corresponding software-based schemes. Simulation experiments under a range of traffic (multiple multicast, bimodal, varying degree of multicast, and message length) and system size are used for evaluation. The study demonstrates the superiority of the central-buffer-based switch architecture. It also indicates that under bimodal traffic the central-buffer-based hardware multicast implementation affects background unicast traffic less adversely compared to a software-based multicast implementation. These results show that multidestination message passing can be applied easily and effectively to switch-based parallel systems to deliver good multicast and collective communication performance.