International Conference on High Performance Computing (HiPC 2004); December 19-22, 2004; Bangalore, India

A New Adaptive Fault-Tolerant Routing Methodology for Direct Networks

Abstract

Interconnection networks play a key role in the fault tolerance of massively parallel computers, since faults may isolate a large fraction of the machine containing many healthy nodes. In this paper, we present a methodology to design fully adaptive fault-tolerant routing algorithms for direct interconnection networks that can be applied to different regular topologies. The methodology is mainly based on the selection of an intermediate node (if needed) for each source-destination pair. Packets are adaptively routed to the intermediate node and, from this node, they are adaptively forwarded to their destination. This methodology requires only one additional virtual channel, even for tori. Evaluation results show that the methodology is 7-fault tolerant and that, for up to 14 faults, more than 99% of the fault combinations are tolerated, without significantly degrading performance in the presence of faults.
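The intermediate-node idea can be illustrated with a small sketch. The abstract does not give the selection procedure, so everything below is an assumption made for illustration: a k-ary n-cube (torus) topology, node faults only, a sub-path counted as usable when at least one fault-free minimal path exists, and a brute-force search that picks the candidate with the shortest total length. The function names (`torus_distance`, `minimal_path_exists`, `choose_route`) are hypothetical and are not taken from the paper.

```python
from collections import deque
from itertools import product


def torus_distance(a, b, k):
    """Minimal hop count between nodes a and b in a k-ary n-cube (torus)."""
    return sum(min(abs(x - y), k - abs(x - y)) for x, y in zip(a, b))


def neighbors(node, k):
    """Yield the torus neighbors of a node (one hop in each direction per dimension)."""
    for dim in range(len(node)):
        for step in (-1, 1):
            nxt = list(node)
            nxt[dim] = (nxt[dim] + step) % k
            yield tuple(nxt)


def minimal_path_exists(src, dst, faulty, k):
    """Return True if some minimal path from src to dst avoids all faulty nodes.

    Assumption: only hops that reduce the distance to dst are allowed,
    which models minimal adaptive routing.
    """
    if src in faulty or dst in faulty:
        return False
    seen = {src}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            return True
        d = torus_distance(node, dst, k)
        for nxt in neighbors(node, k):
            if (nxt not in faulty and nxt not in seen
                    and torus_distance(nxt, dst, k) == d - 1):
                seen.add(nxt)
                queue.append(nxt)
    return False


def choose_route(src, dst, faulty, k):
    """Return ("direct", None), ("via", node), or ("unreachable", None).

    Hypothetical selection rule: if no fault-free minimal path exists,
    pick the intermediate node that minimizes the total path length.
    """
    if minimal_path_exists(src, dst, faulty, k):
        return ("direct", None)
    best, best_len = None, None
    for cand in product(range(k), repeat=len(src)):
        if cand in faulty or cand in (src, dst):
            continue
        if (minimal_path_exists(src, cand, faulty, k)
                and minimal_path_exists(cand, dst, faulty, k)):
            length = torus_distance(src, cand, k) + torus_distance(cand, dst, k)
            if best_len is None or length < best_len:
                best, best_len = cand, length
    return ("via", best) if best is not None else ("unreachable", None)


# Example: 4x4 torus where both minimal paths from (0, 0) to (0, 2) are blocked.
route = choose_route((0, 0), (0, 2), {(0, 1), (0, 3)}, k=4)
```

In this example both minimal paths between the source and destination pass through a faulty node, so the sketch falls back to a two-phase route through an intermediate node. It does not model virtual channels or deadlock avoidance, which the abstract says the methodology handles with one additional virtual channel.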
