Conference: ISCA International Conference on Computer and Their Applications

PERMUTATION ROUTING AND FAULT TOLERANCE IN OMEGA PLUS NETWORKS

Abstract

Shuffle/Exchange networks with redundant paths are interconnection networks designed to provide fault tolerance for high-performance computing systems. Using the redundant paths, whether for random access or for permutation routing, requires identifying faults on the currently active paths so that the redundant paths can be used to avoid them. So far, no prior work has provided a detailed mechanism for how and when the redundant paths are to be used. This paper describes a routing technique that avoids a single fault in the Omega-Plus network, a network with an extra switching stage that provides two paths between any processor-memory pair. The technique is then extended to handle permutation routing in the presence of a single fault. The proposed approach requires periodic diagnosis and saving the system state at checkpoints. Because checkpointing adds overhead to normal processing, the expected number of permutations to be performed and the optimal checkpoint interval are derived for a sequence of P permutations.
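
The two-path property mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's routing technique: it assumes (the abstract does not say) that the extra stage is placed in front of a standard Omega network of n shuffle-exchange stages, that stages 1..n use ordinary destination-tag routing, and that the single fault is a switch identified by a (stage, switch) pair. The function names `rotl`, `path`, and `route_avoiding` are hypothetical.

```python
def rotl(x, n):
    """Perfect shuffle on n-bit link labels: rotate left by one bit."""
    mask = (1 << n) - 1
    return ((x << 1) & mask) | (x >> (n - 1))


def path(src, dst, n, extra_bit):
    """Return the (stage, switch) pairs visited on the way from src to dst.

    Stage 0 is the assumed extra stage; its output port is chosen freely
    through extra_bit. Stages 1..n use destination-tag routing, consuming
    the bits of dst from most significant to least significant.
    """
    pos = src
    hops = []
    bits = [extra_bit] + [(dst >> (n - 1 - i)) & 1 for i in range(n)]
    for stage, bit in enumerate(bits):
        pos = rotl(pos, n)              # shuffle wiring in front of the stage
        hops.append((stage, pos >> 1))  # switch index = link label without LSB
        pos = (pos & ~1) | bit          # leave on upper (0) or lower (1) output
    assert pos == dst                   # the n tag bits overwrite every label bit
    return hops


def route_avoiding(src, dst, n, faulty):
    """Pick whichever of the two paths avoids the faulty (stage, switch)."""
    for b in (0, 1):
        hops = path(src, dst, n, b)
        if faulty not in hops:
            return b, hops
    return None  # the fault lies on both paths


# Example with N = 8 (n = 3), source 5, destination 2.
print(path(5, 2, 3, 0))
print(path(5, 2, 3, 1))
print(route_avoiding(5, 2, 3, (1, 2)))
```

Under these assumptions the two paths share their switch in the extra stage and in the last stage but use different switches in every stage between, so `route_avoiding` returns `None` only when the faulty switch is one of those two shared endpoints.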
