...
首页> 外文期刊>Journal of supercomputing >Lea-TN: leader election algorithm considering node and link failures in a torus network
【24h】

Lea-TN: leader election algorithm considering node and link failures in a torus network

机译:LEA-TN:考虑CORUS网络中的节点和链接故障的领导选举算法

获取原文
获取原文并翻译 | 示例

摘要

Torus network topology offers many advantages such as higher speed, lower latency, better fairness, and lower energy consumption. For these kinds of benefits, nowadays, it is used to construct many parallel and distributed systems like IBM Blue Gene, IBM Sequoia, Mira, and Sugon TC8600. In parallel and distributed computing, multiple nodes act together to carry out large tasks fast. Hence, coordination is indispensable among these nodes to complete such tasks. A possible coordination method could be to elect a leader among the nodes. Along with coordination, the elected leader node also manages other activities such as task allocation, result aggregation, clock synchronization, proficient resource sharing management, and communication among the system's nodes. In this paper, we propose a lower bound Omega(N log(3) N) of message complexity on a comparison-based leader election for a 2D torus network (where N is the number of nodes in the network). Next, we sketch a new leader election algorithm (Lea-TN) considering both the node and link failures for a 2D torus network. This Lea-TN is a deterministic and robust algorithm that elects a leader for a synchronous distributed system. The algorithm chooses a leader, even when there are some link or node failures in the system. We consider the number of non-faulty links and the subsisting nodes' failure rate to elect a reliable leader. We introduce new patterns for sending messages that help reduce the number of exchanged messages and the execution time of the election process. The proposed algorithm (Lea-TN) enables a node to identify its link failures during the election also. Further, we simulate the Lea-TN algorithm and compare its performance with that of the well-known existing algorithms.
机译:Torus网络拓扑提供了许多优点,例如更高的速度,降低延迟,更好的公平性和更低的能耗。对于这些益处,如今,它用于构建许多平行和分布的系统,如IBM Blue Gene,IBM SequoIa,Mira和Sugon TC8600。在并行和分布式计算中,多个节点一起行动以快速执行大型任务。因此,这些节点之间的协调是必不可少的,以完成这些任务。可能的协调方法可以选择节点之间的领导者。 Along with coordination, the elected leader node also manages other activities such as task allocation, result aggregation, clock synchronization, proficient resource sharing management, and communication among the system's nodes.在本文中,我们提出了在基于比较的基于比较的领导者选举中的消息复杂性的下限ω(n log(3)n)(其中n是网络中的节点数)。接下来,考虑到2D Torus网络的节点和链路故障,绘制新的领导者选举算法(LEA-TN)。该LEA-TN是一种确定性和强大的算法,可选择同步分布式系统的领导者。算法选择了一个领导者,即使系统中有一些链接或节点故障。我们考虑了非故障链接的数量和第五次节点的失败率来选择可靠的领导者。我们介绍了用于发送消息的新模式,有助于减少交换消息的数量和选举过程的执行时间。所提出的算法(LEA-TN)使节点能够在选举期间识别其链路故障。此外,我们模拟了LEA-TN算法,并将其性能与众所周知的现有算法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号