首页> 外文期刊>IEEE Transactions on Computers >Message-optimal protocols for fault-tolerant broadcasts/multicasts in distributed systems with crash failures
【24h】

Message-optimal protocols for fault-tolerant broadcasts/multicasts in distributed systems with crash failures

机译:具有崩溃故障的分布式系统中的容错广播/组播的消息最佳协议

获取原文
获取原文并翻译 | 示例

摘要

An essential feature in any fault tolerant design of distributed systems is a mechanism by which a process can reliably broadcast information to other processes in the presence of failures. The paper studies the message complexity of fault tolerant broadcast protocols in weakly synchronous and totally asynchronous distributed systems with point to point communication links, where the system failures are caused by the processes but the communication links are completely reliable. We focus on the number of messages required of any fault tolerant protocol in failure free executions. Our motivation is that one should incur the cost of handling failures only when they actually occur. We present protocols that, in an n-process system subject to at most t crash failures where 1/spl les/t>(n-1), guarantee the delivery of a message from any process to other nonfaulty processes. In the absence of crash failures, our protocols require (n+t-1) messages in the weakly synchronous model and (t+1)(n--1-(t/2)) messages in the totally asynchronous model. Moreover, we show that in both cases our protocols are optimal with respect to message complexity. The new insights provided in our lower bound proofs also yield graph-theoretic characterizations of all message optimal reliable broadcast protocols in failure free executions. Both the upper and lower bound results on broadcast protocols can be generalized to multicast protocols, where a process only needs to deliver a message to a subset of processes in the system.
机译:分布式系统的任何容错设计中的基本特征是一种机制,通过该机制,进程可以在出现故障时可靠地将信息广播到其他进程。本文研究了具有点对点通信链路的弱同步和完全异步分布式系统中的容错广播协议的消息复杂性,其中系统故障是由过程引起的,但是通信链路是完全可靠的。我们专注于无故障执行中任何容错协议所需的消息数。我们的动机是,只有在实际发生故障时,才应承担处理故障的成本。我们提出的协议在n进程系统中最多遭受t崩溃失败,其中1 / spl les / t>(n-1)保证从任何进程到其他无故障进程的消息传递。在没有崩溃失败的情况下,我们的协议在弱同步模型中需要(n + t-1)条消息,在完全异步模型中需要(t + 1)(n--1-(t / 2))条消息。此外,我们表明在两种情况下,我们的协议在消息复杂度方面都是最佳的。我们在下界证明中提供的新见解还可以在无故障执行中产生所有消息最佳可靠广播协议的图形理论表征。广播协议的上限和下限结果都可以推广到多播协议,在多播协议中,进程仅需要将消息传递到系统中进程的子集即可。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号