【24h】

The Do-All Problem with Byzantine Processor Failures

机译:拜占庭式处理器故障的所有问题

获取原文

摘要

Do-All is the abstract problem of using n processors to cooperatively perform m independent tasks in the presence of failures. This problem can be used as the cornerstone in identifying aspects of the trade-off between efficiency and fault-tolerance in cooperative computing and in developing efficient and fault-tolerant algorithms for distributed cooperative applications. Many algorithms have been developed for Do-All in various models of computation, including message-passing, partitionable networks, and shared-memory models and under various failure models. However, to the best of our knowledge, Do-All has not been studied under Byzantine processor failures, where a faulty processor may exhibit completely unconstrained behavior. Byzantine failures model any arbitrary type of processor malfunction, including for example, failures of individual components within the processors. In this work we study the Do-All problem for synchronous message-passing processors prone to Byzantine failures. In particular, we present upper and lower bound results on the efficiency of Do-All for several cases: we consider the case where the maximum number of faulty processors f is known a priori, the case where f is not known, the case where a task that has been executed can be verified (without re-executing it) and the case where task executions cannot be verified. We evaluate our algorithms in terms of their work and message complexities. The work complexity accounts all computational steps taken by the processors and the message complexity accounts all messages sent by the processors during the computation. The work and messages of a faulty processor are counted only until the processor fails to follow the algorithm being executed.
机译:do-all是使用n处理器在发生故障的情况下协同执行M独立任务的抽象问题。这个问题可以用作识别协同计算中效率和容错之间的折衷方面的基石以及分布式协作应用的高效和容错算法。许多算法已经开发了各种计算模型,包括消息传递,分区网络和共享存储器模型以及在各种故障模型下。然而,据我们所知,DO-ALL尚未在拜占庭处理器故障下进行研究,其中有故障的处理器可能表现出完全不受约束的行为。拜占庭故障模型模型任何任意类型的处理器故障,包括例如处理器内各个组件的故障。在这项工作中,我们研究同步消息传递处理器容易发生拜占庭的失败的所有问题。特别是,我们对多种情况的效率提高了上下界限:我们考虑了已知最大故障处理器F的最大数量的情况,其中F不知道的情况,其中a可以验证已执行的任务(无需重新执行它),并且无法验证任务执行的情况。我们在其工作和消息复杂性方面评估我们的算法。工作复杂性账户处理器和消息复杂性账户所采用的所有计算步骤,处理器在计算期间发送的所有消息。故障处理器的工作和消息仅计为直到处理器无法遵循正在执行的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号