【24h】

A Formal Model for Fault-Tolerance in Distributed Systems

机译:分布式系统中的容错的正式模型

获取原文
获取原文并翻译 | 示例

摘要

We present a formal method based on graph rewriting systems for the specifications and the proofs of fault-tolerant distributed algorithms. Our method deals with crash failures. In a crash failure system the process can fail by crashing, i.e. by permanently halting. The faulty processes are the processes contaminated by the crashes. The methodology is formalized in two phases. In the first phase, we build the set of illegitimate configurations to specify the faults and the faulty processes. The second phase is devoted to the addition of correction rules in the initial graph rewriting system used to encode the distributed algorithm. These rules are able to detect and eliminate the faults locally during the computation. This method can be implemented under an asynchronous message passing system which notifies the faults. To illustrate this approach, we present examples of fault-tolerant distributed spanning tree algorithms.
机译:我们提出了一种基于图形重写系统的形式化方法,用于规范和容错分布式算法的证明。我们的方法处理崩溃失败。在崩溃失败系统中,该过程可能因崩溃而失败,即由于永久停止而失败。有故障的进程是崩溃所污染的进程。该方法分为两个阶段。在第一阶段,我们构建非法配置集以指定故障和故障过程。第二阶段致力于在用于对分布式算法进行编码的初始图形重写系统中添加校正规则。这些规则能够在计算过程中局部检测和消除故障。该方法可以在通知故障的异步消息传递系统下实现。为了说明这种方法,我们提供了容错分布式生成树算法的示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号