首页> 外文期刊>Journal of Parallel and Distributed Computing >Quantifying rollback propagation in distributed checkpointing
【24h】

Quantifying rollback propagation in distributed checkpointing

机译:量化分布式检查点中的回滚传播

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a new classification of executions with checkpoints based on the amount of rollback during recovery. Specifically, an execution is k-rollback, if k indicates the maximal number of checkpoints that have to be rolled back. It is shown that coordinated checkpointing, SZPF, and ZPF are 1-rollback, while ZCF is (n - l)-rollback, where n is the number of participants in an execution. A new class of executions, called d-bounded cycles (in short, d-BC), is introduced, and is shown to be ((n - 1) · d)-rollback (ZCF is a special case of d-BC for d = 1). Finally, a protocol is presented whose executions are d-bounded cycles. A nice property of this protocol is that it does not impose any control information overhead on application messages, yet sends only a few control messages of its own. Moreover, the protocol maintains information that enables very efficient discovery of a recent recovery line that existed shortly before the failure.
机译:本文基于恢复期间的回滚量,提出了一种新的带有检查点的执行分类。具体来说,如果k表示必须回滚的最大检查点数,则执行为k滚回。显示协调检查点,SZPF和ZPF为1回滚,而ZCF为(n-1)回滚,其中n是执行的参与者数。引入了一种新的执行类,称为d界周期(简称d-BC),并显示为((n-1)·d)-回滚(ZCF是d-BC的特殊情况d = 1)。最后,给出了一个协议,其执行是d界周期。该协议的一个很好的特性是它不会在应用程序消息上施加任何控制信息开销,而仅发送一些自己的控制消息。此外,该协议维护的信息可以非常有效地发现发生故障之前不久存在的最近恢复线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号