Journal: Parallel Algorithms and Applications

A fully informed model-based checkpointing protocol for preventing useless checkpoints



Abstract

Checkpointing and rollback recovery are widely used techniques for handling failures in distributed systems. When the processes involved in a distributed computation are allowed to take checkpoints independently, without any coordination with each other, some or all of the checkpoints taken may not be part of any consistent global checkpoint and hence are useless for recovery. Communication-induced checkpointing algorithms allow processes to take checkpoints independently while ensuring that each checkpoint taken is part of a consistent global checkpoint, by forcing processes to take some additional checkpoints. It is well known that it is impossible to design an optimal communication-induced checkpointing algorithm (i.e., a checkpointing algorithm that takes the minimum number of forced checkpoints). Researchers have therefore designed communication-induced checkpointing algorithms that reduce the number of forced checkpoints using different heuristics. In this paper, we present a communication-induced checkpointing algorithm that takes fewer forced checkpoints than some of the existing checkpointing algorithms in its class.
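
To make the notion of a forced checkpoint concrete, here is a minimal sketch of a simple index-based communication-induced rule. It is not the protocol proposed in the paper, whose details the abstract does not give. It assumes each process keeps a checkpoint index, piggybacks that index on every message, and takes a forced checkpoint before delivering a message whose index is larger than its own. The names `Process`, `Checkpoint`, `basic_checkpoint`, `send` and `receive` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Checkpoint:
    index: int    # checkpoint index (sequence number) at the time it was taken
    forced: bool  # True if induced by a message, False if taken independently


@dataclass
class Process:
    name: str
    index: int = 0
    checkpoints: list = field(default_factory=list)

    def __post_init__(self):
        # Assumption: every process starts with an initial checkpoint at index 0.
        self.checkpoints.append(Checkpoint(0, forced=False))

    def basic_checkpoint(self):
        # Independent (basic) checkpoint: advance the local index.
        self.index += 1
        self.checkpoints.append(Checkpoint(self.index, forced=False))

    def send(self):
        # The current index is piggybacked on every outgoing message.
        return self.index

    def receive(self, piggybacked_index):
        # Forced checkpoint: catch up with the sender's index *before*
        # the message is delivered to the application.
        if piggybacked_index > self.index:
            self.index = piggybacked_index
            self.checkpoints.append(Checkpoint(self.index, forced=True))
        # (message delivery to the application would happen here)


p, q = Process("P"), Process("Q")
p.basic_checkpoint()   # P takes a basic checkpoint, index 1
q.receive(p.send())    # Q is behind, so it takes a forced checkpoint at index 1
q.basic_checkpoint()   # Q takes a basic checkpoint, index 2
p.receive(q.send())    # P is behind, so it takes a forced checkpoint at index 2
q.receive(p.send())    # indices are equal, so no forced checkpoint is needed
```

In this toy run each process ends up with one basic and one forced checkpoint beyond the initial one. The heuristics mentioned in the abstract aim to skip forced checkpoints like these whenever additional piggybacked information shows they are unnecessary.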
