首页> 外文会议>International conference on parallel and distributed processing techniques and applications;PDPTA 2011 >Defining the Checkpoint Interval for Uncoordinated Checkpointing Protocols
【24h】

Defining the Checkpoint Interval for Uncoordinated Checkpointing Protocols

机译:定义不协调检查点协议的检查点间隔

获取原文

摘要

Parallel applications running on large computers suffer from the absence of a reliable environment. Fault tolerance proposals, in general, rely on rollback-recovery strategies supported by checkpoint and/or message logging. There are well-defined models that address the optimum checkpoint interval for coordinated checkpointing. Nevertheless, there is a lack of models concerning uncoordinated checkpointing combined with message logging. First we present a model designed for serial applications or coordinated checkpointing-based solutions. Our contribution is the extension of this model to a scenario based on uncoordinated checkpointing combined with message logging. We introduce two key points to minimise the fault tolerance overhead for parallel applications. The first is the use of a factor to represent the dependency relation between processes. The second is the use a specific checkpoint intervals for each process. Experiments show that our model performs as well as previous studies for serial applications or coordinated checkpointing. While running parallel applications using uncoordinated checkpointing combined with message logging, our checkpoint interval model effectively minimises the overhead introduced by the fault tolerance tasks. Moreover, the overhead prediction error is smaller than 5% for all applications tested.
机译:在大型计算机上运行的并行应用程序缺少可靠的环境。通常,容错建议依赖于检查点和/或消息记录支持的回滚恢复策略。有明确定义的模型可以解决用于协调检查点的最佳检查点间隔。但是,缺乏有关不协调检查点与消息记录的模型。首先,我们介绍一个为串行应用程序或基于协调检查点的解决方案而设计的模型。我们的贡献是将该模型扩展到基于不协调检查点结合消息日志记录的方案。我们介绍了两个要点,以最大程度地减少并行应用程序的容错开销。首先是使用一个因子来表示流程之间的依赖关系。第二个是为每个进程使用特定的检查点间隔。实验表明,我们的模型在串行应用或协调检查点方面的性能与以前的研究一样好。当使用不协调的检查点结合消息日志来运行并行应用程序时,我们的检查点间隔模型可以有效地减少容错任务引入的开销。此外,对于所有测试的应用,开销预测误差均小于5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号