【24h】

PARALLEL CHECKPOINTING FACILITY ON A METASYSTEM

机译:元系统上的并行检查点功能

获取原文
获取原文并翻译 | 示例

摘要

This article describes experiences on incorporating uncoordinated checkpointing and recovery facilities in a Java-based metacomputing system. Our case study is SUMA, a metasystem for execution of Java bytecode, both sequential and parallel. We have incorporated an algorithm that implements a communication-induced checkpointing protocol into SUMA. This algorithm induces processes to take additional local (forced) checkpoints and to log all in-transit messages to ensure consistent global checkpoints. Preliminary results about the performance overhead produced by the implementation of this algorithm are presented.
机译:本文介绍了在基于Java的元计算系统中合并不协调的检查点和恢复功能的经验。我们的案例研究是SUMA,这是一个用于执行Java字节码(顺序和并行)的元系统。我们已经将一种算法实现了通信,该算法将通信引发的检查点协议实现到SUMA中。此算法促使进程采用其他本地(强制)检查点并记录所有在途消息,以确保一致的全局检查点。给出了有关该算法实施产生的性能开销的初步结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号