
METHOD FOR SOFTWARE ERROR RECOVERY USING CONSISTENT GLOBAL CHECKPOINTS


Abstract

Disclosed is a method for error recovery in a multiprocessing computer system of the type in which each of the processes periodically takes checkpoints. In the event of a failure, a process can be rolled back to a prior checkpoint, and execution can continue from the checkpointed state. A monitor process monitors the execution of the processes. Upon the occurrence of a failure, a target set of checkpoints is identified, and the maximum consistent global checkpoint, which includes the target set of checkpoints, is computed. Each of the processes is rolled back to an associated checkpoint in the consistent global checkpoint. Upon a subsequent occurrence of the same failure, a second set of checkpoints is identified, and the minimum consistent global checkpoint, which includes the target set of checkpoints, is computed. Each of the processes is rolled back to an associated checkpoint in the consistent global checkpoint. Upon another occurrence of the same failure, the system is rolled back further to a coordinated checkpoint. Also disclosed are novel methods for calculating the minimum and maximum consistent global checkpoints. In accordance with one embodiment, the minimum and maximum consistent global checkpoints are calculated by a central process. In accordance with another embodiment, the minimum and maximum consistent global checkpoints are calculated in a distributed fashion by each of the individual processes.

Bibliographic data

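  • Publication number: CA2185054A1

    Patent type:

  • Publication date: 1997-03-13

    Original format: PDF

  • Applicant/assignee: AT&T CORP.

    Application number: CA19962185054

  • Inventor: WANG YI-MIN

    Filing date: 1996-09-09

  • Classification: G06F15/16; G06F11/16

  • Country: CA

  • Database entry time: 2022-08-22 03:23:39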
