Progressive retry method and apparatus having reusable software modules for software failure recovery in multiprocess message-passing applications

Abstract

A progressive retry recovery system based on checkpointing, message logging, rollback, message replaying, and message reordering is disclosed. The system minimizes both the number of processes involved in a recovery and the total rollback distance. It includes a fault-tolerant software library whose functions application processes may invoke to implement fault tolerance. One set of functions lets an application process generate a heartbeat message at specified intervals to indicate that it is still active. Others specify critical memory, execute checkpoints that store backup copies of critical data, and restore critical data during a recovery. Further functions process the messages that an application process sends or receives and maintain logs of those messages. The progressive retry recovery method consists of a number of retry steps, each of which widens the scope of the rollback when the previous retry step fails.
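
As a rough illustration of the idea of escalating retry steps, the following Python sketch models a single process with a checkpoint and a receive log. It is not the patented library: the `Process` class, the `progressive_retry` helper, the three step names, and the order-dependent fault ("commit" handled before "prepare") are all hypothetical, chosen only to show a narrow retry failing and a wider one succeeding.

```python
from typing import Callable, List, Optional, Tuple


class Process:
    """Toy application process (hypothetical, for illustration only)."""

    def __init__(self) -> None:
        self.state: List[str] = []       # critical data
        self.checkpoint: List[str] = []  # backup copy of the critical data
        self.log: List[str] = []         # messages received since the checkpoint

    def take_checkpoint(self) -> None:
        # Save a copy of the critical data and start a fresh log.
        self.checkpoint = list(self.state)
        self.log.clear()

    def receive(self, msg: str) -> None:
        # Log the message before handling it so it can be replayed later.
        self.log.append(msg)
        self.handle(msg)

    def handle(self, msg: str) -> None:
        # Assumed order-dependent fault: "commit" must follow "prepare".
        if msg == "commit" and "prepare" not in self.state:
            raise RuntimeError("commit before prepare")
        self.state.append(msg)

    def rollback_and_replay(self, order: List[str]) -> bool:
        # Restore the checkpoint, then re-handle messages in the given order.
        self.state = list(self.checkpoint)
        try:
            for msg in order:
                self.handle(msg)
        except RuntimeError:
            return False
        return True


def progressive_retry(
    steps: List[Tuple[str, Callable[[], bool]]]
) -> Optional[str]:
    """Try each step in order of increasing scope; return the first that works."""
    for name, step in steps:
        if step():
            return name
    return None


def demo() -> Tuple[Optional[str], List[str]]:
    p = Process()
    p.take_checkpoint()
    for msg in ["commit", "prepare"]:  # arrival order that triggers the fault
        try:
            p.receive(msg)
        except RuntimeError:
            pass  # failure detected; the message is already in the log

    steps = [
        ("replay in logged order", lambda: p.rollback_and_replay(list(p.log))),
        ("replay reordered", lambda: p.rollback_and_replay(list(reversed(p.log)))),
        ("discard log and restart from checkpoint", lambda: p.rollback_and_replay([])),
    ]
    return progressive_retry(steps), p.state


if __name__ == "__main__":
    print(demo())
```

In this sketch the first step replays the log unchanged and hits the same fault, so the second step, which reorders the logged messages, is the one that recovers. The final step stands in for a larger-scope rollback and is reached only if the narrower ones fail.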
