Method and device for error detection and recovery with a predetermined replication style for distributed applications in a network

Abstract

An application module (A) running on a host computer in a computer network is failure-protected by one or more backup copies that are operative on other host computers in the network. To effect fault protection, the application module registers itself with a ReplicaManager daemon process (112) by sending a registration message. In addition to identifying the registering application module and the host computer on which it is running, the message specifies the replication strategy (cold backup, warm backup, or hot backup) and the degree of replication associated with that application module. The backup copies are then maintained in a fail-over state according to the registered replication strategy. A WatchDog daemon (113), running on the same host computer as the registered application, periodically monitors the registered application to detect failures. When a failure of the application module, such as a crash or hang, is detected, it is reported to the ReplicaManager, which carries out the requested fail-over actions. An additional backup copy is then made operative in accordance with the registered replication style and the registered degree of replication. A SuperWatchDog daemon process (115-1), running on the same host computer as the ReplicaManager, monitors each host computer in the computer network. When a host failure is detected, each application module running on that host computer is individually failure-protected in accordance with its registered replication style and degree of replication.
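The registration and fail-over flow described in the abstract can be illustrated with a small in-memory sketch. This is not the patented implementation: the class, method, and field names are hypothetical, the degree of replication is assumed to mean the number of backup copies to keep, and the WatchDog and SuperWatchDog daemons are represented only by the two report methods they would call. How a cold, warm, or hot backup is actually kept ready is not modeled; the style is simply recorded.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Set


class ReplicationStyle(Enum):
    COLD = "cold"
    WARM = "warm"
    HOT = "hot"


@dataclass
class Registration:
    """Fields the abstract says a registration message carries."""
    module: str
    host: str
    style: ReplicationStyle
    degree: int  # assumption: number of backup copies to maintain


@dataclass
class ReplicaManager:
    """Hypothetical stand-in for the ReplicaManager daemon."""
    hosts: Set[str]
    registry: Dict[str, Registration] = field(default_factory=dict)
    primary: Dict[str, str] = field(default_factory=dict)
    backups: Dict[str, List[str]] = field(default_factory=dict)

    def _top_up(self, module: str) -> None:
        # Place backups on unused live hosts until the registered degree is met.
        used = {self.primary[module], *self.backups[module]}
        for host in sorted(self.hosts - used):
            if len(self.backups[module]) >= self.registry[module].degree:
                break
            self.backups[module].append(host)

    def register(self, msg: Registration) -> None:
        self.registry[msg.module] = msg
        self.primary[msg.module] = msg.host
        self.backups[msg.module] = []
        self._top_up(msg.module)

    def report_module_failure(self, module: str) -> None:
        # What a WatchDog would call: promote a backup, then restore the degree.
        if not self.backups[module]:
            raise RuntimeError(f"no backup copy available for {module}")
        self.primary[module] = self.backups[module].pop(0)
        self._top_up(module)

    def report_host_failure(self, host: str) -> None:
        # What a SuperWatchDog would call: handle each affected module individually.
        self.hosts.discard(host)
        for module in self.registry:
            if host in self.backups[module]:
                self.backups[module].remove(host)
            if self.primary[module] == host:
                self.report_module_failure(module)
            else:
                self._top_up(module)


manager = ReplicaManager(hosts={"h1", "h2", "h3"})
manager.register(Registration("A", "h1", ReplicationStyle.WARM, degree=1))
manager.report_module_failure("A")   # module A crashed or hung on h1
manager.report_host_failure("h2")    # the host now running A went down
print(manager.primary["A"], manager.backups["A"])
```

Keeping the style and degree in the per-module registration is what lets a host failure be handled module by module, as the abstract describes, rather than with a single policy for the whole host.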
