首页> 外文会议>Advances in Grid and Pervasive Computing >Domino-Effect Free Crash Recovery for Concurrent Failures in Cluster Federation
【24h】

Domino-Effect Free Crash Recovery for Concurrent Failures in Cluster Federation

机译:集群联合中并发故障的Domino影响免费崩溃恢复

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we have addressed the complex problem of recovery for concurrent failures in cluster computing environment. We have proposed a new approach in which we have dealt with both inter cluster orphan and lost messages unlike the existing works. The proposed recovery approach is free from the domino-effect and hence guarantees the least amount of re-computation after recovery. Besides, a process needs to save only its recent local checkpoint, which is also the case for a cluster. So number of trips to stable storage per process is always one during recovery. The proposed common check pointing interval is such that it enables a process to log the minimum number of messages it has sent. These features make our approach superior to the existing works.
机译:在本文中,我们已经解决了集群计算环境中并发故障的复杂恢复问题。我们提出了一种新方法,与现有工作不同,我们既处理集群间孤儿消息又丢失了消息。提议的恢复方法没有多米诺骨牌效应,因此可以保证恢复后最少的重新计算量。此外,进程仅需要保存其最近的本地检查点,对于集群也是如此。因此,在恢复过程中,每个进程到稳定存储的旅行次数始终为一。提议的通用检查指向间隔是这样的,它使进程能够记录已发送的最小消息数。这些功能使我们的方法优于现有作品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号