首页> 外文OA文献 >A Novel Low-Overhead Recovery Approach for Distributed Systems
【2h】

A Novel Low-Overhead Recovery Approach for Distributed Systems

机译:分布式系统的新型低开销恢复方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We have addressed the complex problem of recovery for concurrent failures in distributed computing environment. We have proposed a new approach in which we have effectively dealt with both orphan and lost messages. The proposed checkpointing and recovery approaches enable each process to restart from its recent checkpoint and hence guarantee the least amount of recomputation after recovery. It also means that a process needs to save only its recent local checkpoint. In this regard, we have introduced two new ideas. First, the proposed value of the common checkpointing interval is such that it enables an initiator process to log the minimum number of messages sent by each application process. Second, the determination of the lost messages is always done a priori by an initiator process; besides it is done while the normal distributed application is running. This is quite meaningful because it does not delay the recovery approach in any way.
机译:我们已经解决了分布式计算环境中并发故障的复杂恢复问题。我们提出了一种新的方法,可以有效地处理孤儿信息和丢失的信息。提议的检查点和恢复方法使每个进程都可以从其最近的检查点重新启动,从而保证恢复后最少的重新计算量。这也意味着进程仅需要保存其最近的本地检查点。在这方面,我们引入了两个新想法。首先,公共检查点间隔的建议值应使启动程序能够记录每个应用程序进程发送的最小消息数。其次,丢失消息的确定总是由发起方过程事先进行的;此外,它是在正常的分布式应用程序运行时完成的。这非常有意义,因为它不会以任何方式延迟恢复方法。

著录项

  • 作者

    Gupta B.; Rahimi Shahram;

  • 作者单位
  • 年度 2009
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号