首页> 外文会议>IASTED International Conference on Modelling and Simulation >Checkpointing and recovery for mobile distributed systems: a new approach
【24h】

Checkpointing and recovery for mobile distributed systems: a new approach

机译:检查和恢复移动分布式系统:一种新方法

获取原文
获取外文期刊封面目录资料

摘要

Traditional checkpointing and rollback recovery schemes designed for a static network, are not suitable for applications running on a mobile computing system (3, 11). Issues like mobility of the nodes, long disconnection periods of mobile hosts (MHs), lack of stable storage in MH, high fault rate and available limited communication bandwidth of wireless links needs to be considered. Traditional checkpointing and recovery schemes for static networks can be broadly classified into synchronous, asynchronous and quasi-synchronous [5]. All these strategies have been explored for designing checkpointing schemes for mobile environment. This paper presents a checkpointing technique which uses probabilistic approach to avoid excessive checkpointing activity at mobile nodes in "no receive after send (NRAS)"[1, 3] checkpointing algorithm, which is SZPF protocol[5]. In our scheme a mobile node skips some of the checkpoints which may be needed to construct a consistent global state in case of a fault. Our protocol uses the message logs to construct these skipped checkpoints if they are needed during recovery. We show that the probability that a skipped checkpoint will be required during recovery is very low at moderate fault rates. We also present a simple recovery algorithm for distributed mobile systems with this checkpointing activity. The probability to take a checkpoint at mobile node depends on the fault rate and message pattern. This probability makes the system adaptive to the fault rate and message pattern, and reduces the load on the wireless links. The proposed scheme is compared with the existing deterministic schemes through simulation.
机译:用于静态网络的传统检查点和回滚恢复方案,不适合在移动计算系统(3,11)上运行的应用程序。需要的问题,如节点的移动性,移动主机(MHS)的长断开周期,MH中缺乏稳定存储,高故障率和可用的无线链路的可用有限通信带宽。静态网络的传统检查和恢复方案可以广泛分为同步,异步和准同步[5]。为设计移动环境的检查点方案探索了所有这些策略。本文介绍了一种检查点技术,它使用概率方法来避免在“发送(NRAS)”[1,3]检查点算法中的“无接收”中的移动节点中的过度检查点活动,这是SZPF协议[5]。在我们的方案中,移动节点跳过某些检查点,在故障情况下可能需要在发生一致的全局状态。如果在恢复期间,我们的协议使用消息日志构建这些跳过的检查点。我们表明,在恢复期间将需要跳过检查点的概率非常低,处于适度的故障率。我们还提供了一种具有此检查点活动的分布式移动系统的简单恢复算法。在移动节点处拍摄检查点的概率取决于故障率和消息模式。这种概率使系统自适应到故障率和消息模式,并降低了无线链路上的负载。通过模拟将所提出的方案与现有的确定性方案进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号