首页> 外文会议>Iranian Conference on Electrical Engineering >A diskless chekpointing approach for failure recovery in multiprocessor safety-critical embedded systems
【24h】

A diskless chekpointing approach for failure recovery in multiprocessor safety-critical embedded systems

机译:用于多处理器安全关键嵌入式系统中故障恢复的无盘检查点方法

获取原文

摘要

Backward recovery is the one of the most important techniques for error recovery in safety-critical systems which are usually based on nonvolatile memories. Since storing checkpoints in hard disks -as a nonvolatile memory- imposes noteworthy timing overhead to the system, diskless checkpointing would be a good solution for low cost fault tolerance in parallel and distributed systems. In this paper an algorithm is proposed which is able to recover a multiprocessor system from failure when up to half of the processors are failed, simultaneously. In contrast to many existing work, in the presented work each processor can have more than one task. The algorithm also by grouping tasks and by coding checkpoints eliminates the need of hard and nonvolatile disks to store checkpoints. The simulation results show the ability of the proposed algorithm in recovering system from failure when up to half of processors are simultaneously failed without using any extra dedicated checkpointing processor. Also compared to the existing approaches, the presented method requires fewer processors.
机译:在通常基于非易失性存储器的安全关键型系统中,向后恢复是用于错误恢复的最重要技术之一。由于将检查点存储在硬盘中(作为非易失性存储器)会给系统带来明显的定时开销,因此,无盘检查点将是并行和分布式系统中低成本容错能力的良好解决方案。本文提出了一种算法,该算法能够在多达一半的处理器同时发生故障时从故障中恢复多处理器系统。与许多现有工作相反,在提出的工作中,每个处理器可以执行多个任务。该算法还通过对任务进行分组和对检查点进行编码,消除了使用硬盘和非易失性磁盘存储检查点的需求。仿真结果表明,在不使用任何额外专用检查点处理器的情况下,当多达一半的处理器同时发生故障时,所提算法能够从故障中恢复系统。还与现有方法相比,所提出的方法需要更少的处理器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号