首页> 外文会议>Calable high performance computing conference >Checkpointing SPMD applications on transputer networks
【24h】

Checkpointing SPMD applications on transputer networks

机译:检查切换网络上的SPMD应用程序

获取原文

摘要

Providing fault-tolerance for parallel/distributed applications is a problem of paramount importance, since the overall failure rate of the system increases with the number of processors, and the failure of just one processor can lend to the complete crash of the program. Checkpointing mechanisms are a good candidate to provide the continuity of the applications in the occurrence of failures. In this paper, we present an experimental study of several variations of checkpointing for SPMD (single process, multiple data) applications. We used a typical benchmark to experimentally assess the overhead, advantages and limitations of each checkpointing scheme.
机译:为并行/分布式应用程序提供容错是重要的重要性,因为系统的总体故障率随处理器的数量而增加,只有一个处理器的失败可以借入程序的完全崩溃。检查点的机制是一个很好的候选者,以便在发生故障的情况下提供应用的连续性。本文介绍了对SPMD(单程,多数据)应用的检查点几种变化的实验研究。我们使用典型的基准测试来通过实验评估每个检查点方案的开销,优缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号