首页> 外文会议> >A hierarchical checkpointing protocol for parallel applications in cluster federations
【24h】

A hierarchical checkpointing protocol for parallel applications in cluster federations

机译:集群联合中并行应用程序的分层检查点协议

获取原文

摘要

Summary form only given. Code coupling applications can be divided into communicating modules, that may be executed on different clusters in a cluster federation. As a cluster federation comprises of a large number of nodes, there is a high probability of a node failure. We propose a hierarchical checkpointing protocol that combines a synchronized checkpointing technique inside clusters and a communication-induced technique between clusters. This protocol fits to the characteristics of a cluster federation (large number of nodes, high latency and low bandwidth networking technologies between clusters). A preliminary performance evaluation performed using a discrete event simulator shows that the protocol is suitable for code coupling applications.
机译:摘要表格仅给出。代码耦合应用程序可以分为通信模块,其可以在群集联合中的不同群集上执行。作为集群联合组合包括大量节点,节点故障的概率很高。我们提出了一种分层检查点化协议,该协议将群集内部的同步检查点技术与集群之间的通信引起的技术相结合。该协议适合集群联合的特征(大量节点,高延迟和群集之间的低带宽网络技术)。使用离散事件模拟器执行的初步性能评估显示该协议适用于代码耦合应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号