首页> 外文期刊>Parallel Computing >Application controlled checkpointing coordination for fault-tolerant distributed computing systems
【24h】

Application controlled checkpointing coordination for fault-tolerant distributed computing systems

机译:容错分布式计算系统的应用控制检查点协调

获取原文
获取原文并翻译 | 示例
           

摘要

In order to provide fault tolerance for distributed systems, the checkpointing technique has widely been used and many researches have been performed to reduce the overhead of checkpointing coordination. In this paper, we present a new checkpointing coordination scheme in which the application controls the coordination activity by utilizing the commu- nication pattern of the application program. Unlike the previous solutions which do not utilize the communication pattern of cooperating processes. it is possible to reduce the coordination effort as well as the number of checkpoints enforced to be taken. Extensive simulations have been performed to evaluate the proposed scheme and we have concluded that the proposed scheme significantly reduces the coordination overhead compared with the existing loose coordination scheme.
机译:为了提供分布式系统的容错能力,检查点技术已被广泛使用,并进行了许多研究以减少检查点协调的开销。在本文中,我们提出了一种新的检查点协调方案,其中应用程序通过利用应用程序的通信模式来控制协调活动。与以前的解决方案不同,该解决方案没有利用协作过程的通信模式。可以减少协调工作,并减少要执行的检查点的数量。已经进行了广泛的仿真以评估所提出的方案,并且我们得出的结论是,与现有的松散协调方案相比,所提出的方案显着减少了协调开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号