首页> 外文期刊>IEICE transactions on information and systems >A Concurrent Partial Snapshot Algorithm for Large-Scale and Dynamic Distributed Systems
【24h】

A Concurrent Partial Snapshot Algorithm for Large-Scale and Dynamic Distributed Systems

机译:大规模动态分布系统的并行部分快照算法

获取原文
           

摘要

Checkpoint-rollback recovery, which is a universal method for restoring distributed systems after faults, requires a sophisticated snapshot algorithm especially if the systems are large-scale, since repeatedly taking global snapshots of the whole system requires unacceptable communication cost. As a sophisticated snapshot algorithm, a partial snapshot algorithm has been introduced that takes a snapshot of a subsystem consisting only of the nodes that are communication-related to the initiator instead of a global snapshot of the whole system. In this paper, we modify the previous partial snapshot algorithm to create a new one that can take a partial snapshot more efficiently, especially when multiple nodes concurrently initiate the algorithm. Experiments show that the proposed algorithm greatly reduces the amount of communication needed for taking partial snapshots.
机译:作为故障后恢复分布式系统的通用方法,检查点回滚恢复需要复杂的快照算法,尤其是在系统规模较大的情况下,因为要重复获取整个系统的全局快照需要不可接受的通信成本。作为复杂的快照算法,已引入了部分快照算法,该算法获取子系统的快照,该子系统仅由与发起方通信相关的节点组成,而不是整个系统的全局快照。在本文中,我们修改了以前的部分快照算法,以创建一种可以更有效地拍摄部分快照的新算法,尤其是当多个节点同时启动该算法时。实验表明,该算法大大减少了拍摄部分快照所需的通信量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号