首页> 外文会议>International Conference on Parallel and Distributed Computing: Applications and Technologies(PDCAT 2004); 20041208-10; Singapore(SG) >A Communication-Induced Checkpointing and Asynchronous Recovery Algorithm for Multithreaded Distributed Systems
【24h】

A Communication-Induced Checkpointing and Asynchronous Recovery Algorithm for Multithreaded Distributed Systems

机译:基于通信的多线程分布式系统检查点和异步恢复算法

获取原文
获取原文并翻译 | 示例

摘要

Checkpointing and recovery in traditional distributed systems is relatively well established. However, checkpointing and recovery in multithreaded distributed systems has not been studied in the literature. Using the traditional checkpointing and recovery algorithms in multithreaded systems leads to false causality problem and high checkpointing overhead. The checkpointing algorithm is implemented at the process level to reduce number of checkpoints and the recovery algorithm is implemented at the thread level which minimizes the false causality problem. The algorithm also takes advantage of the communication-induced checkpointing method to reduce the message overhead.
机译:传统分布式系统中的检查点和恢复相对完善。但是,文献中尚未研究多线程分布式系统中的检查点和恢复。在多线程系统中使用传统的检查点和恢复算法会导致错误的因果关系问题和较高的检查点开销。检查点算法是在过程级别实现的,以减少检查点的数量,而恢复算法是在线程级别实现的,这使虚假因果关系问题最小化。该算法还利用了通信引发的检查点方法来减少消息开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号