首页> 外文会议>Fault-Tolerant Computing, 1998. Digest of Papers. Twenty-Eighth Annual International Symposium on >RENEW: a tool for fast and efficient implementation of checkpoint protocols
【24h】

RENEW: a tool for fast and efficient implementation of checkpoint protocols

机译:更新:一种用于快速高效地执行检查点协议的工具

获取原文

摘要

This paper describes the design, implementation, and evaluation of a run-time system for clusters of workstations that allows the rapid testing of checkpoint protocols with standard benchmarks. To achieve this goal, RENEW provides a flexible set of operations that facilitates the integration of a protocol in the system with reduced programming effort. To support a broad range of applications, RENEW exports, as its external interface, the industry endorsed Message Passing Interface (MPI). Three distinct classes of protocols were evaluated using the RENEW environment with SPEC and NAS benchmarks on a network of workstations connected by ATM. It was observed that the communication-induced protocol emulated the behavior of the coordinated protocol, with comparable performance. The message logging protocol degraded the performance. Even though the message logging protocol was slower due to log replay, all three protocols required a similar amount of time to restore the application to the same state as before failure occurred and recovery was initiated.
机译:本文介绍了针对工作站集群的运行时系统的设计,实现和评估,该系统允许使用标准基准快速测试检查点协议。为了实现此目标,RENEW提供了一组灵活的操作,以减少编程工作量促进了协议在系统中的集成。为了支持广泛的应用,RENEW导出了业界认可的消息传递接口(MPI)作为其外部接口。在带有ATM连接的工作站网络上,使用带有SPEC和NAS基准的RENEW环境评估了三种不同类别的协议。据观察,由通信引起的协议模拟了协调协议的行为,并具有可比的性能。消息日志记录协议降低了性能。尽管由于日志重播,消息日志记录协议的速度较慢,但​​所有三种协议都需要相似的时间来将应用程序还原到与发生故障和启动恢复之前相同的状态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号