首页> 美国政府科技报告 >Tools and Techniques for Adding Fault Tolerance to Distributed and Parallel Programs.
【24h】

Tools and Techniques for Adding Fault Tolerance to Distributed and Parallel Programs.

机译:为分布式并行程序添加容错的工具和技术。

获取原文

摘要

The scale of parallel computing systems is rapidly approaching dimensions where fault tolerance can no longer be ignored. No matter how reliable the individual components may be, the complexity of these systems results in a significant probability of failure during lengthy computations. In the case of distributed memory multiprocessors, fault tolerance techniques developed for distributed operating systems and applications can be applied also to parallel computations. In the paper we survey some of the principal paradigms for fault-tolerant distributed computing and discuss their relevance to parallel processing. One particular technique--passive replication--is explored in detail as it forms the basis for fault tolerance in the Paralex parallel programming environment.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号