首页> 外文会议>Parallel and Distributed Computing, Applications and Technologies, 2009 >A Classification-Based Approach to Fault-Tolerance Support in Parallel Programs
【24h】

A Classification-Based Approach to Fault-Tolerance Support in Parallel Programs

机译:并行程序中基于分类的容错支持方法

获取原文

摘要

Fault tolerance is an important requirement for long-running parallel programs. This paper presents a different approach to fault-tolerance support in message-passing parallel programs based on their structural and behavioral characteristics, commonly known as patterns. A classification of these patterns and their applicable fault-tolerance strategies is aimed to facilitate an application developer to incorporate appropriate fault-tolerance strategies to an application. Fault-tolerance strategies for two of the patterns are discussed, and one specific strategy is elaborated and analyzed. The presented strategies have been incorporated into a fault-tolerance support framework called FT-PAS. One objective of the framework is to separate the fault tolerance related details from an application developer's main objectives (separation-of-concerns). The paper presents the additional key features of the framework, and concludes with a discussion on current and future research directions.
机译:容错是长时间运行并行程序的重要要求。本文基于消息传递并行程序的结构和行为特征(通常称为模式)提出了一种不同的方法来支持消息传递并行程序中的容错。这些模式及其适用的容错策略的分类旨在促进应用程序开发人员将适当的容错策略合并到应用程序中。讨论了两种模式的容错策略,并阐述和分析了一种特定的策略。提出的策略已合并到称为FT-PAS的容错支持框架中。该框架的一个目标是将与容错相关的细节与应用程序开发人员的主要目标(关注分离)分开。本文介绍了该框架的其他关键特征,并在最后对当前和未来的研究方向进行了讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号