Journal: Concurrency and Computation

CPPC: a compiler-assisted tool for portable checkpointing of message-passing applications

Abstract

With the evolution of high-performance computing toward heterogeneous, massively parallel systems, parallel applications have developed new checkpoint and restart requirements. Whether due to a failure in the execution or to a migration of the application processes to different machines, checkpointing tools must be able to operate in heterogeneous environments. However, some of the data manipulated by a parallel application are not truly portable. Examples include opaque state (e.g. data structures for communications support) and diverse interfaces for a single feature (e.g. communications, I/O). Directly manipulating the underlying ad hoc representations renders checkpointing tools unable to work across different environments. Portable checkpointers usually work around portability issues at the cost of transparency: the user must provide information such as what data need to be stored, where to store them, or where to checkpoint. CPPC (ComPiler for Portable Checkpointing) is a checkpointing tool designed to feature both portability and transparency. It is made up of a library and a compiler. The CPPC library contains routines for variable-level checkpointing, using portable code and protocols. The CPPC compiler helps to achieve transparency by relieving the user of time-consuming tasks, such as data flow and communications analyses and the insertion of instrumentation code. This paper covers both the operation of the CPPC library and its compiler support. Experimental results using benchmarks and large-scale real applications are included, demonstrating usability, efficiency, and portability.
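
To make the idea of variable-level checkpointing concrete, here is a minimal sketch in Python. It is not the CPPC library or its API, which the abstract does not describe. The function names (`save_checkpoint`, `load_checkpoint`, `running_sum`), the use of JSON as the portable format, and the single-process setting are all assumptions chosen for illustration. The sketch saves only the named variables needed to resume, in a text format that does not depend on the byte order or word size of the machine, and the caller decides what to store, where to store it, and when to checkpoint.

```python
import json
import os
import tempfile


def save_checkpoint(path, variables):
    """Write the selected variables as JSON (hypothetical helper).

    JSON is text, so the file can be read back on a machine with a
    different byte order or word size.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        json.dump(variables, handle)
    # Rename only after the file is complete, so a crash in the middle
    # of a write cannot leave a truncated checkpoint behind.
    os.replace(tmp_path, path)


def load_checkpoint(path):
    """Return the saved variables, or None when no checkpoint exists."""
    if not os.path.exists(path):
        return None
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def running_sum(path, total_steps, stop_after=None):
    """Sum 0..total_steps-1, checkpointing after every step.

    stop_after simulates a failure: the function returns None once that
    many steps have completed during this call.
    """
    # Resume from the checkpoint if one exists, otherwise start fresh.
    state = load_checkpoint(path) or {"step": 0, "total": 0}
    done_here = 0
    while state["step"] < total_steps:
        state["total"] += state["step"]
        state["step"] += 1
        save_checkpoint(path, state)
        done_here += 1
        if stop_after is not None and done_here == stop_after:
            return None
    return state["total"]
```

Calling `running_sum(path, 10, stop_after=4)` and then `running_sum(path, 10)` with the same `path` resumes from the fourth step instead of starting over. In this sketch the user makes all three decisions by hand; the abstract describes the CPPC compiler as the component that relieves the user of such work.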