...
首页> 外文期刊>Procedia Computer Science >Portable Application-level Checkpointing for Hybrid MPI-OpenMP Applications
【24h】

Portable Application-level Checkpointing for Hybrid MPI-OpenMP Applications

机译:混合MPI-OpenMP应用程序的便携式应用程序级检查点

获取原文
           

摘要

As parallel machines increase their number of processors, so does the failure rate of the global system, thus, long-running applications will need to make use of fault tolerance techniques to ensure the successful execution completion. Most of current HPC systems are built as clusters of multicores. The hybrid MPI-OpenMP paradigm provides numerous benefits on these systems. This paper presents a checkpointing solution for hybrid MPI-OpenMP applications, in which checkpoint consistency is guaranteed by using a coordination protocol intra-node, while no inter-node coordination is needed. The proposal reduces network utilization and storage resources in order to optimize the I/O cost of fault tolerance, while minimizing the checkpointing overhead. Besides, the portability of the solution and the dynamic parallelism provided by OpenMP enable the restart of the applications using machines with different architectures, operating systems and/or number of cores, adapting the number of running OpenMP threads for the best exploitation of the available resources. Extensive evaluation using hybrid MPI-OpenMP applications from the ASC Sequoia Benchmark Codes and NERSC-8/Trinity benchmarks is presented, showing the effectiveness and efficiency of the approach.
机译:随着并行机增加处理器数量,全局系统的故障率也会增加,因此,长时间运行的应用程序将需要使用容错技术来确保成功执行。当前大多数HPC系统都构建为多核集群。混合MPI-OpenMP范例在这些系统上提供了许多好处。本文提出了一种用于混合MPI-OpenMP应用程序的检查点解决方案,其中使用节点间的协调协议可确保检查点的一致性,而无需节点间的协调。该提案降低了网络利用率和存储资源,以优化容错的I / O成本,同时最大程度地减少了检查点开销。此外,该解决方案的可移植性和OpenMP提供的动态并行性使您可以使用具有不同体系结构,操作系统和/或内核数的计算机重新启动应用程序,并调整正在运行的OpenMP线程的数量,以最佳地利用可用资源。提出了使用ASC红杉基准代码和NERSC-8 / Trinity基准的混合MPI-OpenMP应用程序进行的广泛评估,显示了该方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号