首页> 外文期刊>Journal of supercomputing >Analysis of parallel application checkpoint storage for system configuration
【24h】

Analysis of parallel application checkpoint storage for system configuration

机译:系统配置并行应用检查点存储分析

获取原文
获取原文并翻译 | 示例
       

摘要

The use of fault tolerance strategies such as checkpoints is essential to maintain the availability of systems and their applications in high-performance computing environments. However, checkpoint storage can impact the performance and scalability of parallel applications that use message passing. In the present work, a study is carried out on the elements that can impact the storage of the checkpoint and how these can influence the scalability of an application with fault tolerance. A methodology has been designed based on predicting the size of the checkpoint when the number of processes, the application workload or the mapping varies, using a reduced number of resources. By following this methodology, the system administrator will be able to make decisions about what should be done with the number of processes used and the number of appropriate nodes, adjusting the process mapping in applications that use checkpoints.
机译:在高性能计算环境中使用诸如检查点等容错策略的使用是必不可少的,以维持系统的可用性及其应用。 但是,检查点存储可能会影响使用消息传递的并行应用程序的性能和可扩展性。 在本作本作中,在可以影响检查点存储的元素上进行研究以及如何影响具有容错能力的应用的可扩展性。 使用减少数量的资源来预测检查点的大小,基于预测检查点的大小来设计一种方法。 通过以下方法,系统管理员将能够做出关于使用所使用的进程数和适当节点数量的决定,调整使用检查点的应用程序中的进程映射。

著录项

  • 来源
    《Journal of supercomputing》 |2021年第5期|4582-4617|共36页
  • 作者单位

    Univ Autonoma Barcelona Comp Architecture & Operating Syst Dept Barcelona 08193 Spain;

    Univ Autonoma Barcelona Comp Architecture & Operating Syst Dept Barcelona 08193 Spain;

    Univ Autonoma Barcelona Comp Architecture & Operating Syst Dept Barcelona 08193 Spain;

    Univ Autonoma Barcelona Comp Architecture & Operating Syst Dept Barcelona 08193 Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Fault tolerance; Checkpoint; Scalability; HPC systems; MPI application;

    机译:容错;检查点;可扩展性;HPC系统;MPI应用;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号