首页> 外文会议>IEEE High Performance Extreme Computing Conference >Self-Scaling Clusters and Reproducible Containers to Enable Scientific Computing
【24h】

Self-Scaling Clusters and Reproducible Containers to Enable Scientific Computing

机译:自我缩放的集群和可重复的容器,以实现科学计算

获取原文

摘要

Container technologies such as Docker have become a crucial component of many software industry practices especially those pertaining to reproducibility and portability. The containerization philosophy has influenced the scientific computing community, which has begun to adopt - and even develop - container technologies (such as Singularity). Leveraging containers for scientific software often poses challenges distinct from those encountered in industry, and requires different methodologies. This is especially true for HPC. With an increasing number of options for HPC in the cloud (including NSF-funded cloud projects), there is strong motivation to seek solutions that provide flexibility to develop and deploy scientific software on a variety of computational infrastructures in a portable and reproducible way. The Cyberinfrastructure Resource Integration team in the XSEDE project has developed a simple tool which provides HPC infrastructure in the cloud that scales with user demand. We now present one possible solution which uses the Nix package manager in an MPI-capable Docker container that is converted to Singularity. It provides consistent installations, dependencies, and environments in each image that are reproducible and portable across scientific computing infrastructures. We demonstrate the utility of these containers with cluster benchmark runs in a self-scaling virtual cluster using the Slurm scheduler deployed in the Jetstream and Aristotle Red Cloud OpenStack clouds. We conclude that this technique is useful as a template for scientific software application containers to be used in the XSEDE compute environment, other Singularity HPC environments, and cloud computing environments.
机译:诸如Docker等集装箱技术已成为许多软件行业实践的重要组成部分,尤其是与可重复性和可移植性有关的重要组成部分。集装箱化哲学影响了科学计算界,已经开始采用 - 甚至开发 - 集装箱技术(如奇点)。用于科学软件的利用容器经常造成与工业中遇到的挑战,并且需要不同的方法。 HPC尤其如此。随着云中HPC的越来越多的选择(包括NSF资助的云项目),寻求解决方案具有强烈的动机,以便以便携式和可重复的方式在各种计算基础架构上开发和部署科学软件的灵活性。 XSEDE项目中的Cyber​​Infrastructure资源集成团队开发了一个简单的工具,在云中提供了HPC基础架构,该云中的用户需求缩放。我们现在呈现了一种可能的解决方案,该解决方案在MPI的Docker容器中使用NIX包管理器,该容器被转换为奇点。它在每个图像中提供了一致的安装,依赖性和环境,这些图像在科学计算基础架构上可再现和便携。我们展示了这些容器的实用程序在使用JetStream和Aristotle Red Cloud OpenStack云中部署的Slurm Scheduler中,这些容器与群集基准运行。我们得出结论,该技术可用作用于在XSEDE计算环境,其他奇点HPC环境和云计算环境中使用的科学软件应用程序集装箱的模板。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号