首页> 外文会议>International Conference on High Performance Computing for Computational Science >HPC Environment Management: New Challenges in the Petaflop Era
【24h】

HPC Environment Management: New Challenges in the Petaflop Era

机译:HPC环境管理:Petaflop时代的新挑战

获取原文

摘要

High Performance Computing (HPC) is becoming much more popular nowadays. Currently, the biggest supercomputers in the world have hundreds of thousands of processors and consequently may have more software and hardware failures. HPC centers managers also have to deal with multiple clusters from different vendors with their particular architectures. However, since there are not enough HPC experts to manage all the new supercomputers, it is expected that non-experts will be managing those large clusters. In this paper we study the new challenges to manage HPC environments containing different clusters with different sizes and architectures. We review available tools and present LEMMing [1], an easy-to-use open source tool developed to support high performance computing centers. LEMMing integrates machine resources and the available management and monitoring tools on a single point of management.
机译:高性能计算(HPC)现在变得更加流行。目前,世界上最大的超级计算机拥有数十万个处理器,因此可能拥有更多的软件和硬件故障。 HPC中心管理人员还必须处理来自不同供应商的多个集群,具有特定的架构。但是,由于HPC专家无法管理所有新超级计算机,因此预计非专家将管理这些大集群。在本文中,我们研究了管理包含不同尺寸和架构的不同群集的HPC环境的新挑战。我们审查可用的工具和目前的LEMMING [1],开发了一种易于使用的开源工具,以支持高性能计算中心。 LEMMING在单点管理点集成机械资源和可用管理和监控工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号