Journal: Open Computer Science

A methodology for the professional training of the management and evaluation of HPC systems


Abstract

This paper is motivated by the critical demand for experts and scientists who work in mathematical modeling, simulation, and big data techniques and who are familiar with the management of HPC systems from both the user's and the administrator's point of view. We created a new course entitled “HPC system management”. Its first goal is to give students knowledge and understanding of the complex problem of HPC system management as it concerns job scheduling. An important fact is that the job scheduling problem is NP-complete. The second objective of the course is to educate skilled experts who are able to design and implement programs, scripts, and models for job management that solve specific parts of this complex problem. The course is innovative in several respects. Its content is oriented toward HPC system management, in contrast to existing courses, which usually focus on the development of HPC applications. We also developed and provide a new educational methodology in the form of a scientific project, which decomposes the complex problem into subproblems and then combines the solutions to the subproblems into a unified model. The methodology focuses on generating a (pseudo-)optimal job schedule using data from real systems. The huge volume of data involved leads to ideas and methodologies of problem solving that are suitable for problems not solvable in polynomial time. The methodology also includes the implementation of a job scheduling simulator. The paper presents a pilot course in which students explore various scheduling algorithms and study their properties using data obtained from NorduGrid.
