首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >A Constraint Programming Scheduler for Heterogeneous High-Performance Computing Machines
【24h】

A Constraint Programming Scheduler for Heterogeneous High-Performance Computing Machines

机译:异构高性能计算机的约束编程调度程序

获取原文
获取原文并翻译 | 示例

摘要

Scheduling and dispatching tools for high-performance computing (HPC) machines have the key role of mapping jobs to the available resources, trying to maximize performance and quality-of-service (QoS). Allocation and Scheduling in the general case are well-known NP-hard problems, forcing commercial schedulers to adopt greedy approaches to improve performance and QoS. Search-based approaches featuring the exploration of the solution space have seldom been employed in this setting, but mostly applied in off-line scenarios. In this paper, we present the first search-based approach to job allocation and scheduling for HPC machines, working in a production environment. The scheduler is based on Constraint Programming, an effective programming technique for optimization problems. The resulting scheduler is flexible, as it can be easily customized for dealing with heterogeneous resources, user-defined constraints and different metrics. We evaluate our solution both on virtual machines using synthetic workloads, and on the Eurora HPC with production workloads. Tests on a wide range of operating conditions show significant improvements in waitings and QoS in mid-tier HPC machines w.r.t state-of-the-art commercial rule-based dispatchers. Furthermore, we analyze the conditions under which our approach outperforms commercial approaches, to create a portfolio of scheduling algorithms that ensures robustness, flexibility and scalability.
机译:高性能计算(HPC)机器的调度和调度工具具有将作业映射到可用资源的关键作用,从而试图最大化性能和服务质量(QoS)。一般情况下,分配和调度是众所周知的NP难题,迫使商业调度程序采用贪婪的方法来提高性能和QoS。在这种情况下,很少使用基于解决方案空间探索的基于搜索的方法,但大多数用于离线方案。在本文中,我们介绍了在生产环境中工作的HPC机器的第一种基于搜索的作业分配和计划方法。调度程序基于约束编程,这是一种用于优化问题的有效编程技术。生成的调度程序非常灵活,因为可以轻松自定义其以处理异构资源,用户定义的约束和不同的度量标准。我们在使用合成工作负载的虚拟机上以及在具有生产工作负载的Eurora HPC上评估我们的解决方案。在广泛的工作条件下进行的测试表明,采用最先进的基于规则的商业调度程序,中层HPC计算机的等待时间和QoS有了显着改善。此外,我们分析了我们的方法胜过商业方法的条件,以创建可确保鲁棒性,灵活性和可伸缩性的调度算法组合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号