首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing >One Can Only Gain by Replacing EASY Backfilling: A Simple Scheduling Policies Case Study
【24h】

One Can Only Gain by Replacing EASY Backfilling: A Simple Scheduling Policies Case Study

机译:一个人只能通过更换简易回填来获得:一个简单的调度政策案例研究

获取原文

摘要

High-Performance Computing (HPC) platforms are growing in size and complexity. In order to improve the quality of service of such platforms, researchers are devoting a great amount of effort to devise algorithms and techniques to improve different aspects of performance such as energy consumption, total usage of the platform, and fairness between users. In spite of this, system administrators are always reluctant to deploy state of the art scheduling methods and most of them revert to EASY-backfilling, also known as EASY-FCFS (EASY-First-Come-First-Served). Newer methods frequently are complex and obscure and the simplicity and transparency of EASY are too important to sacrifice. In this work, we used execution logs from five HPC platforms to compare four simple scheduling policies: FCFS, Shortest estimated Processing time First (SPF), Smallest Requested Resources First (SQF), and Smallest estimated Area First (SAF). Using simulations, we performed a thorough analysis of the cumulative results for up to 180 weeks and considered three scheduling objectives: waiting time, slowdown and per-processor slowdown. We also evaluated other effects, such as the relationship between job size and slowdown, the distribution of slowdown values, and the number of backfilled jobs, for each HPC platform and scheduling policy. We conclude that one can only gain by replacing EASY-backfilling with SAF with backfilling, as it offers improvements in performance by up to 80% in the slowdown metric while maintaining the simplicity and the transparency of FCFS. Moreover, SAF reduces the number of jobs with large slowdowns and the inclusion of a simple thresholding mechanism guarantees that no starvation occurs. Finally, we propose SAF as a new benchmark for future scheduling studies.
机译:高性能计算(HPC)平台的尺寸和复杂性成长。为了提高这些平台的服务质量,研究人员正在致力于设计大量努力来设计算法和技术,以改善性能的不同方面,例如能耗,平台总使用情况,以及用户之间的公平性。尽管如此,系统管理员始终不愿意部署艺术调度方法,大多数恢复到易于回填,也称为Easy-FCF(易于首先服务)。较新的方法经常复杂,晦涩难以置疑,简单和透明度变得非常重要,无法牺牲。在这项工作中,我们使用来自五个HPC平台的执行日志来比较四个简单的调度策略:FCFS,最短估计的处理时间首先(SPF),最小的请求资源首先(SQF),以及首先最小的估计区域(SAF)。使用模拟,我们对最多180周的累积结果进行了彻底的分析,并考虑了三个调度目标:等待时间,放缓和每个处理器放缓。我们还评估了其他效果,例如作业大小与放缓之间的关系,减速值分布,以及每个HPC平台和调度策略的回填作业的数量。我们得出结论,只有在使用回填时更换SAF易填写,只能通过易于回填来获得,因为它在减速度量中的性能提高了高达80%,同时保持了FCF的简单性和透明度。此外,SAF减少了大量放缓的作业数量,并包含简单的阈值机制,保证不会发生饥饿。最后,我们将SAF提出作为未来调度研究的新基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号