首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Backfilling Using System-Generated Predictions Rather than User Runtime Estimates
【24h】

Backfilling Using System-Generated Predictions Rather than User Runtime Estimates

机译:使用系统生成的预测而非用户运行时估算值进行回填

获取原文
获取原文并翻译 | 示例

摘要

The most commonly used scheduling algorithm for parallel supercomputers is FCFS with backfilling, as originally introduced in the EASY scheduler. Backfilling means that short jobs are allowed to run ahead of their time provided they do not delay previously queued jobs (or at least the first queued job). However, predictions have not been incorporated into production schedulers, partially due to a misconception (that we resolve) claiming inaccuracy actually improves performance, but mainly because underprediction is technically unacceptable: users will not tolerate jobs being killed just because system predictions were too short. We solve this problem by divorcing kill-time from the runtime prediction and correcting predictions adaptively as needed if they are proved wrong. The end result is a surprisingly simple scheduler, which requires minimal deviations from current practices (e.g., using FCFS as the basis) and behaves exactly like EASY as far as users are concerned; nevertheless, it achieves significant improvements in performance, predictability, and accuracy. Notably, this is based on a very simple runtime predictor that just averages the runtimes of the last two jobs by the same user; counter intuitively, our results indicate that using recent data is more important than mining the history for similar jobs. All the techniques suggested in this paper can be used to enhance any backfilling algorithm and are not limited to EASY
机译:并行超级计算机最常用的调度算法是带回填功能的FCFS,最早在EASY调度程序中引入。回填意味着允许短作业提前运行,前提是它们不延迟先前排队的作业(或至少不延迟第一个排队的作业)。但是,预测尚未纳入生产调度程序中,部分原因是误解(我们已解决)声称不准确实际上会提高性能,但主要是因为预测不足在技术上是不可接受的:用户不会容忍仅仅因为系统预测太短而被杀死的工作。我们通过将运行时间预测中的消散时间相除,并在证明错误时根据需要进行自适应校正来解决此问题。最终结果是一个非常简单的调度程序,它要求与当前实践的偏差最小(例如,使用FCFS作为基础),并且就用户而言,其行为与EASY完全相同;但是,它在性能,可预测性和准确性方面都取得了重大改进。值得注意的是,这是基于一个非常简单的运行时预测器,它仅对同一用户的最后两个作业的运行时求平均。从直觉上来说,我们的结果表明,使用最新数据比挖掘类似工作的历史记录更为重要。本文建议的所有技术均可用于增强任何回填算法,并且不仅限于EASY

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号