首页> 外文会议>IEEE International conference on cluster computing >Balancing job performance with system performance via locality-aware scheduling on torus-connected systems
【24h】

Balancing job performance with system performance via locality-aware scheduling on torus-connected systems

机译:通过与环型连接的系统上的位置感知调度来平衡工作性能与系统性能

获取原文

摘要

Torus-connected network is widely used in modern supercomputers due to its linear per node cost scaling and its competitive overall performance. Job scheduling system plays a critical role for the efficient use of supercomputers. As supercomputers continue growing in size, a fundamental problem arises: how to effectively balance job performance with system performance on torus-connected machines? In this work, we will present a new scheduling design named window-based locality-aware scheduling. Our design contains three novel features. First, rather than one-by-one job scheduling, our design takes a “window” of jobs, i.e. multiple jobs, into consideration for job prioritizing and resource allocation. Second, our design maintains a list of slots to preserve node contiguity information for resource allocation. Finally, we formulate our scheduling decision making into a 0-1 Multiple Knapsack Problem and present two algorithms to solve the problem. A series of trace-based simulations using job logs collected from production supercomputers indicate that this new scheduling design has real potentials and can effectively balance job performance and system performance.
机译:圆环连接网络由于其线性的每节点成本缩放和具有竞争力的整体性能而被广泛用于现代超级计算机中。作业调度系统对于高效使用超级计算机起着至关重要的作用。随着超级计算机规模的不断增长,出现了一个基本问题:如何在与环型连接的计算机上有效地平衡工作性能和系统性能?在这项工作中,我们将提出一种新的调度设计,称为基于窗口的位置感知调度。我们的设计包含三个新颖的功能。首先,我们的设计而不是一对一的作业调度,而是考虑了作业的“窗口”,即多个作业,以考虑作业优先级和资源分配。其次,我们的设计维护一个插槽列表,以保留节点连续性信息以进行资源分配。最后,我们将调度决策公式化为0-1多重背包问题,并提出了两种算法来解决该问题。使用从生产超级计算机收集的作业日志进行的一系列基于跟踪的模拟表明,这种新的调度设计具有真正的潜力,并且可以有效地平衡作业性能和系统性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号