首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >When the herd is smart: Aggregate behavior in the selection of job request
【24h】

When the herd is smart: Aggregate behavior in the selection of job request

机译:当群体聪明时:在选择工作请求时汇总行为

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In most parallel supercomputers, submitting a job for execution involves specifying how many processors are to be allocated to the job. When the job is moldable (i.e., there is a choice on how many processors the job uses), an application scheduler called SA can significantly improve job performance by automatically selecting how many processors to use. Since most jobs are moldable, this result has great impact to the current state of practice in supercomputer scheduling. However, the widespread use of SA can change the nature of workload processed by supercomputers. When many SAs are scheduling jobs on one supercomputer, the decision made by one SA affects the state of the system, therefore impacting other instances of SA. In this case, the global behavior of the system comes from the aggregate behavior caused by all SAs. In particular, it is reasonable to expect the competition for resources to become tougher with multiple SAs, and this tough competition to decrease the performance improvement attained by each SA individually. This paper investigates this very issue. We found that the increased competition indeed makes it harder for each individual instance of SA to improve job performance. Nevertheless, there are two other aggregate behaviors that override increased competition when the system load is moderate to heavy. First, as load goes up, SA chooses smaller requests, which increases efficiency, which effectively decreases the offered load, which mitigates long wait times. Second, better job packing and fewer jobs in the system make it easier for incoming jobs to fit in the supercomputer schedule, thus reducing wait times further. As a result, in moderate to heavy load conditions, a single instance of SA benefits from the fact that other jobs are also using SA.
机译:在大多数并行超级计算机中,提交要执行的作业涉及指定要为该作业分配多少处理器。当作业是可模制的(即,可以选择使用多少个处理器)时,称为SA的应用程序调度程序可以通过自动选择要使用的处理器数量来显着提高作业性能。由于大多数作业都是可模制的,因此该结果对超级计算机调度中的当前实践状态有很大影响。但是,SA的广泛使用可以改变超级计算机处理的工作负载的性质。当许多SA在一台超级计算机上调度作业时,一个SA做出的决定会影响系统状态,从而影响SA的其他实例。在这种情况下,系统的全局行为来自所有SA引起的聚合行为。特别是,可以合理地预期,随着多个SA的争夺资源的竞争将变得更加激烈,而这种激烈的竞争会降低每个SA单独实现的性能提升。本文对此问题进行了调查。我们发现竞争的加剧确实使SA的每个个体都很难提高工作绩效。但是,当系统负载为中等到沉重时,还有另外两种聚合行为可以抵消竞争的加剧。首先,随着负载的增加,SA选择较小的请求,从而提高了效率,有效地减少了所提供的负载,从而减轻了较长的等待时间。其次,更好的工作打包和更少的系统工作使传入的工作更容易适应超级计算机的计划,从而进一步减少了等待时间。结果,在中等负载到重负载的情况下,SA的单个实例受益于其他作业也使用SA的事实。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号