首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Generalized Cost-Based Job Scheduling in Very Large Heterogeneous Cluster Systems
【24h】

Generalized Cost-Based Job Scheduling in Very Large Heterogeneous Cluster Systems

机译:基于概括的成本基于成本的工作调度在非常大的异构集群系统中

获取原文
获取原文并翻译 | 示例

摘要

We study job assignment in large, heterogeneous resource-sharing clusters of servers with finite buffers. This load balancing problem arises naturally in today's communication and big data systems, such as Amazon Web Services, Network Service Function Chains, and Stream Processing. Arriving jobs are dispatched to a server, following a load balancing policy that optimizes a performance criterion such as job completion time. Our contribution is a randomized Cost-Based Scheduling (CBS) policy in which the job assignment is driven by general cost functions of the server queue lengths. Beyond existing schemes, such as the Join the Shortest Queue (JSQ), the power of d or the SQ(d) and the capacity-weighted JSQ, the notion of CBS yields new application-specific policies such as hybrid locally uniform JSQ. As today's data center clusters have thousands of servers, exact analysis of CBS policies is tedious. In this article, we derive a scaling limit when the number of servers grows large, facilitating a comparison of various CBS policies with respect to their transient as well as steady state behavior. A byproduct of our derivations is the relationship between the queue filling proportions and the server buffer sizes, which cannot be obtained from infinite buffer models. Finally, we provide extensive numerical evaluations and discuss several applications including multi-stage systems.
机译:我们在具有有限缓冲区的服务器的大型异构资源共享集群中研究作业分配。此负载平衡问题在当今的通信和大数据系统中自然出现,例如亚马逊Web服务,网络服务功能链和流处理。在负载平衡策略之后,向服务器发送到服务器,该策略优化了诸如作业完成时间的性能标准。我们的贡献是一种基于随机的成本的计划(CBS)策略,其中作业分配是由服务器队列长度的一般成本函数驱动的。除了现有方案之外,例如加入最短队列(JSQ),D或SQ(D)的功率和容量加权JSQ,CBS的概念会产生新的应用程序特定的策略,例如混合本地统一JSQ。由于今天的数据中心集群有数千台服务器,对CBS政策的确切分析是乏味的。在本文中,当服务器的数量变大时,我们推出了缩放限制,便于对其瞬态以及稳定状态行为的各种CBS策略的比较。我们的派生的副产品是队列填充比例与服务器缓冲区尺寸之间的关系,其无法从无限缓冲模型获得。最后,我们提供了广泛的数值评估,并讨论了多级系统,包括多级系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号