
Replication-Based Load Balancing


Abstract

Load balancing of large distributed server systems is a complex optimization problem of critical importance in cloud systems and data centers. Existing schedulers often incur a high communication overhead when collecting the data required to make scheduling decisions, and so delay job requests on their way to the executing servers. We propose a novel scheme that incurs no communication overhead between the users and the servers upon job arrival, thus removing any scheduling overhead from the job's critical path. Our approach is based on creating several replicas of each job and sending each replica to a different server. When a replica reaches the head of the queue at its server, that server signals the other servers holding replicas of the same job so that they remove those replicas from their queues. We show, through analysis and simulations, that this scheme significantly reduces the expected queuing overhead compared with traditional schemes under various load conditions and different job length distributions. In addition, we show that our scheme remains efficient even when the inter-server signal propagation delay is significant (relative to the job's execution time). We provide a heuristic solution to the performance degradation that occurs in such cases and show, by simulations, that it efficiently mitigates the detrimental effect of propagation delays.
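The replicate-and-cancel idea described in the abstract can be illustrated with a small simulation. What follows is a minimal sketch under stated assumptions, not the authors' simulator: the function name `simulate`, the Poisson arrival process, the exponential job lengths with mean 1, the uniform random choice of `d` distinct servers, and the zero inter-server signal delay are all assumptions made for illustration. With instantaneous cancellation, a job effectively starts on whichever of its chosen servers finishes its earlier committed work first, so the sketch tracks only the time at which each server becomes free instead of modeling the replicas explicitly.

```python
import random


def simulate(n_servers, d, load, n_jobs, seed=0):
    """Estimate the mean queuing delay when each job is replicated to d servers.

    Illustrative assumptions (not taken from the paper): Poisson arrivals
    with total rate load * n_servers, exponential job lengths with mean 1,
    FIFO queues, and zero signal propagation delay between servers.
    """
    rng = random.Random(seed)
    # free_at[s] is the time at which server s finishes the jobs it will
    # actually execute; cancelled replicas never add work to a server.
    free_at = [0.0] * n_servers
    now = 0.0
    total_wait = 0.0
    for _ in range(n_jobs):
        now += rng.expovariate(load * n_servers)  # next arrival time
        job_length = rng.expovariate(1.0)
        # Send one replica to each of d distinct, randomly chosen servers.
        replicas = rng.sample(range(n_servers), d)
        # The replica that reaches the head of its queue first is executed;
        # the remaining replicas are removed immediately.
        winner = min(replicas, key=lambda s: max(now, free_at[s]))
        start = max(now, free_at[winner])
        total_wait += start - now
        free_at[winner] = start + job_length
    return total_wait / n_jobs


if __name__ == "__main__":
    for d in (1, 2, 3):
        print(d, simulate(n_servers=50, d=d, load=0.8, n_jobs=50000, seed=1))
```

Setting `d=1` corresponds to sending each job to a single random server with no replication, which gives a baseline for comparison. The sketch does not cover the case of significant propagation delay, where a job may begin executing on more than one server before the removal signal arrives.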
