首页> 外文会议>IEEE International Conference on Cloud Computing Technology and Science >Dependency-Aware and Resource-Efficient Scheduling for Heterogeneous Jobs in Clouds
【24h】

Dependency-Aware and Resource-Efficient Scheduling for Heterogeneous Jobs in Clouds

机译:云中异构作业的依赖关系感知和资源高效调度

获取原文

摘要

Data analytics frameworks shift towards larger degrees of parallelism. Efficient scheduling of data-parallel jobs (tasks) is critical for improving job performance such as response time, and resource utilization. It is an important challenge for large scale data analytics frameworks in which jobs are more complex and have diverse characteristics (e.g., diverse resource requirements). Prior work on scheduling cannot achieve low response time and high resource utilization simultaneously because they cannot accurately estimate the durations of tasks in the queue of a worker machine by using sampling-based approach (including sampling with late binding) for task placement, and thus they fail to place tasks at the best possible worker machine. Also, they do not sufficiently consider the diverse resource requirements of jobs (tasks) for placing tasks on worker machines. To address this challenge, we propose a Dependency-aware and Resource-efficient Scheduling (DRS) to achieve low response time and high resource utilization. DRS takes into account task dependency and assigns tasks that are independent of each other to different worker machines. Also, DRS considers tasks' resource requirements and packs complementary tasks whose resource demands on multiple resources are complementary to each other to increase the resource utilization. In addition, DRS uses the mutual reinforcement learning to estimate the task's waiting time (the duration of tasks in the queue of a worker), and assigns tasks to workers with the consideration of tasks' waiting time to reduce the response time. Extensive experimental results based on a real cluster and experiments using real-world Amazon EC2 cloud service show that DRS achieves low response time and high resource utilization compared to previous strategies.
机译:数据分析框架转向更大程度的并行性。数据并行作业(任务)的有效调度对于提高作业性能(例如响应时间和资源利用率)至关重要。这对于大规模数据分析框架是一个重要的挑战,在该框架中,工作更加复杂并且具有不同的特征(例如,不同的资源需求)。先前的调度工作无法同时实现低响应时间和高资源利用率,因为它们无法通过使用基于采样的方法(包括后期绑定采样)准确地估计工作机队列中任务的持续时间,因此它们无法将任务放置在可能的最佳工作计算机上。而且,他们没有充分考虑将任务放置在工作者计算机上的作业(任务)的各种资源需求。为了解决这一挑战,我们提出了一种依赖关系感知和资源高效的调度(DRS),以实现较低的响应时间和较高的资源利用率。 DRS考虑了任务依赖性,并将彼此独立的任务分配给不同的工作机。此外,DRS考虑任务的资源需求,并打包其对多个资源的资源需求彼此互补的互补任务,以提高资源利用率。另外,DRS使用相互强化学习来估计任务的等待时间(任务在队列中的持续时间),并考虑任务的等待时间将任务分配给工作者以减少响应时间。基于真实集群的大量实验结果以及使用真实Amazon EC2云服务进行的实验表明,与以前的策略相比,DRS的响应时间短且资源利用率高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号