首页> 外文期刊>Cloud Computing, IEEE Transactions on >PRISM: Fine-Grained Resource-Aware Scheduling for MapReduce
【24h】

PRISM: Fine-Grained Resource-Aware Scheduling for MapReduce

机译:PRISM:MapReduce的细粒度资源感知计划

获取原文
获取原文并翻译 | 示例
           

摘要

MapReduce has become a popular model for data-intensive computation in recent years. By breaking down each job into small and tasks and executing them in parallel across a large number of machines, MapReduce can significantly reduce the running time of data-intensive jobs. However, despite recent efforts toward designing resource-efficient MapReduce schedulers, existing solutions that focus on scheduling at the task-level still offer sub-optimal job performance. This is because tasks can have highly varying resource requirements during their lifetime, which makes it difficult for task-level schedulers to effectively utilize available resources to reduce job execution time. To address this limitation, we introduce PRISM, a fine-grained resource-aware MapReduce scheduler that divides tasks into phases, where each phase has a constant resource usage profile, and performs scheduling at the phase level. We first demonstrate the importance of phase-level scheduling by showing the resource usage variability within the lifetime of a task using a wide-range of MapReduce jobs. We then present a phase-level scheduling algorithm that improves execution parallelism and resource utilization without introducing stragglers. In a 10-node Hadoop cluster running standard benchmarks, PRISM offers high resource utilization and provides improvement in job running time compared to the current Hadoop schedulers.
机译:近年来,MapReduce已成为用于数据密集型计算的流行模型。通过将每个作业分解为细小的任务并在大量机器上并行执行,MapReduce可以显着减少数据密集型作业的运行时间。但是,尽管最近在设计资源高效的MapReduce调度程序方面付出了很多努力,但是现有的专注于任务级调度的解决方案仍然提供了次优的作业性能。这是因为任务在其生命周期内可能具有高度变化的资源需求,这使得任务级调度程序很难有效地利用可用资源来减少作业执行时间。为了解决此限制,我们引入了PRISM,这是一种细粒度的资源感知MapReduce调度程序,它将任务划分为多个阶段,每个阶段具有恒定的资源使用情况,并在阶段级别执行调度。我们首先通过使用各种MapReduce作业显示任务生命周期内的资源使用可变性,从而证明阶段级调度的重要性。然后,我们提出一种相位级别的调度算法,该算法可提高执行并行度和资源利用率,而不会引入散乱的内容。在运行标准基准的10节点Hadoop集群中,与当前的Hadoop调度程序相比,PRISM可提高资源利用率并缩短作业运行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号