首页> 外文期刊>Journal of network and systems management >OPTIMA: On-Line Partitioning Skew Mitigation for MapReduce with Resource Adjustment
【24h】

OPTIMA: On-Line Partitioning Skew Mitigation for MapReduce with Resource Adjustment

机译:OPTIMA:具有资源调整功能的MapReduce在线分区偏斜缓解

获取原文
获取原文并翻译 | 示例
           

摘要

Partitioning skew has been shown to be a major issue that can significantly prolong the execution time of MapReduce jobs. Most of the existing off-line heuristics for partitioning skew mitigation are inefficient; they have to wait for the completion of all the map tasks. Some solutions can tackle this problem on-line, but will impose an additional overhead by repartitioning the workload of overloaded tasks. In this paper, we present OPTIMA, an on-line partitioning skew mitigation technique for MapReduce. OPTIMA predicts the workload distribution of reduce tasks at run-time, leverages the deviation detection technique to identify the overloaded tasks and pro-actively adjusts resource allocation for these tasks to reduce their execution time. We provide the upper bound of OPTIMA in time complexity, while allowing OPTIMA to perform totally on-line. Through experiments using both real and synthetic workloads running on an 11-node Hadoop cluster, we have observed OPTIMA can effectively mitigate the partitioning skew and improved the job completion time by up to 36.73 % in our experiments.
机译:事实证明,分区偏斜是一个主要问题,可以大大延长MapReduce作业的执行时间。现有的大多数用于减少偏斜缓解的离线启发式方法均效率不高。他们必须等待所有地图任务的完成。一些解决方案可以在线解决此问题,但会通过对超负荷任务的工作量进行重新划分而增加额外的开销。在本文中,我们提出了OPTIMA,这是一种针对MapReduce的在线分区偏斜缓解技术。 OPTIMA预测减少任务在运行时的工作量分布,利用偏差检测技术来识别过载任务,并主动调整这些任务的资源分配以减少其执行时间。我们提供了OPTIMA的时间复杂度上限,同时允许OPTIMA完全在线执行。通过使用在11节点Hadoop集群上运行的实际和合成工作负载进行的实验,我们观察到OPTIMA在我们的实验中可以有效地减轻分区偏差,并将作业完成时间缩短多达36.73%。

著录项

  • 来源
    《Journal of network and systems management》 |2016年第4期|859-883|共25页
  • 作者单位

    College of Computer, National University of Defense Technology, Changsha, Hunan, China;

    David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada;

    David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada;

    Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha, Hunan, China;

    College of Computer, National University of Defense Technology, Changsha, Hunan, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    MapReduce; Partitioning skew; Resource allocation; Scheduling;

    机译:MapReduce;分区偏斜;资源分配;排程;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号