...
首页> 外文期刊>The Computer journal >Impacts of Task Re-Execution Policy on MapReduce Jobs
【24h】

Impacts of Task Re-Execution Policy on MapReduce Jobs

机译:任务重新执行策略对MapReduce作业的影响

获取原文
获取原文并翻译 | 示例
           

摘要

MapReduce is a popular distributed programming framework for large-scale data processing. To prevent MapReduce jobs from being interrupted by node failures that occur frequently in a MapReduce cluster consisting of a set of commodity machinesodes, the most well-known MapReduce implementation, i.e. Hadoop, adopts a task re-execution policy (TR policy). When a map/reduce task of a job crashes, the TR policy assigns another node to reperform the task. However, the impact of the TR policy on MapReduce jobs in terms of reliability, job turnaround time (JTT) and energy consumption are not clear, particularly when jobs have different features, e.g. different filtering percentages, different input-data sizes, and different numbers of reduce tasks. In this paper, we formally analyze the job completion reliability (JCR) of a job based on Poisson distributions, and then derive the expected JTT and job energy consumption (JEC) based on the universal generation function. Extensive analyses are further conducted to explore the impact of the TR policy on JCR, JTT and JEC of jobs with different features. The results show that employing the TR policy can dramatically improve JCR for a large MapReduce job. Moreover, if the JCR of a job is highly improved by the TR policy, the expected JTT and JEC will not be significantly prolonged and increased, respectively.
机译:MapReduce是流行的用于大型数据处理的分布式编程框架。为了防止MapReduce作业被由一组商用机器/节点组成的MapReduce群集中频繁发生的节点故障中断,最著名的MapReduce实现即Hadoop采用了任务重新执行策略(TR策略)。当作业的映射/归约任务崩溃时,TR策略将分配另一个节点以重新执行任务。但是,TR策略对MapReduce作业在可靠性,作业周转时间(JTT)和能耗方面的影响尚不清楚,尤其是在作业具有不同功能(例如不同的过滤百分比,不同的输入数据大小和不同数量的reduce任务。在本文中,我们基于泊松分布正式分析了作业​​的作业完成可靠性(JCR),然后根据通用生成函数推导了预期的JTT和作业能耗(JEC)。进一步进行了广泛的分析,以探讨TR政策对具有不同功能的工作的JCR,JTT和JEC的影响。结果表明,采用TR策略可以极大地改善大型MapReduce作业的JCR。而且,如果通过TR政策大大提高了工作的JCR,则预期的JTT和JEC不会分别显着延长和增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号