...
首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Impact of MapReduce Policies on Job Completion Reliability and Job Energy Consumption
【24h】

Impact of MapReduce Policies on Job Completion Reliability and Job Energy Consumption

机译:MapReduce策略对作业完成可靠性和作业能耗的影响

获取原文
获取原文并翻译 | 示例
           

摘要

Recently, MapReduce has been widely employed by many companies/organizations to tackle data-intensive problems over a large-scale MapReduce cluster. To solve machineode failure which is inevitable in a MapReduce cluster, MapReduce employs several policies, such as input-data replication and intermediate-data replication policies. To speed up job execution, MapReduce allows reduce tasks to early fetch their required intermediate data. However, the impact of these policy combinations on the job completion reliability (JCR for short) and job energy consumption (JEC for short) of a MapReduce cluster was not clear, where JCR is the reliability with which a MapReduce job can be completed by the cluster, whereas JEC is the energy consumed by the cluster to complete the job. Therefore, in this study, we analyze the JCR and JEC of a MapReduce cluster on four policy combinations (POCs for short) derived from two typical intermediate-data replication policies and two typical reduce-task assignment policies. The four POCs are further compared in extensive scenarios, which not only consider jobs at different scales with various parameters, but also give a MapReduce cluster two extreme parallel execution capabilities and diverse bandwidths. The analytical results enable MapReduce managers to comprehend how these POCs impact the JCR and JEC of a cluster and then select an appropriate POC based on the characteristics of their own MapReduce jobs and clusters.
机译:最近,MapReduce已被许多公司/组织广泛采用,以解决大规模MapReduce集群上的数据密集型问题。为了解决在MapReduce集群中不可避免的机器/节点故障,MapReduce采用了几种策略,例如输入数据复制和中间数据复制策略。为了加快作业执行速度,MapReduce允许reduce任务尽早获取所需的中间数据。但是,这些策略组合对MapReduce集群的工作完成可靠性(简称JCR)和工作能耗(简称JEC)的影响尚不清楚,其中JCR是通过MapReduce集群可以完成MapReduce任务的可靠性。集群,而JEC是集群完成工作所消耗的能量。因此,在本研究中,我们基于两个典型的中间数据复制策略和两个典型的减少任务分配策略得出的四个策略组合(简称POC)分析了MapReduce集群的JCR和JEC。在广泛的场景中进一步比较了这四个POC,这些场景不仅考虑了具有各种参数的不同规模的作业,而且还为MapReduce群集提供了两种极端的并行执行能力和不同的带宽。分析结果使MapReduce管理人员能够理解这些POC如何影响集群的JCR和JEC,然后根据他们自己的MapReduce作业和集群的特征选择合适的POC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号