【24h】

Locality premised reducer scheduling in Hadoop

机译:Hadoop中的本地化本地化约简调度

获取原文
获取原文并翻译 | 示例

摘要

Data is generated in high speed and volume called Big Data. Hadoop has ability to handle Big Data. Programs get executed in parallel manner in Hadoop. In Hadoop the problem gets divide in two parts namely, map task and reduce task. In the existing Hadoop version map task scheduling premise is the locality of input data lowers the network traffic and hence improve performance of the mappers. But reduce task get scheduled without any consideration of data locality, resulting to poor performance at requesting node. This paper propose a modified reduce task scheduling algorithm on the basis of data locality that will minimize data-local traffic. In the evaluation of algorithm it is observed that up to 80 % of bytes shuffling has reduced in Hadoop clusters.
机译:数据以大数量的高速生成。 Hadoop具有处理大数据的能力。程序在Hadoop中以并行方式执行。在Hadoop中,问题分为两部分,即映射任务和简化任务。在现有的Hadoop版本映射任务计划前提中,输入数据的局部性降低了网络流量,从而提高了映射器的性能。但是在不考虑数据局部性的情况下减少任务的调度时间,导致请求节点的性能较差。本文提出了一种基于数据局部性的改进的减少任务调度算法,该算法将最大程度地减少数据局部流量。在算法评估中,可以发现Hadoop集群中最多减少了80%的字节转换。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号