...
首页> 外文期刊>IEICE transactions on information and systems >System Status Aware Hadoop Scheduling Methods for Job Performance Improvement
【24h】

System Status Aware Hadoop Scheduling Methods for Job Performance Improvement

机译:系统状态感知Hadoop调度方法以提高工作绩效

获取原文
   

获取外文期刊封面封底 >>

       

摘要

MapReduce and its open software implementation Hadoop are now widely deployed for big data analysis. As MapReduce runs over a cluster of massive machines, data transfer often becomes a bottleneck in job processing. In this paper, we explore the influence of data transfer to job processing performance and analyze the mechanism of job performance deterioration caused by data transfer oriented congestion at disk I/O and/or network I/O. Based on this analysis, we update Hadoop's Heartbeat messages to contain the real time system status for each machine, like disk I/O and link usage rate. This enhancement makes Hadoop's scheduler be aware of each machine's workload and make more accurate decision of scheduling. The experiment has been done to evaluate the effectiveness of enhanced scheduling methods and discussions are provided to compare the several proposed scheduling policies.
机译:MapReduce及其开放软件实现Hadoop现在已广泛部署用于大数据分析。由于MapReduce在大型计算机集群上运行,因此数据传输通常成为作业处理的瓶颈。在本文中,我们探讨了数据传输对作业处理性能的影响,并分析了磁盘I / O和/或网络I / O上面向数据传输的拥塞导致作业性能下降的机制。基于此分析,我们将更新Hadoop的心跳消息,以包含每台计算机的实时系统状态,例如磁盘I / O和链接使用率。这项增强功能使Hadoop的调度程序了解每台计算机的工作量,并做出更准确的调度决策。已经进行了实验以评估增强调度方法的有效性,并提供了讨论以比较几种建议的调度策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号