首页> 外文会议>International Conference on Advanced Information Technologies >Improved Job Scheduling for Achieving Fairness on Apache Hadoop YARN
【24h】

Improved Job Scheduling for Achieving Fairness on Apache Hadoop YARN

机译:改进了作业调度以在Apache Hadoop YARN上实现公平

获取原文

摘要

Enormous amounts of data are gathered from social media sites, mobile and other business environment. Analyzing the enormous amounts of big data becomes large workloads with distributed applications and the resources of a single machine are insufficient for this application. Hadoop YARN (Yet Another Resource Negotiator) enables running multiple applications over hadoop cluster to utilize the resources efficiently and provide the data parallel programming model. Hadoop YARN breaks up the performance of open source framework for distributed applications and performs job scheduling and monitoring together with storage, processing and analysis of big data on commodity hardware. Apache Hadoop provides for over 200 default parameter configuration settings for all type of clusters and applications. Of If the available parameters misconfigure, the one or more machines in the cluster may decrease the system performance. Appropriate tuning parameters configuration can increase the system performance. Tuning parameter configuration becomes the challenge of Apache Hadoop Framework for utilization of system resources efficiently. In this paper, YARN parameters tuning is done for improving the execution time and efficient job scheduling.
机译:从社交媒体网站,移动和其他商业环境中收集大量数据。分析大量大数据具有带有分布式应用的大工作负载,并且单台机器的资源不足以进行此应用。 Hadoop纱(又一资源谈判代表)使得在Hadoop集群上运行多个应用程序,以有效地利用资源并提供数据并行编程模型。 Hadoop Yarn打破了分布式应用的开源框架的性能,并与商品硬件的大数据的存储,处理和分析一起执行作业调度和监控。 Apache Hadoop为所有类型的群集和应用程序提供超过200个默认参数配置设置。如果可用参数错误配置,则集群中的一个或多个机器可能会降低系统性能。适当的调整参数配置可以提高系统性能。调整参数配置成为Apache Hadoop框架以有效利用系统资源的挑战。在本文中,完成了纱线参数调整以改善执行时间和高效作业调度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号