首页> 外文期刊>International journal of distributed systems and technologies >Performance Analysis of Hadoop YARN Job Schedulers in a Multi-Tenant Environment on HiBench Benchmark Suite
【24h】

Performance Analysis of Hadoop YARN Job Schedulers in a Multi-Tenant Environment on HiBench Benchmark Suite

机译:Hadoop纱线在Hibench基准套件多租户环境中Hadoop纱线工作调度仪的性能分析

获取原文
获取原文并翻译 | 示例
       

摘要

Big data processing technology marks a prominent place in today's market. Hadoop is an efficient open-source distributed framework used to process big data with fewer expenses utilizing a cluster of commodity machines (nodes). In Hadoop, YARN got introduced for effective resource utilization among the jobs. Still, YARN over-allocates the resources for some tasks of a job and keeps the cluster resources underutilized. This paper has investigated the CAPACITY and FAIR schedulers' practical utilization of resources in a multi-tenancy shared environment using the HiBench benchmark suite. It compares the above MapReduce job schedulers' performance in two scenarios and proposes some open research questions (ORQ) with potential solutions to help the upcoming researchers. On average, the authors found that CAPACITY and FAIR schedulers utilize 77% of RAM and 82% of CPU cores. Finally, the experimental evaluation proves that these schedulers over-allocate the resources for some of the tasks and keep the cluster resources underutilized in different scenarios.
机译:大数据处理技术标志着当今市场上的突出位置。 Hadoop是一个有效的开源分布式框架,用于处理大数据,利用商品计算机(节点)群体的费用较少。在Hadoop,纱线在工作中介绍了有效的资源利用。仍然,纱线在作业的一些任务中过度分配了资源,并使群集资源未充分利用。本文使用Hibench基准套件调查了多租户共享环境中资源的能力和公平调度率。它比较了上面的MapReduce作业调度程序在两种情况下的性能,并提出了一些开放的研究问题(ORQ),潜在的解决方案来帮助即将到来的研究人员。平均而言,该作者发现能力和公平调度员利用77%的RAM和82%的CPU核心。最后,实验评估证明,这些调度员过度为某些任务分配资源,并使群集资源在不同方案中未充分利用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号