首页> 外文期刊>Advances in Science, Technology and Engineering Systems >Improving the Performance of Fair Scheduler in Hadoop
【24h】

Improving the Performance of Fair Scheduler in Hadoop

机译:改善Hadoop中Fair Scheduler的性能

获取原文
       

摘要

Cloud computing is a power platform to deal with big data. Among several software frameworks used for the construction of cloud computing systems, Apache Hadoop, which is an open-source software, becomes a popular one. Hadoop supports for distributed data storage and the process of large data sets on computer clusters based on a MapReduce parallel processing framework. The performance of Hadoop in parallel data processing is depended on the efficiency of a job scheduling algorithm underworking. In this paper, we improve the performance of the well-known fair scheduling algorithm adopted in Hadoop by introducing several mechanisms. The modified scheduling algorithm can dynamically adjust resource allocation to user jobs and the precedence of user jobs to be executed. Our approach can properly adapt to the runtime environment’s condition with the objective of achieving job fairness and reducing job turnaround time. Performance evaluations verify the superiority of the proposed scheduler over the original fair scheduler. The average turnaround time of jobs can be largely reduced in our experiments.
机译:云计算是处理大数据的强大平台。在用于构建云计算系统的几种软件框架中,作为开源软件的Apache Hadoop成为一种流行的软件。 Hadoop支持分布式数据存储以及基于MapReduce并行处理框架的计算机集群上的大数据集处理。 Hadoop在并行数据处理中的性能取决于作业调度算法欠工作的效率。在本文中,我们通过介绍几种机制来提高Hadoop中采用的著名公平调度算法的性能。修改后的调度算法可以动态地调整对用户作业的资源分配以及要执行的用户作业的优先级。我们的方法可以适当地适应运行时环境的条件,以实现工作公平性并减少工作周转时间。性能评估证明了拟议的调度程序优于原始公平调度程序的优越性。在我们的实验中,可以大大减少工作的平均周转时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号