首页> 外文会议>International Conference on Intelligent Computing and Applications >Analysis of Job Scheduling Algorithms and Studying Dynamic Job Ordering to Optimize MapReduce
【24h】

Analysis of Job Scheduling Algorithms and Studying Dynamic Job Ordering to Optimize MapReduce

机译:作业调度算法分析和研究动态作业排序优化MapReduce

获取原文

摘要

As there was a big rise in the Big Data field, Hadoop became one of the most used platforms in many applications like clinical data analysis, Facebook, Amazon, in which Big Data processing and utilization is required. One of the most important features that made Hadoop, one of the most popular platform, is adopting MapReduce, which made large changes in the market by processing huge amount of data in parallel technique by distributing data across multiple TaskTracker nodes and each node splits data by Map slots and shrinks the output of Map slots (key/ value) by Reduce slots. However, different algorithms are used for job scheduling in MapReduce which is used to improve resource utilization, job allocation, and minimizing processing time. However, research is still underway to propose optimal method to improve MapReduce model, and there are still several major drawbacks that are still not well studied. In this paper, we studied and analyzed different job scheduling, job ordering algorithms, and dynamic slot configuration. The study discussed the most popular and efficient systems design to find the most efficient improvements for MapReduce and review the corresponding solutions. In our proposed method, we are applying classification algorithm which is going to classify job into highest utilization and poor utilization. After that, the highest utilization job will be forwarded to PRISM algorithm, which will schedule into phase level as there are variations in processing time and resource requirement in each phase, for different kinds of jobs along with the application of dynamic slot configuration, which helps to improve resources utilization and reduce time.
机译:由于有在大数据领域的大幅上升,成为Hadoop的许多应用,例如临床数据分析,Facebook和亚马逊,在需要大数据的处理和利用最常用的平台之一。一的该取得的Hadoop,最流行的平台中的一个,最重要的特点是采用的MapReduce,其通过由分布在多个的TaskTracker节点的数据,并且每个节点分割数据并行处理技术的数据的巨大的量由在市场上大的变化地图槽和收缩地图槽通过降低槽的输出(键/值)。然而,不同的算法用于在MapReduce的作业调度其用于提高资源利用率,作业分配,并最小化处理时间。然而,研究工作仍在进行中,提出最佳的方法来提高MapReduce的模型,还有几个主要的缺点是还没有很好的研究。在本文中,我们研究和分析不同的作业调度,作业排序算法,以及动态插槽配置。该研究讨论最流行和最有效的系统设计找到MapReduce的最有效的改善和检讨相应的解决方案。在我们提出的方法,我们正在申请这是会进行分类工作进入最高使用率和利用率低下分类算法。此后,利用率最高的工作将被转发到PRISM算法,因为在处理时间和资源需求在每个阶段的变化,这将安排成相平,对于不同类型的工作动态时隙模式的应用,这有助于沿以提高资源利用率和降低时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号