首页> 外文会议>International Conference on Smart Systems and Inventive Technology >Optimization of Hadoop MapReduce Model in cloud Computing Environment
【24h】

Optimization of Hadoop MapReduce Model in cloud Computing Environment

机译:Hadoop MapReduce模型在云计算环境中的优化

获取原文

摘要

In recent years data analysis has become one of the trending topic among the researchers. Moreover, Information is the new baseline of all organization, as to grow the faster and bigger. Relevant information provides the flexibility to know the like and dislike of customer and to get the relevant information requires the analysis of huge information that is stored in various format. Hadoop constitutes of two basic model i.e. Hadoop Distributed File system (HDFS) and MapReduce, Hadoop is used for processing the huge amount of data whereas MapReduce is used for data processing. Hadoop MapReduce is one of the best platform for processing the huge data in efficient manner such as processing of web logs data. In this paper, we have proposed optimized HPMR (Hadoop MapReduce) model, which maximizes the memory utilization for the task and balances the performance between the I/O system and CPUs. HPMR contains the three phase i.e. Hadoop, Map and Reduce just like any other Hadoop model, however HPMR optimizes all three phase i.e. map, shuffle and reduce. Moreover, to optimize the memory model HPMR opts for dynamic terminology and input/output optimization is done through the dual operation. Moreover, in order to evaluate the performance of our model we have performed the Word-Count application on the Wikipedia data of size 128 Mb, 256 Mb, 512 Mb, 1 GB and 2 GB. The comparative analysis shows that our model optimizes nearly 30% better than the existing one.
机译:近年来,数据分析已成为研究人员中的趋势主题之一。此外,信息是所有组织的新基线,以增长更快,更大。相关信息提供了了解类似信息和不喜欢客户的灵活性,并获得相关信息需要分析以各种格式存储的巨大信息。 Hadoop构成两个基本模型I.E. Hadoop分布式文件系统(HDFS)和MapReduce,Hadoop用于处理大量数据,而MapReduce用于数据处理。 Hadoop MapReduce是以有效的方式处理大数据的最佳平台之一,例如Web日志数据的处理。在本文中,我们已经提出了优化的HPMR(Hadoop MapReduce)模型,最大化了任务的内存利用率,并余额平衡I / O系统和CPU之间的性能。 HPMR包含三相i.e.Hadoop,Map和Deamse,如任何其他Hadoop模型,但HPMR优化了所有三个阶段I.。地图,洗牌和减少。此外,为了优化存储器模型,HPMR选择动态术语和输入/输出优化是通过双操作完成的。此外,为了评估我们的模型的性能,我们已经在Wikipedia大小的大小128 MB,256 MB,512 MB,1 GB和2 GB上执行了单词计数应用程序。比较分析表明,我们的模型优化了比现有的更好的30%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号