...
首页> 外文期刊>International Journal of Computer Trends and Technology >Data Analysis using Mapper and Reducer with Optimal Configuration in Hadoop
【24h】

Data Analysis using Mapper and Reducer with Optimal Configuration in Hadoop

机译:使用Mapper和Reducer在Hadoop中进行最佳配置进行数据分析

获取原文

摘要

Data analysis is an important functionality in cloud computing which allows a huge amount of data to be processed over very large clusters. Hadoop is a software framework for large data analysis. It provide a Hadoop distributed file system for the analysis and transformation of very large data sets is performed using the MapReduce paradigm. MapReduce is known as a popular way to hold data in the cloud environment due to its excellent scalability and good fault tolerance. Map Reduce is a programming model widely used for processing large data sets. Hadoop Distributed File System is designed to stream those data sets. The Hadoop MapReduce system was often unfair in its allocation and a dramatic improvement is achieved through the Elastic Mapper Reducer System. The proposed Mapper Reducer function allows us to analyze the data set and achieve better performance in executing the job by using optimal configuration of mappers and reducers based on the size of the data sets and also helps the users to view the status of the job and to find the error localization of scheduled jobs. This will efficiently utilize the performance properties of optimized scheduled jobs. So, the efficiency of the system will result in substantially lowered system cost, energy usage, management complexity and increases the performance of the system.
机译:数据分析是云计算中的一项重要功能,它允许在非常大的群集上处理大量数据。 Hadoop是用于大数据分析的软件框架。它提供了一个Hadoop分布式文件系统,用于使用MapReduce范式执行非常大的数据集的分析和转换。由于MapReduce具有出色的可伸缩性和良好的容错性,因此被称为在云环境中保存数据的一种流行方法。 Map Reduce是一种编程模型,广泛用于处理大型数据集。 Hadoop分布式文件系统旨在流式传输这些数据集。 Hadoop MapReduce系统的分配通常不公平,并且通过Elastic Mapper Reducer系统实现了巨大的改进。提出的Mapper Reducer功能使我们能够分析数据集,并根据数据集的大小使用映射器和化简器的最佳配置,从而在执行作业时获得更好的性能,还可以帮助用户查看作业的状态并查找计划作业的错误本地化。这将有效利用优化的计划作业的性能属性。因此,系统的效率将大大降低系统成本,能源使用,管理复杂性并提高系统性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号