首页> 外文会议>International Conference on Information Communication and Embedded Systems >A performance analysis of MapReduce applications on big data in cloud based Hadoop
【24h】

A performance analysis of MapReduce applications on big data in cloud based Hadoop

机译:基于云的Hadoop中的MapReduce应用程序在大数据上的性能分析

获取原文

摘要

MapReduce is one of the most popular programming model for big data analysis in Distributed and Parallel Computing Environment. It is used for implementing parallel applications. With the growing development of mobile Internet and cloud computing, the issues related to big data have been a matter of concern in both industry and academy. There are several platforms for users to develop their applications based on MapReduce framework such as Hadoop. Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. This paper discusses various MapReduce applications like Wordcount, Pi, TeraSort, Grep in Cloud based Hadoop. We have shown experimental results of these applications on Amazon EC2 using two types of Ubuntu instances. In this paper, performance of above application has been shown with respect to execution time and number of nodes. We find in our research study that as the number of nodes increases the execution time decreases and performance increases.
机译:MapReduce是分布式和并行计算环境中最流行的大数据分析编程模型之一。它用于实现并行应用程序。随着移动互联网和云计算的不断发展,与大数据有关的问题已成为业界和学术界关注的问题。有多种平台供用户基于Hadoop等MapReduce框架开发其应用程序。 Hadoop是一个免费的,基于Java的编程框架,它支持在分布式计算环境中处理大型数据集。本文讨论了基于云的Hadoop中的各种MapReduce应用程序,如Wordcount,Pi,TeraSort,Grep。我们已经在Amazon EC2上使用两种类型的Ubuntu实例展示了这些应用程序的实验结果。在本文中,已针对执行时间和节点数显示了上述应用程序的性能。我们在研究中发现,随着节点数量的增加,执行时间会减少,性能也会提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号