首页> 外文会议>2016 Symposium on Colossal Data Analysis and Networking >Big data management processing with Hadoop MapReduce and spark technology: A comparison
【24h】

Big data management processing with Hadoop MapReduce and spark technology: A comparison

机译:使用Hadoop MapReduce和Spark技术进行大数据管理处理:比较

获取原文
获取原文并翻译 | 示例

摘要

Hadoop MapReduce is processed for analysis large volume of data through multiple nodes in parallel. However MapReduce has two function Map and Reduce, large data is stored through HDFS. Lack of facility involve in MapReduce so Spark is designed to run for real time stream data and for fast queries. Spark jobs perform work on Resilient Distributed Datasets and directed acyclic graph execution engine. In this paper, we extend Hadoop MapReduce working and Spark architecture with supporting kind of operation to perform. We also show the differences between Hadoop MapReduce and Spark through Map and Reduce phase individually.
机译:Hadoop MapReduce经过处理,可以通过多个节点并行分析大量数据。但是MapReduce具有Map和Reduce两个功能,大数据通过HDFS存储。 MapReduce缺乏设施,因此Spark可以运行用于实时流数据和快速查询。 Spark作业在弹性分布式数据集和有向非循环图执行引擎上执行工作。在本文中,我们通过支持执行的某种操作来扩展Hadoop MapReduce工作和Spark架构。我们还将分别通过Map和Reduce阶段显示Hadoop MapReduce和Spark之间的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号