【24h】

Big Data and its Analyzing Tools : A Perspective

机译:大数据及其分析工具:一个视角

获取原文

摘要

Data are generated and stored in databases at a very high speed and hence it need to be handled and analyzed properly. Nowadays industries are extensively using Hadoop and Spark to analyze the datasets. Both the frameworks are used for increasing processing speeds in computing huge complex datasets. Many researchers are comparing both of them. Now, the big questions arising are, Is Spark a substitute for Hadoop? Is hadoop going to be replaced by spark in mere future?. Spark is “built on top of” Hadoop and it extends the model to deploy more types of computations which incorporates Stream Processing and Interactive Queries. No doubt, Spark's execution speed is much faster than Hadoop, but talking in terms of fault tolerance, hadoop is slightly more fault tolerant than spark. In this article comparison of various bigdata analytics tools are done and Hadoop and Spark are discussed in detail. This article further gives an overview of bigdata, spark and hadoop issues. In this survey paper, the approaches to resolve the issues of spark and hadoop are discussed elaborately.
机译:数据以极高的速度生成并存储在数据库中,因此需要对其进行适当的处​​理和分析。如今,行业广泛使用Hadoop和Spark来分析数据集。两种框架都可用于提高计算庞大的复杂数据集的处理速度。许多研究人员正在对两者进行比较。现在,出现的主要问题是,Spark是否可以替代Hadoop? Hadoop是否会在不久的将来被Spark取代? Spark是“构建在” Hadoop之上的,它扩展了模型以部署更多类型的计算,其中整合了流处理和交互式查询。毫无疑问,Spark的执行速度比Hadoop快得多,但是从容错角度来说,hadoop的容错性比spark略强。本文比较了各种大数据分析工具,并详细讨论了Hadoop和Spark。本文进一步概述了大数据,spark和hadoop问题。在本调查文件中,详细讨论了解决火花和Hadoop问题的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号