首页> 外文期刊>Wiley interdisciplinary reviews. Data mining and knowledge discovery >Big data processing tools: An experimental performance evaluation
【24h】

Big data processing tools: An experimental performance evaluation

机译:大数据处理工具:实验性能评估

获取原文
获取原文并翻译 | 示例
           

摘要

Big Data is currently a hot topic of research and development across several business areas mainly due to recent innovations in information and communication technologies. One of the main challenges of Big Data relates to how one should efficiently handle massive volumes of complex data. Due to the notorious complexity of the data that can be collected from multiple sources, usually motivated by increasing data volumes gathered at high velocity, efficient processing mechanisms are needed for data analysis purposes. Motivated by the rapid growth in technology, development of tools, and frameworks for Big Data, there is much discussion about Big Data querying tools and, specifically, those that are more appropriated for specific analytical needs. This paper describes and evaluates the following popular Big Data processing tools: Drill, HAWQ, Hive, Impala, Presto, and Spark. An experimental evaluation using the Transaction Processing Council (TPC-H) benchmark is presented and discussed, highlighting the performance of each tool, according to different workloads and query types.
机译:大数据目前是几家商业领域的研发的热门话题,主要是由于最近的信息和通信技术的创新。大数据的主要挑战之一涉及如何有效处理大量复杂数据的复杂数据。由于可以从多个源收集的数据的臭名昭着的复杂性,通常通过增加在高速下收集的数据量来激励,以进行数据分析目的所需的有效处理机制。通过技术的快速增长,工具的开发和大数据的框架的动机,有很多关于大数据查询工具的讨论,具体而言,那些更适合特定的分析需求的讨论。本文介绍并评估了以下流行的大数据处理工具:钻,Hawq,Hive,Impala,Presto和Spark。根据不同的工作负载和查询类型,提出并讨论了使用交易处理委员会(TPC-H)基准测试的实验评估,并讨论了每个工具的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号