首页> 外文会议>International conference on business information systems >Challenging SQL-on-Hadoop Performance with Apache Druid
【24h】

Challenging SQL-on-Hadoop Performance with Apache Druid

机译:使用Apache Druid挑战Hadoop上的SQL性能

获取原文

摘要

In Big Data, SQL-on-Hadoop tools usually provide satisfactory performance for processing vast amounts of data, although new emerging tools may be an alternative. This paper evaluates if Apache Druid, an innovative column-oriented data store suited for online analytical processing workloads, is an alternative to some of the well-known SQL-on-Hadoop technologies and its potential in this role. In this evaluation, Druid, Hive and Presto are benchmarked with increasing data volumes. The results point Druid as a strong alternative, achieving better performance than Hive and Presto, and show the potential of integrating Hive and Druid, enhancing the potentialities of both tools.
机译:在大数据中,SQL-on-Hadoop工具通常可为处理大量数据提供令人满意的性能,尽管新兴的工具可能是替代方法。本文评估了Apache Druid(一种适用于在线分析处理工作负载的创新的面向列的数据存储)是否可以替代某些著名的SQL-on-Hadoop技术及其在此角色中的潜力。在此评估中,德鲁伊,蜂巢和普雷斯托以不断增加的数据量为基准。结果表明Druid是一个强大的替代方案,其性能优于Hive和Presto,并显示了将Hive和Druid集成在一起的潜力,从而增强了这两种工具的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号