...
首页> 外文期刊>Journal of Theoretical and Applied Information Technology >RESEARCH OF THE EFFECTIVENESS OF SQL ENGINES WORKING IN HDFS
【24h】

RESEARCH OF THE EFFECTIVENESS OF SQL ENGINES WORKING IN HDFS

机译:HDFS中SQL引擎工作效率的研究

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The rapid data growth at the beginning of the XXI century gave impetus to the development of big data technologies. A distributed platform Hadoop became a key element of big data technologies. Initially, it was difficult using Hadoop for tabular data processing, on which many modern industrial information systems are built. Therefore, a variety of SQL tools for Hadoop began to appear, which gave rise to the problem of choosing a specific solution. The aim of this work is to identify the most efficient SQL tools for tabular data processing in a distributed Hadoop system. For this purpose a comparative analysis of six most popular tools: Apache Hive, Cloudera Impala, Spark SQL, Presto, Apache Drill, Apache HAWQ has been done. The result of the study was the choice of the most effective means from the standpoint of completeness of the list of functions performed, tool performance and the level of SQL standards support. After summarizing of the results of a study, which has been done on all selected space coordinates comparison, Presto was the most effective tool.
机译:在二十一世纪初,数据的快速增长推动了大数据技术的发展。分布式平台Hadoop成为大数据技术的关键要素。最初,使用Hadoop进行表格数据处理非常困难,在该表格上建立了许多现代工业信息系统。因此,开始出现了各种用于Hadoop的SQL工具,这引起了选择特定解决方案的问题。这项工作的目的是确定用于分布式Hadoop系统中表格数据处理的最有效的SQL工具。为此,对六个最受欢迎的工具进行了比较分析:Apache Hive,Cloudera Impala,Spark SQL,Presto,Apache Drill,Apache HAWQ。从完成的功能,工具性能和SQL标准支持级别的完整性的角度来看,研究的结果是选择了最有效的方法。在总结了所有选定空间坐标比较的研究结果之后,Presto是最有效的工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号