【24h】

Research on Data Query Optimization Based on SparkSQL and MongoDB

机译:基于SparkSQL和MongoDB的数据查询优化研究

获取原文

摘要

With the arrival of the era of big data, the analysis and processing of massive data has become a very critical computing problem. This paper proposes a query optimization method based on SparkSQL and MongoDB. It analyzes the principle and compares it with other literature in order to draw the conclusion. The conclusion shows that when dealing with problems such as interactive SQL queries, the Apache Spark engine can reasonably decompose the tasks based on the dependencies between the massive data, thereby reducing the data query processing time and improving the operating efficiency. Also it is very suitable for storing some simple data with large amount due to flexible query and index of MongoDB. Obviously, the combination of the two can significantly improve the query speed of massive data.
机译:随着大数据时代的到来,海量数据的分析和处理已成为一个非常关键的计算问题。提出了一种基于SparkSQL和MongoDB的查询优化方法。它分析了该原理,并将其与其他文献进行比较以得出结论。结论表明,在处理诸如交互式SQL查询之类的问题时,Apache Spark引擎可以根据海量数据之间的依赖关系合理地分解任务,从而减少了数据查询的处理时间并提高了运行效率。而且由于MongoDB的灵活查询和索引,它非常适合存储一些简单的数据。显然,两者的结合可以显着提高海量数据的查询速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号