Research on Data Query Optimization Based on SparkSQL and MongoDB

机译：基于SparkSQL和MongoDB的数据查询优化研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the arrival of the era of big data, the analysis and processing of massive data has become a very critical computing problem. This paper proposes a query optimization method based on SparkSQL and MongoDB. It analyzes the principle and compares it with other literature in order to draw the conclusion. The conclusion shows that when dealing with problems such as interactive SQL queries, the Apache Spark engine can reasonably decompose the tasks based on the dependencies between the massive data, thereby reducing the data query processing time and improving the operating efficiency. Also it is very suitable for storing some simple data with large amount due to flexible query and index of MongoDB. Obviously, the combination of the two can significantly improve the query speed of massive data.

机译：随着大数据时代的到来，海量数据的分析和处理已成为一个非常关键的计算问题。提出了一种基于SparkSQL和MongoDB的查询优化方法。它分析了该原理，并将其与其他文献进行比较以得出结论。结论表明，在处理诸如交互式SQL查询之类的问题时，Apache Spark引擎可以根据海量数据之间的依赖关系合理地分解任务，从而减少了数据查询的处理时间并提高了运行效率。而且由于MongoDB的灵活查询和索引，它非常适合存储一些简单的数据。显然，两者的结合可以显着提高海量数据的查询速度。

著录项

来源
《International Symposium on Distributed Computing and Applications for Business Engineering and Science》|2018年|144-147|共4页
会议地点
作者
chen yujun; Yuansheng Lou; Feng Ye;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Task analysis; Query processing; Big Data; Optimization methods;

机译：任务分析;查询处理;大数据;优化方法;

相似文献

外文文献
中文文献
专利

1. K-NEAREST NEIGHBOUR QUERY PERFORMANCE ANALYSES ON A LARGE SCALE TAXI DATASET: POSTGRESQL VS. MONGODB [J] . Co?kun ?. B., Sertok S., Anbaro?lu B. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences . 2019,第2aW8期

机译：大型出租车数据集上的K-NEAREST近邻查询性能分析：POSTGRESQLVS。蒙古国数据库
2. Intelligent Query-Based Data Aggregation Model and Optimized Query Ordering for Efficient Wireless Sensor Network [J] . Sarode Prachi, Nandhini R. Wireless personal communications: An Internaional Journal . 2018,第4期

机译：基于智能查询的数据聚合模型和高效无线传感器网络的优化查询排序
3. Multi-join query optimization in bucket-based encrypted databases using an enhanced ant colony optimization algorithm [J] . Jafarinejad Mahmoud, Amini Morteza Distributed and Parallel Databases . 2018,第2期

机译：使用增强型蚁群优化算法的基于桶的加密数据库中的多联接查询优化
4. Research on Data Query Optimization Based on SparkSQL and MongoDB [C] . chen yujun, Yuansheng Lou, Feng Ye International Symposium on Distributed Computing and Applications for Business Engineering and Science . 2018

机译：基于SparkSQL和MongoDB的数据查询优化研究
5. A Mediator-based Data Integration System for Query Answering using an Optimized Extended Inverse Rules Algorithm. [D] . Jayaraman, Gayathri. 2010

机译：基于介体的数据集成系统，用于使用优化的扩展逆规则算法进行查询应答。
6. Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases [O] . Ricardo Sánchez-de-Madariaga, Adolfo Muñoz, Antonio L Castro, 2018

机译：在关系型（MySQL）和NoSQL型（MongoDB和EXist）增长大小的ISO / EN 13606标准化EHR数据库中执行增加复杂性的查询
7. Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases [O] . Ricardo Sánchez-de-Madariaga, Adolfo Muñoz, Antonio L Castro, 2018

机译：在关系（MySQL）和NoSQL（MongoDB和存在）中执行复杂性越来越多的查询（MongoDB和存在）尺寸生长ISO / EN 13606标准化的EHR数据库

Research on Data Query Optimization Based on SparkSQL and MongoDB

摘要

著录项

相似文献

相关主题

期刊订阅