首页> 外文会议>International conference on database and expert systems applications >Integrating Big Data and Relational Data with a Functional SQL-like Query Language
【24h】

Integrating Big Data and Relational Data with a Functional SQL-like Query Language

机译:将大数据和关系数据与功能类似SQL的查询语言集成

获取原文

摘要

Multistore systems have been recently proposed to provide integrated access to multiple, heterogeneous data stores through a single query engine. In particular, much attention is being paid on the integration of unstructured big data typically stored in HDFS with relational data. One main solution is to use a relational query engine that allows SQL-like queries to retrieve data from HDFS, which requires the system to provide a relational view of the unstructured data and hence is not always feasible. In this paper, we introduce a functional SQL-like query language that can integrate data retrieved from different data stores and take full advantage of the functionality of the underlying data processing frameworks by allowing the ad hoc usage of user defined map/filter/reduce operators in combination with traditional SQL statements. Furthermore, the query language allows for optimization by enabling subquery rewriting so that filter conditions can be pushed inside and executed at the data store as early as possible. Our approach is validated with two data stores and a representative query that demonstrates the usability of the query language and evaluates the benefits from query optimization.
机译:最近已经提出了多存储系统,以通过单个查询引擎提供对多个异构数据存储的集成访问。特别是,通常将HDFS中存储的非结构化大数据与关系数据的集成引起了很多关注。一种主要的解决方案是使用关系查询引擎,该引擎允许类似SQL的查询从HDFS检索数据,这要求系统提供非结构化数据的关系视图,因此并不总是可行的。在本文中,我们介绍了一种类似于SQL的功能性查询语言,该语言可以集成从不同数据存储中检索到的数据,并通过允许用户自定义使用map / filter / reduce运算符来充分利用基础数据处理框架的功能。结合传统的SQL语句。此外,查询语言通过启用子查询重写来实现优化,以便可以将过滤条件尽可能早地推入内部并在数据存储区执行。我们的方法已通过两个数据存储和一个代表性查询进行了验证,该查询演示了查询语言的可用性并评估了查询优化的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号