首页> 外文期刊>ACM transactions on database systems >Building a Hybrid Warehouse: Efficient Joins between Data Stored in HDFS and Enterprise Warehouse
【24h】

Building a Hybrid Warehouse: Efficient Joins between Data Stored in HDFS and Enterprise Warehouse

机译:构建混合仓库:HDFS中存储的数据与企业仓库之间的有效联接

获取原文
获取原文并翻译 | 示例

摘要

The Hadoop Distributed File System (HDFS) has become an important data repository in the enterprise as the center for all business analytics, from SQL queries and machine learning to reporting. At the same time, enterprise data warehouses (EDWs) continue to support critical business analytics. This has created the need for a new generation of a special federation between Hadoop-like big data platforms and EDWs, which we call the hybrid warehouse. There are many applications that require correlating data stored in HDFS with EDW data, such as the analysis that associates click logs stored in HDFS with the sales data stored in the database. All existing solutions reach out to HDFS and read the data into the EDW to perform the joins, assuming that the Hadoop side does not have efficient SQL support.
机译:Hadoop分布式文件系统(HDFS)已成为企业中重要的数据存储库,作为从SQL查询和机器学习到报告的所有业务分析的中心。同时,企业数据仓库(EDW)继续支持关键业务分析。这就需要在类似Hadoop的大数据平台与EDW(我们称为混合仓库)之间建立新一代的特殊联盟。有许多应用程序需要将HDFS中存储的数据与EDW数据相关联,例如将HDFS中存储的点击日志与数据库中存储的销售数据相关联的分析。假设Hadoop端没有有效的SQL支持,所有现有的解决方案都可以连接到HDFS并将数据读入EDW以执行联接。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号