Conference: International Conference on Very Large Data Bases

Indexing HDFS Data in PDW: Splitting the data from the index


Abstract

There is a growing interest in making relational DBMSs work synergistically with MapReduce systems. However, there are interesting technical challenges associated with figuring out the right balance between the use and co-deployment of these systems. This paper focuses on one specific aspect of this balance, namely how to leverage the superior indexing and query processing power of a relational DBMS for data that is often more cost-effectively stored in Hadoop/HDFS. We present a method to use conventional B+-tree indices in an RDBMS for data stored in HDFS and demonstrate that our approach is especially effective for highly selective queries.
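
The abstract does not describe the mechanism in detail, so the following is only a minimal sketch of the general idea of keeping an index separate from the data it points to. It is not the paper's implementation. It assumes that each index entry maps a key to a byte offset in an external file, uses a sorted list with `bisect` as a stand-in for a B+-tree, and uses an in-memory `io.BytesIO` buffer as a stand-in for a file in HDFS. The names `ExternalIndex`, `key_of`, and `lookup` are hypothetical.

```python
import bisect
import io


class ExternalIndex:
    """Sorted (key, offset) index over a line-oriented file stored elsewhere.

    Hypothetical stand-in: a real system would keep these entries in a
    B+-tree inside the RDBMS and the records in HDFS.
    """

    def __init__(self, data_file, key_of):
        entries = []
        offset = data_file.tell()
        line = data_file.readline()
        while line:
            # Remember where each record starts, not the record itself.
            entries.append((key_of(line), offset))
            offset = data_file.tell()
            line = data_file.readline()
        entries.sort()
        self.keys = [k for k, _ in entries]
        self.offsets = [o for _, o in entries]
        self.data_file = data_file

    def lookup(self, key):
        """Fetch only the records whose key matches, by seeking to them."""
        lo = bisect.bisect_left(self.keys, key)
        hi = bisect.bisect_right(self.keys, key)
        records = []
        for off in self.offsets[lo:hi]:
            self.data_file.seek(off)
            records.append(self.data_file.readline().rstrip(b"\n"))
        return records


def key_of(line):
    # Assumed record layout: "<integer key>,<payload>\n"
    return int(line.split(b",")[0])


data = io.BytesIO(b"7,alice\n3,bob\n7,carol\n1,dave\n")
index = ExternalIndex(data, key_of)
matches = index.lookup(7)
```

A lookup touches only the offsets that match the key, so a highly selective query reads a small fraction of the file instead of scanning all of it, which is the case the abstract singles out.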
