首页> 外文会议>SIGMOD/PODS 2007 >The Case for a Wide-Table Approach to Manage Sparse Relational Data Sets
【24h】

The Case for a Wide-Table Approach to Manage Sparse Relational Data Sets

机译:稀疏关系数据集的宽表方法案例

获取原文

摘要

A "sparse" data set typically has hundreds or even thousands of attributes, but most objects have non-null values for only a small number of these attributes. A popular view about sparse data is that it arises merely as the result of poor schema design. In this paper, we argue that rather than being the result of inept schema design, storing a sparse data set in a single table is the right way to proceed. However, for this to be the case, RDBMSs must provide sparse data management facilities that go beyond the previously studied requirement of storing such data sets efficiently. In particular, an RDBMS must 1) enable users to effectively build ad hoc queries over a very large number of attributes, and 2) support efficient evaluation of these queries over a wide, sparse table. We propose techniques that provide these capabilities, and argue that the single-table approach is a necessary component of selfmanaging database systems because it frees users from a tedious and potentially ineffective schema-design phase when managing sparse data sets.
机译:“稀疏”数据集通常具有数百甚至数千个属性,但是大多数对象仅对其中少数属性具有非空值。关于稀疏数据的一种流行观点是,它仅是由于不良的架构设计而导致的。在本文中,我们认为,将稀疏数据集存储在单个表中不是正确的方案设计的结果,而是正确的处理方式。但是,对于这种情况,RDBMS必须提供稀疏的数据管理工具,这超出了先前研究的有效存储此类数据集的要求。特别是,RDBMS必须1)使用户能够有效地基于大量属性构建即席查询,以及2)支持在较宽的稀疏表上对这些查询进行有效的评估。我们提出了提供这些功能的技术,并认为单表方法是自我管理数据库系统的必要组成部分,因为在管理稀疏数据集时,它使用户摆脱了繁琐且可能无效的模式设计阶段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号