
ULDBs: Databases with Uncertainty and Lineage

Abstract

This paper introduces ULDBs, an extension of relational databases with simple yet expressive constructs for representing and manipulating both lineage and uncertainty. Uncertain data and data lineage are two important areas of data management that have been considered extensively in isolation; however, many applications require the features in tandem. Fundamentally, lineage enables simple and consistent representation of uncertain data, it correlates uncertainty in query results with uncertainty in the input data, and query processing with lineage and uncertainty together presents computational benefits over treating them separately. We show that the ULDB representation is complete, and that it permits straightforward implementation of many relational operations. We define two notions of ULDB minimality, data-minimal and lineage-minimal, and study minimization of ULDB representations under both notions. With lineage, derived relations are no longer self-contained: their uncertainty depends on uncertainty in the base data. We provide an algorithm for the new operation of extracting a database subset in the presence of interconnected uncertainty. Finally, we show how ULDBs enable a new approach to query processing in probabilistic databases. ULDBs form the basis of the Trio system under development at Stanford.
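
The abstract does not spell out the data model, so the following is only an illustrative sketch of the claim that lineage ties uncertainty in derived data to uncertainty in base data. It assumes that an uncertain base tuple is a list of mutually exclusive alternatives and that each derived row records the base alternatives it came from. The relation contents, the names `base`, `derived`, and `possible_worlds`, and the dictionary layout are all hypothetical and are not taken from the paper or from Trio.

```python
from itertools import product

# Hypothetical base data: each tuple id maps to its mutually exclusive alternatives.
base = {
    "s1": ["Honda", "Toyota"],  # uncertain: exactly one of the two holds
    "s2": ["Mazda"],            # certain: a single alternative
}

# Hypothetical derived rows. "lineage" is the set of (tuple id, alternative index)
# pairs in the base data that the row was computed from.
derived = [
    {"value": "suspect drives Honda", "lineage": {("s1", 0)}},
    {"value": "suspect drives Toyota", "lineage": {("s1", 1)}},
    {"value": "suspect drives Mazda", "lineage": {("s2", 0)}},
]


def possible_worlds(base, derived):
    """Enumerate worlds by picking one alternative per base tuple.

    A derived row appears in a world only if every base alternative
    in its lineage was picked for that world.
    """
    ids = sorted(base)
    worlds = []
    for choice in product(*(range(len(base[i])) for i in ids)):
        chosen = set(zip(ids, choice))
        base_rows = {i: base[i][k] for i, k in chosen}
        derived_rows = [d["value"] for d in derived if d["lineage"] <= chosen]
        worlds.append((base_rows, derived_rows))
    return worlds


if __name__ == "__main__":
    for base_rows, derived_rows in possible_worlds(base, derived):
        print(sorted(base_rows.items()), derived_rows)
```

In this sketch the derived relation is not self-contained: which of its rows are present can only be decided by consulting the choices made in the base data, so the Honda and Toyota rows never appear in the same world. Brute-force enumeration is used purely for readability and is exponential in the number of uncertain tuples.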
