首页> 外文学位 >Incremental maintenance of consistent data warehouses.
【24h】

Incremental maintenance of consistent data warehouses.

机译:增量维护一致的数据仓库。

获取原文
获取原文并翻译 | 示例

摘要

A data warehouse stores information integrated from distributed and possibly heterogeneous information sources. In effect, the warehouse stores materialized views over the source data. This dissertation studies the maintenance of warehouse views as the data sources are updated.; The first part of this dissertation presents a family of algorithms that incrementally and consistently maintain relational materialized views in a data warehouse. This view maintenance problem differs from the traditional one in that the view definition and the base data are decoupled, and data sources are autonomous. We show that this decoupling can result in anomalies if traditional algorithms are applied. We formalize notions of consistency for warehouse views and present new algorithms that maintain consistency as the warehouse is updated. In addition, we develop simple, scalable, algorithms for ensuring mutual consistency among multiple views at a warehouse. We also present the implementation of the algorithms in the WHIPS (WareHousing Information Project at Stanford) prototype and related performance results.; The second part of this dissertation studies how to maintain graph-structured materialized views. A graph-structured database consists of records containing identifiers of other records. The data could represent semi-structured information such as Web pages, documents, XML data, or data integrated from heterogeneous data sources. We define views and materialized views for such graph-structured data, analyzing options for representing record identity and references in the view. We then develop incremental maintenance algorithms for these views, discuss how to realize these algorithms in a data warehouse, and study how to maintain the warehouse views without accessing base data.
机译:数据仓库存储从分布式信息源(可能是异构信息源)集成的信息。实际上,仓库存储了源数据的物化视图。本文研究了数据源更新时仓库视图的维护。本文的第一部分介绍了一系列算法,这些算法可在数据仓库中逐步并一致地维护关系物化视图。这种视图维护问题与传统视图维护问题的不同之处在于,视图定义和基础数据是分离的,并且数据源是自主的。我们表明,如果应用传统算法,这种去耦会导致异常。我们将仓库视图的一致性概念形式化,并提出了在仓库更新时保持一致性的新算法。此外,我们开发了简单,可扩展的算法,以确保仓库中多个视图之间的相互一致性。我们还将介绍WHIPS(斯坦福大学的仓库信息项目)原型中算法的实现以及相关的性能结果。本文的第二部分研究如何维护图结构的物化视图。图形结构的数据库由包含其他记录的标识符的记录组成。数据可以表示半结构化信息,例如网页,文档,XML数据或从异构数据源集成的数据。我们为此类图结构数据定义视图和实例化视图,分析用于表示视图中的记录标识和引用的选项。然后,我们为这些视图开发增量维护算法,讨论如何在数据仓库中实现这些算法,并研究如何在不访问基础数据的情况下维护仓库视图。

著录项

  • 作者

    Zhuge, Yue.;

  • 作者单位

    Stanford University.;

  • 授予单位 Stanford University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1999
  • 页码 239 p.
  • 总页数 239
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号