【24h】

Managing and Querying Versions of Multiversion Data Warehouse

机译:管理和查询多版本数据仓库的版本

获取原文
获取原文并翻译 | 示例

摘要

A data warehouse (DW) is a database that integrates external data sources (EDSs) for the purpose of advanced data analysis. The methods of designing a DW usually assume that a DW has a static schema and structures of dimensions. In practice, schema and dimensions' structures often change as the result of the evolution of EDSs, changes of the real world represented in a DW, new user requirements, new versions of software being installed, and system tuning activities. Examples of various change scenarios can be found in [1,8]. Handling schema and dimension changes is often supported by schema evolution, temporal extensions, and versioning extensions. Schema evolution approaches maintain one DW schema and the set of data that evolve in time. Temporal versioning techniques use timestamps on modified data in order to create temporal versions. In versioning extensions, a DW evolution is managed partially by means of schema versions and partially by data versions. These approaches solve the DW evolution problem partially. Firstly, they do not offer a clear separation between different DW states. Secondly, the approaches do not support modeling alternative, hypothetical DW states required for a what-if analysis. In order to eliminate the limitations of the aforementioned approaches, we propose a multiversion data warehouse.
机译:数据仓库(DW)是一个数据库,该数据库集成了外部数据源(EDS),以进行高级数据分析。设计DW的方法通常假定DW具有静态架构和尺寸结构。实际上,由于EDS的发展,DW中代表的真实世界的变化,新的用户需求,正在安装的软件的新版本以及系统调整活动,架构和维度的结构通常会发生变化。各种变更方案的示例可以在[1,8]中找到。模式演变,时间扩展和版本控制扩展通常支持处理模式和维度更改。模式演变方法维护一个DW模式和一组随时间变化的数据。时间版本控制技术在修改后的数据上使用时间戳,以创建时间版本。在版本扩展中,DW演变部分通过架构版本进行管理,部分通过数据版本进行管理。这些方法部分地解决了DW演进问题。首先,它们没有提供不同DW状态之间的明确分隔。其次,这些方法不支持对假设分析所需的替代性假设DW状态进行建模。为了消除上述方法的局限性,我们提出了一种多版本数据仓库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号