【24h】

A comparative analysis of data warehouse data models

机译:数据仓库数据模型的比较分析

获取原文
获取原文并翻译 | 示例

摘要

The main purpose of data warehouses (DW) is to maintain large volumes of historical data (originating from multiple heterogeneous data sources and representing the different states of a system caused by various business events or activities) in a format that facilitates its analysis in order to support timelier and better decision-making, at both the operational and strategic level. In order for a data warehouse to be able to adequately fulfill this purpose, its data model must enable the appropriate and consistent representation of the different states of a system. In effect, a DW data model, representing the physical structure of the DW, must be general enough, to be able to consume data from heterogeneous data sources (where all of the data should be treated as relevant data and it must be possible to trace it back to its source) and reconcile the semantic differences of the data source models, and, at the same time, be resilient to the constant changes in the structure of the data sources. One of the main problems related to DW development is the absence of a standardized DW data model. In this paper a comparative analysis of the four most prominent DW data models (namely the relationalormalized model, data vault model, anchor model and dimensional model) will be given. These models will be analyzed and compared on the basis of the following criteria: (1) semantics (i.e. the fundamental concepts), (2) resilience of the data model with regard to changes in the structure of the data sources, (3) temporal aspects and (4) completeness and traceability of the data. By identifying the strengths and weaknesses of each of these models it would be possible to establish the foundation for a new DW data model which would more adequately fulfill the posed requirements.
机译:数据仓库(DW)的主要目的是以易于分析的格式来维护大量历史数据(源自多个异构数据源并代表由各种业务事件或活动引起的系统的不同状态)。在运营和战略层面上支持更及时,更好的决策。为了使数据仓库能够充分满足此目的,其数据模型必须能够适当且一致地表示系统的不同状态。实际上,代表DW物理结构的DW数据模型必须足够通用,才能使用来自异构数据源的数据(其中所有数据都应视为相关数据,并且必须能够进行跟踪)数据源模型),并调和数据源模型的语义差异,并同时适应数据源结构的不断变化。与DW开发相关的主要问题之一是缺乏标准化的DW数据模型。本文将对四种最重要的DW数据模型(即关系/规范化模型,数据保险库模型,锚模型和维模型)进行比较分析。这些模型将根据以下标准进行分析和比较:(1)语义(即基本概念),(2)数据模型在数据源结构变化方面的弹性,(3)时间性方面;(4)数据的完整性和可追溯性。通过确定每个模型的优点和缺点,可以为新的DW数据模型建立基础,该模型可以更充分地满足所提出的要求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号