首页> 外文期刊>Journal of the Chinese Institute of Engineers >ENHANCING DATA QUALITY THROUGH ATTRIBUTE-BASED METADATA AND COST EVALUATION IN DATA WAREHOUSE ENVIRONMENTS
【24h】

ENHANCING DATA QUALITY THROUGH ATTRIBUTE-BASED METADATA AND COST EVALUATION IN DATA WAREHOUSE ENVIRONMENTS

机译:通过基于属性的元数据增强数据质量,并在数据仓库环境中进行成本评估

获取原文
获取原文并翻译 | 示例
           

摘要

Data quality will be a significant issue as data warehousing becomes more and more popular. This paper aims at investigating and analyzing the data quality issues in data warehouse environments. We present an attribute-based metadata model for identifying data quality. A four-phase process is introduced for data quality management during the life cycle of data warehouses. Overall data quality conditions can be identified and related information can be provided for determining whether the data meet "fit to use" criteria and whether they need to be improved. Furthermore, we use a cost/benefit evaluation model to ferret out the poor-quality data and set priorities for improvement given limited resources. Our approach allows system developers to document relevant quality data as metadata, which may be associated with the whole life cycle of data warehouses. Quality metadata not only can enrich the interpretation of attribute data, but can also provide diagnostic information for finding the reasons for and the sources of errors. In addition, the cost/benefit evaluation model developed may provide a foundation for the quantitative analysis of data quality.
机译:随着数据仓库越来越流行,数据质量将成为一个重要问题。本文旨在调查和分析数据仓库环境中的数据质量问题。我们提出了一种基于属性的元数据模型,用于识别数据质量。在数据仓库的生命周期中,引入了一个四阶段的过程来进行数据质量管理。可以识别总体数据质量状况,并可以提供相关信息来确定数据是否满足“适合使用”标准以及是否需要对其进行改进。此外,我们使用成本/收益评估模型来找出质量较差的数据,并在资源有限的情况下为改进设置优先级。我们的方法允许系统开发人员将相关的质量数据记录为元数据,该数据可能与数据仓库的整个生命周期相关联。高质量的元数据不仅可以丰富属性数据的解释,而且还可以提供诊断信息以查找错误的原因和来源。此外,开发的成本/效益评估模型可以为数据质量的定量分析提供基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号