首页> 外文会议>International conference on big data analytics and knowledge discovery >Rule-Based Multidimensional Data Quality Assessment Using Contexts
【24h】

Rule-Based Multidimensional Data Quality Assessment Using Contexts

机译:基于上下文的基于规则的多维数据质量评估

获取原文

摘要

It is an accepted fact that a value for a data quality metric can be acceptable or not, depending on the context in which data are produced and consumed. In particular, in a data warehouse (DW), the context for the value of a measure is given by the dimensions, and external data. In this paper we propose the use of logic rules to assess the quality of measures in a DW, accounting for the context in which these measures are considered. For this, we propose the use of three sets of rules: one, for representing the DW; a second one, for defining the particular context for the measures in the warehouse; and a third one for representing data quality metrics. This provides an uniform, elegant, and flexible framework for context-aware DW quality assessment. Our representation is implementation independent, and not only allows us to assess the quality of measures at the lowest granularity level in a data cube, but also the quality of aggregate and dimension data.
机译:公认的事实是,取决于数据产生和使用的上下文,数据质量度量的值是否可接受。特别是,在数据仓库(DW)中,度量值的上下文由维度和外部数据给出。在本文中,我们建议使用逻辑规则来评估DW中措施的质量,并考虑考虑这些措施的环境。为此,我们建议使用三套规则:一套用于表示DW;另一套用于表示DW。第二个,用于定义仓库中措施的特定上下文;第三个用于表示数据质量指标。这为上下文感知的DW质量评估提供了一个统一,优雅且灵活的框架。我们的表示方式与实现无关,并且不仅使我们能够评估数据多维数据集中最低粒度级别的度量质量,而且还可以评估聚合和维度数据的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号