【24h】

A Novel Data Quality Metric for Minimality

机译:最小性的新型数据质量度量

获取原文

摘要

The development of well-founded metrics to measure data quality is essential to estimate the significance of data-driven decisions, which are, besides others, the basis for artificial intelligence applications. While the majority of research into data quality refers to the data values of an information system, less research is concerned with schema quality. However, a poorly designed schema negatively impacts the quality of the data, for example, redundancies at the schema-level lead to inconsistencies and anomalies at the data-level. In this paper, we propose a new metric to measure the minimality of a schema, which is an important indicator to detect redundancies. We compare it to other minimality metrics and show that it is the only one that fulfills all requirements for a sound data quality metric. In our ongoing research, we are evaluating the benefits of the metric in more detail and investigate its applicability for redundancy detection in data values.
机译:衡量数据质量的良好度量的发展是必不可少的,估计数据驱动的决策的重要性,除了他人,这是人工智能应用的基础。虽然大多数研究数据质量指的是信息系统的数据值,但较少的研究涉及架构质量。然而,设计不良的模式对数据的质量产生负面影响,例如,架构级别的冗余导致数据级别的不一致性和异常。在本文中,我们提出了一种新的指标来衡量模式的最小值,这是检测冗余的重要指标。我们将其与其他最小度量进行比较,并表明它是唯一满足声音数据质量度量要求的所有要求的指标。在我们正在进行的研究中,我们更详细地评估了度量标准的好处,并调查其在数据值中冗余检测的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号