
Assessing data quality - A probability-based metric for semantic consistency



Abstract

We present a probability-based metric for semantic consistency using a set of uncertain rules. Unlike existing metrics for semantic consistency, our metric can take into account rules that are expected to be fulfilled with specific probabilities. The resulting metric values represent the probability that the assessed dataset is free of internal contradictions with regard to the uncertain rules and thus have a clear interpretation. The theoretical basis for determining the metric values is provided by statistical tests and the concept of the p-value, which allows the metric value to be interpreted as a probability. We demonstrate the practical applicability and effectiveness of the metric in a real-world setting by analyzing a customer dataset of an insurance company. Here, the metric was applied to identify semantic consistency problems in the data and to support decision-making, for instance, when offering individual products to customers.
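
The abstract does not give the metric's formula, so the following is only a rough sketch of the general idea, not the authors' formulation. It assumes an uncertain rule of the form "records matching an antecedent fulfil a consequent with probability p" and uses the p-value of an exact two-sided binomial test as the consistency value. The function names, the rule, and the sample records are all hypothetical.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_sided_binom_p_value(k, n, p):
    """Exact two-sided binomial p-value.

    Sums the probabilities of all outcomes that are at most as likely
    as the observed count k (small tolerance for floating-point ties).
    """
    observed = binom_pmf(k, n, p)
    total = sum(
        prob
        for prob in (binom_pmf(i, n, p) for i in range(n + 1))
        if prob <= observed * (1 + 1e-12)
    )
    return min(1.0, total)


def rule_consistency(records, antecedent, consequent, expected_prob):
    """Assumed consistency value of a dataset for one uncertain rule.

    Returns None when no record matches the antecedent, since the rule
    then says nothing about the data.
    """
    relevant = [r for r in records if antecedent(r)]
    if not relevant:
        return None
    fulfilled = sum(1 for r in relevant if consequent(r))
    return two_sided_binom_p_value(fulfilled, len(relevant), expected_prob)


# Hypothetical rule: customers under 18 are unmarried with probability 0.95.
def is_minor(record):
    return record["age"] < 18


def is_unmarried(record):
    return record["status"] == "single"


plausible = [{"age": 16, "status": "single"}] * 19 + [{"age": 17, "status": "married"}]
suspicious = [{"age": 16, "status": "single"}] * 10 + [{"age": 17, "status": "married"}] * 10

high = rule_consistency(plausible, is_minor, is_unmarried, 0.95)
low = rule_consistency(suspicious, is_minor, is_unmarried, 0.95)
print(high, low)
```

Under these assumptions, a dataset whose observed fulfilment rate is close to the rule's expected probability receives a value near 1, while a dataset that deviates strongly receives a value near 0. Combining several rules into one metric value, as the paper does, is not covered by this sketch.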

Bibliographic details

  • Source
    Decision Support Systems | 2018, Issue 6 | pp. 95-106 | 12 pages
  • Author affiliations

    Univ Regensburg, Dept Management Informat Syst, Univ Str 31, D-93053 Regensburg, Germany;

    Univ Ulm, Inst Technol & Proc Management, Helmholtzstr 22, D-89081 Ulm, Germany;

    Univ Regensburg, Dept Management Informat Syst, Univ Str 31, D-93053 Regensburg, Germany;

    Univ Regensburg, Dept Management Informat Syst, Univ Str 31, D-93053 Regensburg, Germany;

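  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords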

    Data quality; Data quality assessment; Data quality metric; Data consistency;

  • Date added to database: 2022-08-18 02:13:13
