首页> 外文会议>International Workshop on Data Quality and Trust in Big Data >Automated Schema Quality Measurement in Large-Scale Information Systems
【24h】

Automated Schema Quality Measurement in Large-Scale Information Systems

机译:大型信息系统中的自动模式质量测量

获取原文

摘要

Assessing the quality of information system schemas is crucial, because an unoptimized or erroneous schema design has a strong impact on the quality of the stored data, e.g., it may lead to inconsistencies and anomalies at the data-level. Even if the initial schema had an ideal design, changes during the life cycle can negatively affect the schema quality and have to be tackled. Especially in Big Data environments there are two major challenges: large schemas, where manual verification of schema and data quality is very arduous, and the integration of heterogeneous schemas from different data models, whose quality cannot be compared directly. Thus, we present a domain-independent approach for automatically measuring the quality of large and heterogeneous (logical) schemas. In contrast to existing approaches, we provide a fully automatable workflow that also enables regular reassessment. Our implementation allows to measure the quality dimensions correctness, completeness, pertinence, minimality, readability, and normalization.
机译:评估信息系统模式的质量至关重要,因为未优化或错误的模式设计对存储数据的质量产生强烈影响,例如,它可能导致数据级别的不一致和异常。即使初始模式具有理想的设计,即使是理想的设计,生命周期中的变化也会对模式质量产生负面影响,并且必须解决。特别是在大数据环境中存在两个主要挑战:大模式,其中图案验证的模式和数据质量非常艰巨,并且来自不同数据模型的异构模式的集成,其质量不能直接比较。因此,我们提出了一种独立的方法,用于自动测量大型和异构(逻辑)模式的质量。与现有方法相比,我们提供完全自动的工作流程,也可以定期重新评估。我们的实施允许测量质量尺寸正确,完整性,满足,最小,可读性和标准化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号