首页> 外国专利> Systematic approach to determine source of data quality issue in data flow in an enterprise

Systematic approach to determine source of data quality issue in data flow in an enterprise

机译:确定企业数据流中数据质量问题来源的系统方法

摘要

A method may include applying periodically a data validation rule to data transformed through a data processing system, wherein the data validation rule applies aspects selected from a group consisting of data value range, specific data values, and relationship with other data entries; responsive to detecting a violation of the data validation rule, identifying a portion of the transformed data for lineage assessment; examining the identified transformed data iteratively upstream at a previous transformation node in a lineage graph, until the method detects a node where the violation of the data validation rule can't be reproduced; creating a separate node in a distributed network for each of the previous transformation nodes in the lineage graph; and identifying the separate node in the distributed network introducing the violation of the data validation rule.
机译:一种方法可以包括将数据验证规则周期性地应用于通过数据处理系统转换的数据,其中,数据验证规则应用从包括以下各项的组中选择的方面:数据值范围,特定数据值以及与其他数据条目的关系。响应于检测到违反数据验证规则,识别一部分转换后的数据用于谱系评估;在谱系图中的先前转换节点的上游迭代地检查已标识的转换数据,直到该方法检测到无法再现违反数据验证规则的节点为止;为沿袭图中每个先前的转换节点在分布式网络中创建一个单独的节点;确定在分布式网络中引入了违反数据验证规则的单独节点。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号