Preparing high-quality data repositories sets utilizing heuristic data analysis

Abstract

A mechanism is provided for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested, thereby forming a set of unprepared data. The set of unprepared data is transformed, based on a set of functions, into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data into the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that track data lineage to ultimately show that the prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data.
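The flow described in the abstract (ingest, transform, describe, test against policies, build) can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: `Transform`, `Prepared`, `ingest`, `transform`, `build_repository`, the `_source` metadata key, and the sample policies. In particular, the sketch uses a fixed text description per transformation as a stand-in for the semantic text descriptions, which the abstract says are generated from sets of semantic associations.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

Record = dict[str, object]


@dataclass
class Transform:
    name: str                        # hypothetical label for the function
    description: str                 # stand-in for a generated semantic text description
    fn: Callable[[Record], Record]   # the transformation applied to each record


@dataclass
class Prepared:
    records: list[Record]
    lineage: list[str] = field(default_factory=list)  # one entry per applied transform


Policy = Callable[[Prepared], bool]


def ingest(sources: dict[str, list[Record]]) -> list[Record]:
    """Collect records from every source, tagging each with its source name."""
    return [{**rec, "_source": name} for name, recs in sources.items() for rec in recs]


def transform(unprepared: list[Record], transforms: list[Transform]) -> Prepared:
    """Apply each transform in order and record a description of what it did."""
    out = Prepared(records=list(unprepared))
    for t in transforms:
        out.records = [t.fn(r) for r in out.records]
        out.lineage.append(f"{t.name}: {t.description}")
    return out


def build_repository(prepared: Prepared, policies: list[Policy]) -> Optional[list[Record]]:
    """Return the repository contents only if every governance policy passes."""
    if all(policy(prepared) for policy in policies):
        return prepared.records
    return None


# Usage with made-up sample data.
sources = {
    "crm": [{"email": " A@Example.com "}],
    "web": [{"email": "b@example.com"}],
}
transforms = [
    Transform(
        name="normalize_email",
        description="trim whitespace and lowercase the email field",
        fn=lambda r: {**r, "email": str(r["email"]).strip().lower()},
    )
]
policies = [
    lambda p: all("_source" in r for r in p.records),  # every record traces to a source
    lambda p: len(p.lineage) == len(transforms),       # every transform is documented
]
prepared = transform(ingest(sources), transforms)
repo = build_repository(prepared, policies)
```

Representing each policy as a predicate over both the records and the lineage log is one way to let compliance checks depend on how the data was produced, not only on its final values. The abstract does not specify how policies are actually expressed.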
