Methods and systems for discovery of linkage points between data sources

Abstract

Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
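The steps described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract only says that values must satisfy "a predetermined similarity threshold", so the use of Jaccard similarity over normalized value sets, the exact-match rule for linking records, and every name below (`find_linkage_points`, `link_records`, the sample `crm` and `filings` datasets) are assumptions made for illustration.

```python
from itertools import combinations


def normalize(value):
    """Lowercase and strip a value so trivially different spellings compare equal."""
    return str(value).strip().lower()


def attribute_values(records, attribute):
    """Collect the set of normalized values an attribute takes in a dataset."""
    return {normalize(r[attribute]) for r in records if attribute in r}


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def find_linkage_points(datasets, threshold=0.5):
    """Return (dataset_a, attr_a, dataset_b, attr_b, score) for every attribute
    pair whose value sets meet the similarity threshold (assumed: Jaccard)."""
    points = []
    for (name_a, recs_a), (name_b, recs_b) in combinations(datasets.items(), 2):
        attrs_a = sorted({k for r in recs_a for k in r})
        attrs_b = sorted({k for r in recs_b for k in r})
        for attr_a in attrs_a:
            vals_a = attribute_values(recs_a, attr_a)
            for attr_b in attrs_b:
                score = jaccard(vals_a, attribute_values(recs_b, attr_b))
                if score >= threshold:
                    points.append((name_a, attr_a, name_b, attr_b, score))
    return points


def link_records(datasets, points):
    """Pair up records whose normalized values agree on a linkage point."""
    links = []
    for name_a, attr_a, name_b, attr_b, _score in points:
        for i, rec_a in enumerate(datasets[name_a]):
            for j, rec_b in enumerate(datasets[name_b]):
                if (attr_a in rec_a and attr_b in rec_b
                        and normalize(rec_a[attr_a]) == normalize(rec_b[attr_b])):
                    links.append(((name_a, i), (name_b, j)))
    return links


# Hypothetical sample data: two datasets describing overlapping entities
# under different attribute names.
datasets = {
    "crm": [
        {"company": "Acme Corp", "city": "Boston"},
        {"company": "Globex", "city": "Springfield"},
    ],
    "filings": [
        {"org_name": "acme corp", "year": 2012},
        {"org_name": "Initech", "year": 2013},
    ],
}

points = find_linkage_points(datasets, threshold=0.3)
links = link_records(datasets, points)
```

With this sample data, the only attribute pair that clears the threshold is `company` in `crm` against `org_name` in `filings`, and the only linked pair is the two "Acme Corp" records. A real system would likely need approximate string matching and indexing to avoid comparing every attribute pair and every record pair exhaustively.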
