SYSTEM AND METHOD FOR MANAGING DUPLICATE ENTITIES BASED ON A RELATIONSHIP CARDINALITY IN PRODUCTION KNOWLEDGE BASE REPOSITORY

Abstract

Disclosed are a system and a method for managing one or more duplicate entities based on a relationship cardinality in a production knowledge base repository. The method first performs a first-level detection of duplicates in the existing data of the production knowledge base repository through an object harmonisation module (202). This first-level detection identifies duplicates of one or more attribute objects within a specific entity, and the object harmonisation module (202) applies a sanitization and standardization operation to the identified attribute objects. The method then performs a second-level detection of duplicates between entities of a specific concept through a homogeneity recognition module (204), which identifies duplicates according to the base attributes of that concept, based on a predefined similarity threshold. Finally, the method enables a user to determine the similarity of the entities and to merge the similar entities through an entity conflation and merging module (206).
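The two detection levels and the merge step described in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the function names (`standardize`, `harmonise`, `find_candidates`, `merge`), the dictionary-of-lists entity layout, the whitespace and lowercase normalization, the use of `difflib.SequenceMatcher` as the similarity measure, and the 0.85 threshold. The abstract does not say how relationship cardinality enters the process, so the sketch leaves it out.

```python
from difflib import SequenceMatcher
from itertools import combinations


def standardize(value: str) -> str:
    """Sanitize one attribute value: trim, collapse whitespace, lowercase."""
    return " ".join(value.split()).lower()


def harmonise(entity: dict[str, list[str]]) -> dict[str, list[str]]:
    """First level: drop duplicate attribute objects inside a single entity."""
    cleaned = {}
    for attr, values in entity.items():
        seen = []
        for v in map(standardize, values):
            if v and v not in seen:
                seen.append(v)
        cleaned[attr] = seen
    return cleaned


def similarity(a, b, base_attrs) -> float:
    """Mean string similarity over the base attributes (an assumed metric)."""
    scores = [
        SequenceMatcher(None, " ".join(a.get(k, [])), " ".join(b.get(k, []))).ratio()
        for k in base_attrs
    ]
    return sum(scores) / len(scores) if scores else 0.0


def find_candidates(entities, base_attrs, threshold=0.85):
    """Second level: index pairs of entities that meet the similarity threshold."""
    return [
        (i, j)
        for i, j in combinations(range(len(entities)), 2)
        if similarity(entities[i], entities[j], base_attrs) >= threshold
    ]


def merge(a, b):
    """Merge two entities the user has confirmed as duplicates (union of values)."""
    return harmonise({k: a.get(k, []) + b.get(k, []) for k in a.keys() | b.keys()})


# Hypothetical entities of one concept, with "name" and "city" as base attributes.
raw = [
    {"name": ["Acme  Corp", "acme corp"], "city": ["Pune"]},
    {"name": ["ACME Corp"], "city": ["pune"], "phone": ["123"]},
    {"name": ["Globex"], "city": ["Mumbai"]},
]
entities = [harmonise(e) for e in raw]
pairs = find_candidates(entities, base_attrs=["name", "city"])
```

In this sketch `find_candidates` only proposes pairs. Calling `merge` on a pair stands in for the step where the user reviews the similarity and decides to merge, so nothing is merged automatically.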