首页> 外国专利> Entity resolution system identifying non-distinct names in a set of names

Entity resolution system identifying non-distinct names in a set of names

机译:实体解析系统,用于识别一组名称中的不同名称

摘要

A method for identifying non-distinct names in a set of names (e.g. people, buildings, places, organizations, documents, cars, tings, objects etc.) comprises obtaining the set of names for a first entity and in response to comparing a first name and a second name in the set of names, it is determined that the first name is similar to the second name. Initials in the first name and the second name are searched for. In response to the search indicating that there is at least one initial in at least one of the first name and the second name, it is determined that the at least one initial matches a corresponding initial in another one of the first name and the second name and one of the first name and the second name are marked as a non-distinct name. The remaining names are considered to be distinct names to be used for scoring against another set of names for a second entity using a cross-entity scoring technique. Thus, the entity resolution system finds the most distinct name(s) among a group of related names for a single entity before attempting to resolve possibly related entities.
机译:一种用于识别一组名称(例如,人,建筑物,地点,组织,文档,汽车,颜色,物体等)中不同名称的方法,包括获取第一实体的一组名称,并响应于比较第一名称和名称中的第二名称,确定第一名称与第二名称相似。搜索名字和名字的缩写。响应于搜索指示在第一名称和第二名称中的至少一个中存在至少一个首字母,确定至少一个初始与第一名称和第二名称中的另一个中的对应的首字母相匹配名字和第二名字中的一个被标记为非唯一的名字。其余名称被认为是不同的名称,用于使用跨实体评分技术针对第二个实体的另一组名称进行评分。因此,实体解析系统在尝试解析可能的相关实体之前,在单个实体的一组相关名称中找到最不同的名称。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号