
A Mixture Record Linkage Approach for US Patent Inventor Disambiguation



Abstract

Inventor name disambiguation is the task of distinguishing each unique inventor from all other inventor records in a patent database. The task is essential for processing person-name queries that retrieve information about a particular inventor. We propose a mixture approach that combines supervised learning, stochastic record linkage, and a rule-based method to determine whether each pair of inventor records refers to the same inventor. Tested on the USPTO patent database, our algorithm disambiguated 12 million inventor records in 7 h. Evaluation on the labeled dataset from the USPTO PatentsView inventor name disambiguation competition showed that our approach produces excellent results.
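The abstract does not give the details of the pairwise decision, so the following is only a minimal sketch of how a stochastic (Fellegi-Sunter style) match score might be combined with a rule-based check for one pair of inventor records. The field names, the m/u probabilities, the threshold, and the last-name rule are all illustrative assumptions, not values or rules taken from the paper. In a supervised setting, the m/u probabilities would typically be estimated from labeled matching and non-matching pairs rather than hard-coded.

```python
import math

# Hypothetical per-field parameters (assumptions, not from the paper):
#   m = P(field agrees | records belong to the same inventor)
#   u = P(field agrees | records belong to different inventors)
FIELD_PARAMS = {
    "last_name": (0.95, 0.01),
    "first_name": (0.90, 0.05),
    "city": (0.70, 0.02),
    "assignee": (0.60, 0.01),
}


def match_weight(rec_a, rec_b, params=FIELD_PARAMS):
    """Sum of Fellegi-Sunter log2 agreement/disagreement weights."""
    total = 0.0
    for field, (m, u) in params.items():
        a, b = rec_a.get(field), rec_b.get(field)
        # A missing value is treated as a disagreement in this sketch.
        if a is not None and a == b:
            total += math.log2(m / u)
        else:
            total += math.log2((1 - m) / (1 - u))
    return total


def same_inventor(rec_a, rec_b, threshold=5.0):
    """Rule-based veto first, then the stochastic score against a threshold."""
    # Illustrative rule: records with different last names are never linked.
    if rec_a.get("last_name") != rec_b.get("last_name"):
        return False
    return match_weight(rec_a, rec_b) >= threshold


if __name__ == "__main__":
    r1 = {"last_name": "smith", "first_name": "john",
          "city": "austin", "assignee": "acme"}
    r2 = {"last_name": "smith", "first_name": "john",
          "city": "austin", "assignee": "acme"}
    print(same_inventor(r1, r2))
```

Scoring every pair among millions of records is quadratic, so a practical pipeline would normally compare only records that share a blocking key (for example, the same last name); whether and how the paper does this is not stated in the abstract.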
