DISAMBIGUATION METHOD AND APPARATUS FOR AUTHOR OF PAPER, AND COMPUTER DEVICE

Abstract

A method for disambiguating the authors of papers. The method comprises: forming a name tree from the names of the authors of all papers in a database, according to a preset rule (S1); acquiring a heterogeneous association network corresponding to all the papers in the database (S2); acquiring a semantic representation for each paper in the database (S3); constructing a similarity matrix on the basis of the name tree, the heterogeneous association network and the paper semantic representations (S4); clustering the similarity matrix to obtain paper clusters covering all the papers in the database (S5); determining whether the paper cluster corresponding to an author to be disambiguated belongs to the paper cluster corresponding to a specified author (S6); and if not, determining that the author to be disambiguated is different from the specified author (S7). Author names are preprocessed to construct the name tree, which is then used to eliminate clustering errors caused by the different ways in which the same name can be written. This keeps the name variants of one author in the same group as far as possible and improves the precision of name disambiguation.
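Steps S1 to S7 can be illustrated with a small sketch. The abstract does not specify the preset rule, the network model, the semantic encoder or the clustering algorithm, so everything below is an assumption made for illustration: `name_key` is a hypothetical rule that reduces a name to surname plus first initial (a flat stand-in for the name tree), Jaccard overlap of co-authors stands in for the heterogeneous association network, bag-of-words cosine similarity over titles stands in for the semantic representation, and threshold-based union-find grouping stands in for the clustering step. The sample papers, weights and threshold are invented.

```python
from collections import Counter
from math import sqrt

def name_key(name):
    """Hypothetical preset rule (S1): reduce a name to (surname, first initial)."""
    name = name.replace(".", " ").strip().lower()
    if "," in name:                       # "Li, Wei": surname comes first
        surname, given = [p.strip() for p in name.split(",", 1)]
    else:                                 # "Wei Li": surname comes last
        parts = name.split()
        surname, given = parts[-1], " ".join(parts[:-1])
    return (surname, given[:1])

def jaccard(a, b):
    """Co-author overlap, a simple stand-in for the association network (S2)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0

def cosine(text_a, text_b):
    """Bag-of-words cosine, a simple stand-in for semantic representations (S3)."""
    va, vb = Counter(text_a.lower().split()), Counter(text_b.lower().split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = sqrt(sum(c * c for c in va.values())) * sqrt(sum(c * c for c in vb.values()))
    return dot / norm if norm else 0.0

def similarity_matrix(papers, w_net=0.5, w_sem=0.5):
    """S4: combine name compatibility, network and semantic similarity."""
    n = len(papers)
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                sim[i][j] = 1.0
                continue
            p, q = papers[i], papers[j]
            if name_key(p["author"]) != name_key(q["author"]):
                continue                  # incompatible names keep similarity 0
            net = jaccard(p["coauthors"], q["coauthors"])
            sem = cosine(p["title"], q["title"])
            sim[i][j] = w_net * net + w_sem * sem
    return sim

def cluster(sim, threshold=0.3):
    """S5: link papers whose similarity reaches the threshold (union-find)."""
    parent = list(range(len(sim)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for i in range(len(sim)):
        for j in range(i + 1, len(sim)):
            if sim[i][j] >= threshold:
                parent[find(i)] = find(j)
    return [find(i) for i in range(len(sim))]

def same_author(labels, i, j):
    """S6/S7: papers in different clusters are attributed to different authors."""
    return labels[i] == labels[j]

# Invented sample data: three papers whose author name is written differently.
papers = [
    {"author": "Li, Wei", "coauthors": ["Chen Jing", "Zhao Min"],
     "title": "author name disambiguation with heterogeneous networks"},
    {"author": "W. Li", "coauthors": ["Chen Jing"],
     "title": "clustering papers for author name disambiguation"},
    {"author": "Wei Li", "coauthors": ["Sun Hao"],
     "title": "protein folding dynamics in yeast"},
]
sim = similarity_matrix(papers)
labels = cluster(sim)
print(same_author(labels, 0, 1), same_author(labels, 0, 2))
```

In this toy data all three name variants map to the same key, so the name step alone cannot separate them. The co-author and title signals decide that the first two papers share an author while the third belongs to someone else.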

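Bibliographic details

  • Publication No.: WO2021139256A1

  • Publication date: 2021-07-15

  • Original format: PDF

  • Applicant/Patentee: PING AN TECHNOLOGY (SHENZHEN) CO. LTD.

  • Application No.: WO2020CN118531

  • Inventors: MA WENJIA; LIN GUI; NI YUAN

  • Filing date: 2020-09-28

  • Classification: G06F40/30

  • Country: CN

  • Date added to database: 2022-08-24 19:58:42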
