【24h】

Author Name Disambiguation Based on Rule and Graph Model

机译:作者名称基于规则和图形模型消歧

获取原文
获取外文期刊封面目录资料

摘要

Author name disambiguation has long been viewed as a challenging problem in scientific literature management, and with the substantial growth of the scientific literature, the solution to this problem has become increasingly difficult and urgency. In this paper, we conduct research on the author name disambiguation problem in large-scale academic papers. In our method, we combine the paper feature information and the relation information between the papers for disambiguation. Based on the Aminer's disambiguation framework, we present a novel method to constructing the paper relation graph based on atomic cluster and propose an efficient post processing algorithm, aiming to improve the disambiguation performance by rule-based clustering, this algorithm utilizes similarity features based on metadata information and implement two types of disambiguation rules. We carefully evaluate the proposed disambiguation method on real-world large data and experimental result shows that our method achieves clearly better performance than the state-of-the-art methods.
机译:作者姓名歧义长期以来一直被视为科学文学管理中有挑战性的问题,并以科学文献的大量增长,解决这个问题的解决方案越来越困难和紧迫性。本文在大型学术论文中对作者名称消歧问题进行了研究。在我们的方法中,我们将纸质特征信息和纸张之间的关系信息组合在歧义。基于Aminer的歧义框架,我们提出了一种基于原子群构建纸张关系图的新方法,并提出了一种高效的后处理算法,旨在通过基于规则的聚类来提高消歧性能,该算法利用基于元数据的相似性特征信息并实施两种消费歧义规则。我们仔细评估了真实世界大数据和实验结果的提出的消歧方法表明,我们的方法比现有技术的方法明显地实现了更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号