Conference: International Conference on Contemporary Computing

Author name disambiguation using vector space model and hybrid similarity measures


Abstract

Distinguishing people by name alone has always been difficult, and the need to group people within a particular domain by their attributes keeps growing. Despite years of research and many proposed techniques, the name ambiguity problem remains largely unsolved, and each technique proposed so far has its own limitations. For author name disambiguation in digital citations, additional attributes that are normally available in publications, such as the e-mail ID and affiliation of the author and co-authors, can help considerably. The vector space model has traditionally been used with great success in information retrieval, and here we explore its use for author name disambiguation. In this paper we propose an enhanced vector space model for disambiguating authors and their publications. Experimental results show that the additional attributes present in publications help considerably and resolve name ambiguity with a high degree of confidence. From our study and experimental results we conclude that both the mixed citation and split citation problems can be handled very efficiently. Evaluation metrics improved substantially, with an F1 score of 0.97.
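The abstract does not spell out how the enhanced vector space model or the hybrid similarity measure is built, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes title words are compared as term-frequency vectors with cosine similarity, co-authors and affiliation words are compared with Jaccard overlap, and a shared e-mail ID is treated as decisive. The function names, record fields, weights, and sample records are all hypothetical.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap between two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def hybrid_similarity(p: dict, q: dict, weights=(0.4, 0.3, 0.3)) -> float:
    """Combine per-attribute similarities into one score in [0, 1].

    Assumption: the weights and the e-mail shortcut are illustrative only;
    the abstract does not state how attributes are combined.
    """
    w_text, w_coauth, w_affil = weights
    # Assumption: an identical, non-empty e-mail ID identifies the same author.
    if p.get("email") and p.get("email") == q.get("email"):
        return 1.0
    text = cosine(Counter(p["title"].lower().split()),
                  Counter(q["title"].lower().split()))
    coauth = jaccard(set(p["coauthors"]), set(q["coauthors"]))
    affil = jaccard(set(p["affiliation"].lower().split()),
                    set(q["affiliation"].lower().split()))
    return w_text * text + w_coauth * coauth + w_affil * affil


# Hypothetical citation records that share the ambiguous name "J. Smith".
rec_a = {"title": "name disambiguation in digital libraries",
         "coauthors": ["R. Kumar", "S. Lee"],
         "affiliation": "Example University",
         "email": "j.smith@example.edu"}
rec_b = {"title": "disambiguation of author names",
         "coauthors": ["S. Lee"],
         "affiliation": "Example University",
         "email": "j.smith@example.edu"}
rec_c = {"title": "protein folding simulation",
         "coauthors": ["M. Chen"],
         "affiliation": "Other Institute",
         "email": ""}

same_author_score = hybrid_similarity(rec_a, rec_b)
different_author_score = hybrid_similarity(rec_a, rec_c)
```

In a sketch like this, pairs scoring above a chosen threshold would be grouped under one author, which addresses split citations, while low-scoring pairs under the same name would be separated, which addresses mixed citations. The threshold itself would need to be tuned on labeled data.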
