...
首页> 外文期刊>Computational & Mathematical Organization Theory >A Network Analysis Model for Disambiguation of Names in Lists
【24h】

A Network Analysis Model for Disambiguation of Names in Lists

机译:列表中名称歧义的网络分析模型

获取原文
获取原文并翻译 | 示例

摘要

In research and application, social networks are increasingly extracted from relationships inferred by name collocations in text-based documents. Despite the fact that names represent real entities, names are not unique identifiers and it is often unclear when two name observations correspond to the same underlying entity. One confounder stems from ambiguity, in which the same name correctly references multiple entities. Prior name disambiguation methods measured similarity between two names as a function of their respective documents. In this paper, we propose an alternative similarity metric based on the probability of walking from one ambiguous name to another in a random walk of the social network constructed from all documents. We experimentally validate our model on actor-actor relationships derived from the Internet Movie Database. Using a global similarity threshold, we demonstrate random walks achieve a significant increase in disambiguation capability in comparison to prior models.
机译:在研究和应用中,越来越多地从基于文本的文档中由名称搭配推断出的关系中提取社交网络。尽管名称代表真实实体,但名称并不是唯一的标识符,而且经常不清楚两个名称观察值何时对应于相同的基础实体。一个混淆因素源于模棱两可,其中相同的名称正确地引用了多个实体。在先名称消除歧义的方法根据它们各自的文档来衡量两个名称之间的相似性。在本文中,我们基于从所有文档构成的社交网络的随机游走中,从一个模棱两可的名字走到另一个歧义的名字的概率,提出了一种替代相似性度量。我们通过实验验证了基于Internet电影数据库中的演员与演员关系的模型。使用全局相似性阈值,我们证明了与以前的模型相比,随机游走的消歧能力显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号