首页> 外文会议>International conference on computational linguistics >A Data Driven Approach for Person Name Disambiguation in Web Search Results
【24h】

A Data Driven Approach for Person Name Disambiguation in Web Search Results

机译:一个数据驱动方法,用于Web搜索结果中的人名歧义

获取原文

摘要

This paper presents an unsupervised approach for the task of clustering the results of a search engine when the query is a person name shared by different individuals. We propose an algorithm that calculates the number of clusters and establishes the groups of web pages according to the different individuals without the need to any training data or predefined thresholds, as the successful state of the art systems do. In addition, most of those systems do not deal with social media web pages and their performance could fail in a real scenario. In this paper we also propose a heuristic method for the treatment of social networking profiles. Our approach is compared with four gold standard collections for this task obtaining really competitive results, comparable to those obtained by some approaches with supervision.
机译:本文介绍了在查询是由不同个人共享的人名时聚类搜索引擎的结果的任务的无监督方法。 我们提出了一种算法,其计算群集的数量,并根据不同的个体建立网页的组,而无需任何训练数据或预定义的阈值,作为艺术系统的成功状态。 此外,大多数系统都不应处理社交媒体网页,并且它们的性能可能会在真实方案中失败。 在本文中,我们还提出了一种治疗社交网络概况的启发式方法。 我们的方法与这项任务的四个金标准集合进行了比较,获得了真正竞争的结果,可与由某些方法与监督获得的方法相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号