Conference: 19th International World Wide Web Conference 2010

Efficient Web Pages Identification for Entity Resolution

Abstract

Entity resolution (ER) is a problem that arises in many areas. In most cases, the task is to determine whether multiple entities from different sources refer to the same object or to different objects, because no unique identifiers are associated with them. In this paper, we propose a model that uses web page identification to identify entities and to merge those that refer to the same object. We use a classical name disambiguation problem as a case study and, as the first stage of our work, evaluate our model on a subset of digital library records. The favorable results indicate that the proposed approach is highly effective.
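
The abstract does not spell out the model itself, so the following is only a minimal sketch of the general idea of merging records that lack unique identifiers. It assumes, purely for illustration, that each record has already been associated with a set of web page URLs, and that two records sharing any URL refer to the same object. The function name `merge_by_shared_page`, the record ids, and the URLs are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def merge_by_shared_page(records):
    """Group record ids that share at least one identified web page URL.

    `records` maps a record id to the set of page URLs attributed to it.
    This input shape is an assumption; how pages are identified for a
    record is outside the scope of this sketch.
    """
    parent = {rid: rid for rid in records}

    def find(x):
        # Follow parent links to the cluster representative (path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    first_owner = {}  # URL -> first record seen with that URL
    for rid, urls in records.items():
        for url in urls:
            if url in first_owner:
                union(first_owner[url], rid)
            else:
                first_owner[url] = rid

    clusters = defaultdict(set)
    for rid in records:
        clusters[find(rid)].add(rid)
    return sorted(sorted(c) for c in clusters.values())


# Hypothetical digital-library records for the ambiguous name "J. Smith".
records = {
    "rec1": {"http://example.org/~jsmith"},
    "rec2": {"http://example.org/~jsmith", "http://example.org/lab"},
    "rec3": {"http://example.com/john-smith"},
}
clusters = merge_by_shared_page(records)
print(clusters)
```

In this toy input, `rec1` and `rec2` share a page and end up in one cluster, while `rec3` stays on its own. A union-find structure is used here only because it keeps merging transitive; the paper may use a different merging rule.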
