首页> 外文期刊>Information retrieval >Identifying and exploiting target entity type information for ad hoc entity retrieval
【24h】

Identifying and exploiting target entity type information for ad hoc entity retrieval

机译:识别和利用Ad Hoc实体检索的目标实体类型信息

获取原文
获取原文并翻译 | 示例
           

摘要

Today, the practice of returning entities from a knowledge base in response to search queries has become widespread. One of the distinctive characteristics of entities is that they are typed, i.e., assigned to some hierarchically organized type system (type taxonomy). The primary objective of this paper is to gain a better understanding of how entity type information can be utilized in entity retrieval. We perform this investigation in two settings: firstly, in an idealized oracle setting, assuming that we know the distribution of target types of the relevant entities for a given query; and secondly, in a realistic scenario, where target entity types are identified automatically based on the keyword query. We perform a thorough analysis of three main aspects: (i) the choice of type taxonomy, (ii) the representation of hierarchical type information, and (iii) the combination of type-based and term-based similarity in the retrieval model. Using a standard entity search test collection based on DBpedia, we show that type information can significantly and substantially improve retrieval performance, yielding up to 67% relative improvement in terms of NDCG@10 over a strong text-only baseline in an oracle setting. We further show that using automatic target type detection, we can outperform the text-only baseline by 44% in terms of NDCG@10. This is as good as, and sometimes even better than, what is attainable by using explicit target type information provided by humans. These results indicate that identifying target entity types of queries is challenging even for humans and attests to the effectiveness of our proposed automatic approach.
机译:如今,以响应搜索查询的知识库返回实体的做法已经普遍存在。实体的一个独特特征是它们被键入,即分配给某些分层组织类型系统(类型分类)。本文的主要目的是更好地理解实体检索中可以利用实体​​类型信息。我们在两个设置中执行此调查:首先,在理想化的Oracle设置中,假设我们知道给定查询的目标类型的分发;其次,在一个现实的场景中,其中基于关键字查询自动识别目标实体类型。我们对三个主要方面进行了彻底的分析:(i)类型分类学类型,(ii)分层类型信息的表示,(iii)在检索模型中基于类型的基于术语的相似性的组合。使用基于DBPedia的标准实体搜索测试集合,我们显示类型信息可以显着且大大提高检索性能,在Oracle设置中的强大文本基准中产生高达67%的相对改善。我们进一步表明,使用自动目标类型检测,我们可以在NDCG @ 10方面优于44%的文本基准。这与人类提供的显式目标类型信息可实现的那样良好,有时甚至更好。这些结果表明,识别目标实体类型的查询是甚至对人类的挑战,并证明我们提出的自动方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号