首页> 外文期刊>Information retrieval >Overcoming low-utility facets for complex answer retrieval
【24h】

Overcoming low-utility facets for complex answer retrieval

机译:克服了用于复杂答案的低实用面

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Many questions cannot be answered simply; their answers must include numerous nuanced details and context. Complex Answer Retrieval (CAR) is the retrieval of answers to such questions. These questions can be constructed from a topic entity (e.g., cheese') and a facet (e.g., health effects'). While topic matching has been thoroughly explored, we observe that some facets use general language that is unlikely to appear verbatim in answers, exhibiting low utility. In this work, we present an approach to CAR that identifies and addresses low-utility facets. First, we propose two estimators of facet utility: the hierarchical structure of CAR queries, and facet frequency information from training data. Then, to improve the retrieval performance on low-utility headings, we include entity similarity scores using embeddings trained from a CAR knowledge graph, which captures the context of facets. We show that our methods are effective by applying them to two leading neural ranking techniques, and evaluating them on the TREC CAR dataset. We find that our approach perform significantly better than the unmodified neural ranker and other leading CAR techniques, yielding state-of-the-art results. We also provide a detailed analysis of our results, verify that low-utility facets are indeed difficult to match, and that our approach improves the performance for these difficult queries.
机译:许多问题无法简单地回答;他们的答案必须包括众多细节细节和上下文。复杂的答案检索(汽车)是对这些问题的答案的检索。这些问题可以由主题实体(例如,奶酪')和小平面(例如,健康效果')构成。虽然主题匹配已经彻底探索,但我们观察到一些方面使用普通语言,不太可能出现逐字的曲目,呈现出低效用。在这项工作中,我们提出了一种识别和解决低实用面的汽车方法。首先,我们提出了两个方面实用程序的估算:汽车查询的层次结构,以及来自训练数据的平面频率信息。然后,为了提高低实用标题的检索性能,我们包括使用从汽车知识图表培训的嵌入的实体相似性分数,这捕获了方面的上下文。我们表明我们的方法通过将它们应用于两个主要的神经排名技术,并在TREC Car DataSet上评估它们。我们发现,我们的方法显着优于未修改的神经排名和其他主要的汽车技术,产生最先进的结果。我们还提供了对我们的结果的详细分析,验证低实用面确实难以匹配,而我们的方法可以提高这些困难查询的性能。

著录项

  • 来源
    《Information retrieval》 |2019年第4期|395-418|共24页
  • 作者单位

    Georgetown Univ Comp Sci Dept Informat Retrieval Lab 3700 O St NW Washington DC 20057 USA;

    Max Planck Inst Informat Saarland Informat Campus Saarbrucken Germany;

    Georgetown Univ Comp Sci Dept Informat Retrieval Lab 3700 O St NW Washington DC 20057 USA|Allen Inst Artificial Intelligence Seattle WA USA;

    Georgetown Univ Comp Sci Dept Informat Retrieval Lab 3700 O St NW Washington DC 20057 USA;

    SAP SE Machine Learning R&D Berlin Germany;

    Georgetown Univ Comp Sci Dept Informat Retrieval Lab 3700 O St NW Washington DC 20057 USA;

    Georgetown Univ Comp Sci Dept Informat Retrieval Lab 3700 O St NW Washington DC 20057 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Complex answer retrieval; Knowledge graphs; Neural information retrieval; Reranking;

    机译:复杂答案检索;知识图形;神经信息检索;重新划分;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号