【24h】

Positional Relevance Model for Pseudo-Relevance Feedback

机译:伪相关反馈的位置相关模型

获取原文
获取原文并翻译 | 示例

摘要

Pseudo-relevance feedback is an effective technique for improving retrieval results. Traditional feedback algorithms use a whole feedback document as a unit to extract words for query expansion, which is not optimal as a document may cover several different topics and thus contain much irrelevant information. In this paper, we study how to effectively select from feedback documents those words that are focused on the query topic based on positions of terms in feedback documents. We propose a positional relevance model (PRM) to address this problem in a unified probabilistic way. The proposed PRM is an extension of the relevance model to exploit term positions and proximity so as to assign more weights to words closer to query words based on the intuition that words closer to query words are more likely to be related to the query topic. We develop two methods to estimate PRM based on different sampling processes. Experiment results on two large retrieval datasets show that the proposed PRM is effective and robust for pseudo-relevance feedback, significantly outperforming the relevance model in both document-based feedback and passage-based feedback.
机译:伪相关反馈是一种改善检索结果的有效技术。传统的反馈算法将整个反馈文档作为一个单元来提取单词以进行查询扩展,这并不是最佳选择,因为文档可能涵盖多个不同的主题,因此包含许多不相关的信息。在本文中,我们研究如何基于反馈文档中术语的位置,从反馈文档中有效地选择那些集中在查询主题上的单词。我们提出一种位置相关性模型(PRM),以统一的概率方式解决此问题。所提出的PRM是相关性模型的扩展,以利用词的位置和邻近度,从而基于直觉更接近查询词的词更可能与查询主题相关的直觉为更大的权重分配给更接近查询词的词。我们开发了两种基于不同采样过程估算PRM的方法。在两个大型检索数据集上的实验结果表明,所提出的PRM对于伪相关性反馈是有效且健壮的,在基于文档的反馈和基于段落的反馈中均明显优于相关性模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号