首页> 外文期刊>Information Processing & Management >A query term re-weighting approach using document similarity
【24h】

A query term re-weighting approach using document similarity

机译:使用文档相似度的查询词重新加权方法

获取原文
获取原文并翻译 | 示例
       

摘要

Pseudo-relevance feedback is the basis of a category of automatic query modification techniques. Pseudo-relevance feedback methods assume the initial retrieved set of documents to be relevant. Then they use these documents to extract more relevant terms for the query or just re-weigh the user's original query. In this paper, we propose a straightforward, yet effective use of pseudo-relevance feedback method in detecting more informative query terms and re-weighting them. The query-by-query analysis of our results indicates that our method is capable of identifying the most important keywords even in short queries. Our main idea is that some of the top documents may contain a closer context to the user's information need than the others. Therefore, re-examining the similarity of those top documents and weighting this set based on their context could help in identifying and re-weighting informative query terms. Our experimental results in standard English and Persian test collections show that our method improves retrieval performance, in terms of MAP criterion, up to 7% over traditional query term re-weighting methods.
机译:伪相关反馈是自动查询修改技术类别的基础。伪相关性反馈方法假定初始检索的文档集是相关的。然后他们使用这些文档为查询提取更多相关的术语,或者只是重新称量用户的原始查询。在本文中,我们提出了一种简单而有效的伪相关反馈方法,用于检测更多信息性查询词并对其进行加权。我们对结果的逐查询分析表明,即使在简短查询中,我们的方法也能够识别出最重要的关键字。我们的主要思想是,某些顶级文档可能包含比其他文档更接近用户信息需求的上下文。因此,重新检查这些顶级文档的相似性并根据其上下文对该集合进行加权可以帮助识别和重新加权信息性查询词。我们在标准英语和波斯语测试集中的实验结果表明,相对于传统的查询词重新加权方法,根据MAP准则,该方法将检索性能提高了7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号