首页> 外文OA文献 >A split-list approach for relevance feedback in information retrieval
【2h】

A split-list approach for relevance feedback in information retrieval

机译:信息检索中相关反馈的拆分列表方法

摘要

In this paper we present a new algorithm for relevance feedback (RF) in information retrieval. Unlike conventional RF algorithms which use the top ranked documents for feedback, our proposed algorithm is a kind of active feedback algorithm which actively chooses documents for the user to judge. The objectives are (a) to increase the number of judged relevant documents and (b) to increase the diversity of judged documents during the RF process. The algorithm uses document-contexts by splitting the retrieval list into sub-lists according to the query term patterns that exist in the top ranked documents. Query term patterns include a single query term, a pair of query terms that occur in a phrase and query terms that occur in proximity. The algorithm is an iterative algorithm which takes one document for feedback in each of the iterations. We experiment with the algorithm using the TREC-6, -7, -8, -2005 and GOV2 data collections and we simulate user feedback using the TREC relevance judgements. From the experimental results, we show that our proposed split-list algorithm is better than the conventional RF algorithm and that our algorithm is more reliable than a similar algorithm using maximal marginal relevance.
机译:在本文中,我们提出了一种新的信息检索中的相关反馈(RF)算法。与使用排名靠前的文档进行反馈的常规RF算法不同,我们提出的算法是一种主动反馈算法,可以主动选择文档供用户判断。目标是(a)在RF流程中增加已判断的相关文件的数量,以及(b)增加已判断的文件的多样性。该算法通过根据排名靠前的文档中存在的查询词模式将检索列表分为子列表来使用文档上下文。查询词模式包括单个查询词,在短语中出现的一对查询词和在邻近出现的查询词。该算法是一种迭代算法,在每次迭代中都需要一个文档来进行反馈。我们使用TREC-6,-7,-8,-2005和GOV2数据集对算法进行实验,并使用TREC相关性判断来模拟用户反馈。从实验结果可以看出,我们提出的拆分列表算法优于传统的RF算法,并且比使用最大边际相关性的类似算法更可靠。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号