首页> 外文OA文献 >Exploring sentence level query expansion in language modeling based information retrieval
【2h】

Exploring sentence level query expansion in language modeling based information retrieval

机译:探索基于语言建模的信息检索中的句子级查询扩展

摘要

We introduce two novel methods for query expansion in information retrieval (IR). The basis of these methods is to add the most similar sentences extracted fromudpseudo-relevant documents to the original query. The first method adds a fixed number of sentences to the original query, the second a progressively decreasing number of sentences. We evaluate these methods on the English and Bengali test collections from the FIRE workshops. The majorudfindings of this study are that: i) performance is similar for both English and Bengali; ii) employing a smaller context (similar sentences) yields a considerably higherudmean average precision (MAP) compared to extracting terms from full documents (up to 5.9% improvemnent in MAP forudEnglish and 10.7% for Bengali compared to standard Blind Relevance Feedback (BRF); iii) using a variable number of sentences for query expansion performs better and shows less variance in the best MAP for different parameter settings; iv) query expansion based on sentences canudimprove performance even for topics with low initial retrieval precision where standard BRF fails.
机译:我们介绍了两种新颖的信息检索(IR)查询扩展方法。这些方法的基础是将从 udpseudo相关文档中提取的最相似的句子添加到原始查询中。第一种方法是将固定数量的句子添加到原始查询中,第二种方法是逐渐减少句子的数量。我们在FIRE研讨会的英语和孟加拉语测试集中评估了这些方法。该研究的主要结论是:i)英语和孟加拉语的表现相似; ii)与从完整文档中提取术语相比,采用较小的上下文(相似的句子)会产生更高的 udmean平均精度(MAP)(与标准的盲人相关性反馈相比, udEnglish的MAP最高提高了5.9%,孟加拉语的提高了10.7%) (BRF); iii)使用可变数量的句子进行查询扩展效果更好,并且在针对不同参数设置的最佳MAP中显示较少的差异; iv)即使对于标准BRF失败的初始检索精度较低的主题,基于句子的查询扩展也可以 dimproving性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号