首页> 外文会议>2012 Fourth International Symposium on Information Science and Engineering. >A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents
【24h】

A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents

机译:一种改进伪相关反馈的聚类方法:通过删除嘈杂的文档来提高检索效果

获取原文
获取原文并翻译 | 示例

摘要

Pseudo relevance feedback is an effective technique for improving retrieval results, which assumes a small number of top-ranked documents in the initial retrieval results are relevant and selects from these documents related terms to the query to improve the query representation through query expansion. However, these documents are often a mixture of relevant and irrelevant documents. The relevance feedback is quite effective and performs significantly better than pseudo-relevance feedback, which needs the user explicitly provides information on relevant documents to a query. This paper makes a case for the use of query-specific density clustering in IR on the grounds of improved retrieval effectiveness in a fully automatic manner and without relevance information provided by human and the experimental results show that significant improvements can be obtained on several collections when our new model FWN (Feedback Without Noise) is used.
机译:伪相关反馈是一种改善检索结果的有效技术,它假定初始检索结果中的少量顶级文档是相关的,并从这些文档中选择与查询相关的术语,以通过查询扩展来改善查询的表示性。但是,这些文档通常是相关文档和无关文档的混合。相关性反馈非常有效,并且比伪相关性反馈要好得多,后者需要用户将有关相关文档的信息显式提供给查询。本文以完全自动化的检索效率为基础,无需人工提供相关信息的情况下,在IR中使用特定于查询的密度聚类为例,实验结果表明,当多个集合在我们使用了新型号FWN(无噪音反馈)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号