首页> 外文会议>International Symposium on Information Science and Engineering;International Symposium on Information Processing >A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents
【24h】

A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents

机译:提高伪相关性反馈的聚类方法:通过删除嘈杂的文件来提高检索水力

获取原文

摘要

Pseudo relevance feedback is an effective technique for improving retrieval results, which assumes a small number of top-ranked documents in the initial retrieval results are relevant and selects from these documents related terms to the query to improve the query representation through query expansion. However, these documents are often a mixture of relevant and irrelevant documents. The relevance feedback is quite effective and performs significantly better than pseudo-relevance feedback, which needs the user explicitly provides information on relevant documents to a query. This paper makes a case for the use of query-specific density clustering in IR on the grounds of improved retrieval effectiveness in a fully automatic manner and without relevance information provided by human and the experimental results show that significant improvements can be obtained on several collections when our new model FWN (Feedback Without Noise) is used.
机译:伪相关性反馈是提高检索结果的有效技术,它在初始检索结果中假设少量排名的档案相关,并从这些文档中选择与查询相关的术语,以通过查询扩展来改善查询表示。 但是,这些文件通常是相关和无关文件的混合。 相关性反馈非常有效,并且比伪相关反馈更好地执行,这需要用户明确地向查询提供有关相关文档的信息。 本文以完全自动的方式改善检索效果的原因,在IR中使用查询特异性密度聚类的情况,无论是人类提供的相关信息,实验结果表明,在几个收集时,可以获得显着的改进 我们使用我们的新模型FWN(无噪音的反馈)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号