首页> 外文会议>Asia Information Retrieval Symposium(AIRS 2005); 20051013-15; Jeju Island(KR) >Chinese Document Re-ranking Based on Term Distribution and Maximal Marginal Relevance
【24h】

Chinese Document Re-ranking Based on Term Distribution and Maximal Marginal Relevance

机译:基于术语分布和最大边际相关性的中文文档重排

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a document re-ranking method for Chinese information retrieval where a query is a short natural language description. The method bases on term distribution where each term is weighted by its local and global distribution, including document frequency, document position and term length. The weight scheme lifts off the worry that very fewer relevant documents appear in top retrieved documents, and allows randomly setting a larger portion of the retrieved documents as relevance feedback. It also helps to improve the performance of MMR model in document re-ranking. The experiments show our method can get significant improvement against standard baselines, and outperforms relevant methods consistently.
机译:在本文中,我们提出了一种用于中文信息检索的文档重新排序方法,其中查询是一种简短的自然语言描述。该方法基于术语分布,其中每个术语均以其局部和全局分布(包括文档频率,文档位置和术语长度)加权。权重方案消除了在顶部检索的文档中出现很少相关文档的担心,并允许将检索文档的较大部分随机设置为相关性反馈。它还有助于提高MMR模型在文档重新排序中的性能。实验表明,相对于标准基准,我们的方法可以得到显着改进,并且始终优于相关方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号