首页> 外文会议>International Conference on Machine Learning >Learn to Weight Terms in Information Retrieval Using Category Information
【24h】

Learn to Weight Terms in Information Retrieval Using Category Information

机译:使用类别信息学会信息检索中的重量术语

获取原文

摘要

How to assign appropriate weights to terms is one of the critical issues in information retrieval. Many term weighting schemes are unsupervised. They are either based on the empirical observation in information retrieval, or based on generative approaches for language modeling. As a result, the existing term weighting schemes are usually insufficient in distinguishing informative words from the uninformative ones, which is crucial to the performance of information retrieval. In this paper, we present supervised term weighting schemes that automatically learn term weights based on the correlation between word frequency and category information of documents. Empirical studies with the ImageCLEF dataset have indicated that the proposed methods perform substantially better than the state-of-the-art approaches for term weighting and other alternatives that exploit category information for information retrieval.
机译:如何为术语分配适当的权重是信息检索中的关键问题之一。许多术语加权计划是无监督的。它们是基于信息检索的经验观察,或者基于语言建模的生成方法。结果,现有的术语加权方案通常不足以区分来自无关的信息,这对于信息检索的性能至关重要。在本文中,我们呈现了监督术语加权方案,其基于文字词频率和类别信息之间的相关性自动学习术语权重。具有ImageClef DataSet的实证研究表明,所提出的方法比术语加权和其他替代方案的最先进的方法表现得显着更好,以及用于信息检索的类别信息的其他替代方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号