首页> 外文会议>Advances in information retrieval. >Relation Based Term Weighting Regularization
【24h】

Relation Based Term Weighting Regularization

机译:基于关系的术语加权正则化

获取原文
获取原文并翻译 | 示例

摘要

Traditional retrieval models compute term weights based on only the information related to individual terms such as TF and IDF. However, query terms are related. Intuitively, these relations could provide useful information about the importance of a term in the context of other query terms. For example, query "perl tutorial" specifies that a user look for information relevant to both perl and tutorial. Thus, a document containing both terms should have higher relevance score than the ones with only one of them. However, if the IDF value of "tutorial" is much smaller than "perl", existing retrieval models may assign the document lower score than those containing multiple occurrences of "perl". It is clear that the importance of a term should be dependent on not only collection statistics but also the relations with other query terms. In this work, we study how to utilize semantic relations among query terms to regularize term weighting. Experiment results over TREC collections show that the proposed strategy is effective to improve the retrieval performance.
机译:传统检索模型仅基于与单个术语(例如TF和IDF)有关的信息来计算术语权重。但是,查询词是相关的。直观地,这些关系可以在其他查​​询词的上下文中提供有关该词重要性的有用信息。例如,查询“ perl教程”指定用户查找与perl和教程相关的信息。因此,包含两个术语的文档的相关性得分应高于仅包含两个术语的文档。但是,如果“ tutorial”的IDF值比“ perl”小得多,则现有的检索模型可以为文档分配比包含多次出现的“ perl”的评分更低的分数。很明显,一个术语的重要性不仅应取决于集合统计,而且还应取决于与其他查询术语的关系。在这项工作中,我们研究如何利用查询词之间的语义关系来规范词加权。在TREC集合上的实验结果表明,该策略可有效提高检索性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号