...
首页> 外文期刊>The Journal of Documentation >Understanding inverse document frequency: on theoretical arguments for IDF
【24h】

Understanding inverse document frequency: on theoretical arguments for IDF

机译:了解反文档频率:关于IDF的理论论证

获取原文
获取原文并翻译 | 示例

摘要

The term-weighting function known as IDF was proposed in 1972, and has since been extremely widely used, usually as part of a TF*IDF function. It is often described as a heuristic, and many papers have been written (some based on Shannon's Information Theory) seeking to establish some theoretical basis for it Some of these attempts are reviewed, and it is shown that the Information Theory approaches are problematic, but that there are good theoretical justifications of both IDF and TF*IDF in the traditional probabilistic model of information retrieval.
机译:术语加权函数IDF于1972年提出,此后已被广泛使用,通常作为TF * IDF函数的一部分。它通常被描述为一种启发式方法,并且已经写了许多论文(其中一些基于Shannon的信息论),试图为其建立一些理论基础。其中一些尝试得到了回顾,结果表明,信息论的方法是有问题的,但是在传统的信息检索概率模型中,IDF和TF * IDF都有很好的理论依据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号