首页> 外文会议>Advances in Information Retrieval >Computing Information Retrieval Performance Measures Efficiently in the Presence of TiedScores
【24h】

Computing Information Retrieval Performance Measures Efficiently in the Presence of TiedScores

机译:在TiedScores存在的情况下有效地计算信息检索性能指标

获取原文
获取原文并翻译 | 示例

摘要

The Information Retrieval community uses a variety of performance measures to evaluate the effectiveness of scoring functions. In this paper, we show how to adapt six popular measures - precision, recall, F1, average precision, reciprocal rank, and normalized discounted cumulative gain - to cope with scoring functions that are likely to assign many tied scores to the results of a search. Tied scores impose only a partial ordering on the results, meaning that there are multiple possible orderings of the result set, each one performing differently. One approach to cope with ties would be to average the performance values across all possible result orderings; but unfortunately, generating result permutations requires super-exponential time. The approach presented in this paper computes precisely the same performance value as the approach of averaging over all permutations, but does so as efficiently as the original, tie-oblivious measures.
机译:信息检索社区使用各种绩效指标来评估评分功能的有效性。在本文中,我们将展示如何适应六种流行的度量标准-精度,召回率,F1,平均精度,倒数排名和归一化的折现累积增益-以应对可能会给搜索结果分配许多相关得分的评分功能。捆绑分数仅对结果施加部分排序,这意味着结果集存在多种可能的排序,每个排序的执行方式都不相同。处理关系的一种方法是对所有可能的结果排序中的性能值求平均。但不幸的是,生成结果排列需要超指数的时间。本文介绍的方法与对所有排列进行平均的方法精确地计算出相同的性能值,但其效果与原始的,无约束力的度量一样有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号