...
首页> 外文期刊>ACM Transactions on Information Systems >Cumulated Gain-Based Evaluation of IR Techniques
【24h】

Cumulated Gain-Based Evaluation of IR Techniques

机译:基于增益的IR技术的累积评估

获取原文
获取原文并翻译 | 示例
           

摘要

Modern large retrieval environments tend to overwhelm their users by their large output. Since all documents are not of equal relevance to their users, highly relevant documents should be identified and ranked first for presentation. In order to develop IR techniques in this direction, it is necessary to develop evaluation approaches and methods that credit IR methods for their ability to retrieve highly relevant documents. This can be done by extending traditional evaluation methods, that is, recall and precision based on binary relevance judgments, to graded relevance judgments. Alternatively, novel measures based on graded relevance judgments may be developed. This article proposes several novel measures that compute the cumulative gain the user obtains by examining the retrieval result up to a given ranked position. The first one accumulates the relevance scores of retrieved documents along the ranked result list. The second one is similar but applies a discount factor to the relevance scores in order to devaluate late-retrieved documents. The third one computes the relative-to-the-ideal performance of IR techniques, based on the cumulative gain they are able to yield. These novel measures are defined and discussed and their use is demonstrated in a case study using TREC data: sample system run results for 20 queries in TREG-7. As a relevance base we used novel graded relevance judgments on a four-point scale. The test results indicate that the proposed measures credit IR methods for their ability to retrieve highly relevant documents and allow testing of statistical significance of effectiveness differences. The graphs based on the measures also provide insight into the performance IR techniques and allow interpretation, for example, from the user point of view.
机译:现代大型检索环境往往因其大量输出而使用户不知所措。由于所有文档与用户的相关性均不相同,因此应确定高度相关的文档,并将其排名第一。为了朝这个方向发展信息检索技术,有必要开发一种评价方法和方法,这些方法和方法应归功于信息检索方法检索高度相关文件的能力。这可以通过将传统的评估方法(即基于二元相关性判断的召回率和准确性)扩展到分级相关性判断来完成。或者,可以开发基于分级相关性判断的新颖度量。本文提出了几种新颖的措施,这些措施可以通过检查直到给定排名位置的检索结果来计算用户获得的累积增益。第一个沿排序结果列表累积检索文档的相关性分数。第二个相似,但是对相关性分数应用了折现因子,以便使后期检索的文档贬值。第三个基于IR技术能够产生的累积增益来计算IR技术的相对理想性能。定义和讨论了这些新颖的措施,并在使用TREC数据的案例研究中证明了它们的用途:TREG-7中20个查询的示例系统运行结果。作为相关性基础,我们使用了基于四点量表的新颖的分级相关性判断。测试结果表明,所提议的措施将IR方法归功于其检索高度相关文档并能够测试有效性差异的统计显着性的能力。基于这些度量的图形还可以提供对性能IR技术的深入了解,并可以从用户角度进行解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号