【24h】

Strategic System Comparisons via Targeted Relevance Judgments

机译:通过有针对性的相关性判断的战略制度比较

获取原文

摘要

Relevance judgments are used to compare text retrieval systems. Given a collection of documents and queries, and a set of systems being compared, a standard approach to forming judgments is to manually examine all documents that are highly ranked by any of the systems. However, not all of these relevance judgments provide the same benefit to the final result, particularly if the aim is to identify which systems are best, rather than to fully order them. In this paper we propose new experimental methodologies that can significantly reduce the volume of judgments required in system comparisons. Using rank-biased precision, a recently proposed effectiveness measure, we show that judging around 200 documents for each of 50 queries in a TREC-scale system evaluation containing over 100 runs is sufficient to identify the best systems.
机译:相关性判断用于比较文本检索系统。鉴于文件和查询的集合,并进行了一组系统,形成判断的标准方法是手动检查所有系统高度排名的文件。但是,并非所有这些相关性判断都为最终结果提供了相同的益处,特别是如果目的是确定哪个系统是最佳的,而不是完全订购它们。在本文中,我们提出了新的实验方法,可以显着降低系统比较所需的判断量。使用秩偏置的精度,最近提出的有效性措施,我们表明,在包含超过100个运行的TREC级系统评估中判断50个查询中的每一个文件足以识别最佳系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号