首页> 外文会议>FIRE 2011 >Test Collections and Evaluation Metrics Based on Graded Relevance
【24h】

Test Collections and Evaluation Metrics Based on Graded Relevance

机译:基于分级相关性的测试和评估指标

获取原文

摘要

In modern large information retrieval (IR) environments, the number of documents relevant to a request may easily exceed the number of documents a user is willing to examine. Therefore it is desirable to rank highly relevant documents first in search results. To develop retrieval methods for this purpose requires evaluating retrieval methods accordingly. However, the most IR method evaluations are based on rather liberal and binary relevance assessments. Therefore differences between sloppy and excellent IR methods may not be observed in evaluation. An alternative is to employ graded relevance assessments in evaluation. The present paper discusses graded relevance, test collections providing graded assessments, evaluation metrics based on graded relevance assessments. We shall also examine the effects of using graded relevance assessments in retrieval evaluation, and some evaluation results based on graded relevance. We find that graded relevance provides new insight into IR phenomena and affects the relative merits of IR methods.
机译:在现代大型信息检索(IR)环境中,与请求相关的文档数量可能很容易超过用户愿意检查的文件数量。因此,希望首先在搜索结果中排列高度相关的文件。为此目的开发检索方法需要相应地评估检索方法。但是,大多数IR方法评估都是基于相当自由的和二元相关评估。因此,在评估中可能无法观察到邋x和优异的红外方法之间的差异。另一种方法是在评估中使用评分相关性评估。本文讨论了评分相关性,提供评级评估,基于评分相关性评估的评估指标。我们还应研究在检索评估中使用分级相关性评估的影响,以及基于评分相关性的一些评估结果。我们发现分级相关性为IR现象提供了新的洞察力,并影响红外方法的相对优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号