首页> 外文会议>European Conference on IR Research >Discounted Cumulated Gain Based Evaluation of Multiple-Query IR Sessions
【24h】

Discounted Cumulated Gain Based Evaluation of Multiple-Query IR Sessions

机译:基于累积的多查询IR会话评估的折扣

获取原文

摘要

IR research has a strong tradition of laboratory evaluation of systems. Such research is based on test collections, pre-defined test topics, and standard evaluation metrics. While recent research has emphasized the user viewpoint by proposing user-based metrics and non-binary relevance assessments, the methods are insufficient for truly user-based evaluation. The common assumption of a single query per topic and session poorly represents real life. On the other hand, one well-known metric for multiple queries per session, instance recall, does not capture early (within session) retrieval of (highly) relevant documents. We propose an extension to the Discounted Cumulated Gain (DCG) metric, the Session-based DCG (sDCG) metric for evaluation scenarios involving multiple query sessions, graded relevance assessments, and open-ended user effort including decisions to stop searching. The sDCG metric discounts relevant results from later queries within a session. We exemplify the sDCG metric with data from an interactive experiment, we discuss how the metric might be applied, and we present research questions for which the metric is helpful.
机译:IR研究具有系统实验室评估的一个优良传统。这样的研究是基于测试的集合,预先定义的测试题目和标准评价指标。虽然最近的研究强调通过提出基于用户的指标和非二进制相关评估用户角度来看,这些方法不足以真正基于用户评价。每个主题和会话建立一个查询的普遍看法不佳代表真实的生活。在另一方面,一个众所周知的每个会话,比如召回多个查询指标,就不能过早的(高度)有关文件检索(会话中)捕获。我们建议的扩展,贴现累计收益(DCG)指标,基于会话的DCG(sDCG)度量涉及多个查询会话,分级的相关评估和开放式的用户的努力,包括决定停止搜索评估场景。该sDCG指标折扣从会话中以后的查询相关的结果。我们从交互式实验例证sDCG指标与数据,我们将讨论如何度量可能被应用,而我们目前的研究问题,而该指标是有帮助的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号