首页> 外文会议>Advances in Information Retrieval >Discounted Cumulated Gain Based Evaluation of Multiple-Query IR Sessions
【24h】

Discounted Cumulated Gain Based Evaluation of Multiple-Query IR Sessions

机译:基于折扣累积增益的多查询IR会话评估

获取原文
获取原文并翻译 | 示例

摘要

IR research has a strong tradition of laboratory evaluation of systems. Such research is based on test collections, pre-defined test topics, and standard evaluation metrics. While recent research has emphasized the user viewpoint by proposing user-based metrics and non-binary relevance assessments, the methods are insufficient for truly user-based evaluation. The common assumption of a single query per topic and session poorly represents real life. On the other hand, one well-known metric for multiple queries per session, instance recall, does not capture early (within session) retrieval of (highly) relevant documents. We propose an extension to the Discounted Cumulated Gain (DCG) metric, the Session-based DCG (sDCG) metric for evaluation scenarios involving multiple query sessions, graded relevance assessments, and open-ended user effort including decisions to stop searching. The sDCG metric discounts relevant results from later queries within a session. We exemplify the sDCG metric with data from an interactive experiment, we discuss how the metric might be applied, and we present research questions for which the metric is helpful.
机译:IR研究具有实验室评估系统的悠久传统。此类研究基于测试集合,预定义的测试主题和标准评估指标。尽管最近的研究通过提出基于用户的度量标准和非二进制相关性评估来强调用户观点,但是这些方法不足以进行真正的基于用户的评估。对每个主题和会话进行单个查询的常见假设很难代表现实生活。另一方面,每个会话的多个查询的一个众所周知的度量标准(例如,回忆)不能捕获(高度)相关文档的早期(会话内)检索。我们建议对折扣累计收益(DCG)指标,基于会话的DCG(sDCG)指标进行扩展,以用于涉及多个查询会话,分级相关性评估以及开放性用户努力(包括决定停止搜索)的评估方案。 sDCG指标会取消会话中以后查询的相关结果。我们以互动实验中的数据为例来说明sDCG指标,我们讨论了如何应用该指标,并提出了对指标有帮助的研究问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号