首页> 外文期刊>Information Processing & Management >An in-depth study on diversity evaluation: The importance of intrinsic diversity
【24h】

An in-depth study on diversity evaluation: The importance of intrinsic diversity

机译:关于多样性评估的深入研究:内在多样性的重要性

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Diversified document ranking has been recognized as an effective strategy to tackle ambiguous and/or underspecified queries. In this paper, we conduct an in-depth study on diversity evaluation that provides insights for assessing the performance of a diversified retrieval system. By casting the widely used diversity metrics (e.g., ERR-IA, α-nDCG and D#-nDCG) into a unified framework based on marginal utility, we analyze how these metrics capture extrinsic diversity and intrinsic diversity. Our analyses show that the prior metrics (ERR-IA, α-nDCG and D#-nDCG) are not able to precisely measure intrinsic diversity if we merely feed a set of subtopics into them in a traditional manner (i.e., without fine-grained relevance knowledge per subtopic). As the redundancy of relevant documents with respect to each specific information need (i.e., subtopic) can not be then detected and solved, the overall diversity evaluation may not be reliable. Furthermore, a series of experiments are conducted on a gold standard collection (English and Chinese) and a set of submitted runs, where the intent-square metrics that extend the diversity metrics through incorporating hierarchical subtopics are used as references. The experimental results show that the intent-square metrics disagree with the diversity metrics (ERR-IA and α-nDCG) being used in a traditional way on top-ranked runs, and that the average precision correlation scores between intent-square metrics and the prior diversity metrics (ERR-IA and α-nDCG) are fairly low. These results justify our analyses, and uncover the previously-unknown importance of intrinsic diversity to the overall diversity evaluation.
机译:多样化的文档排名已被认为是解决歧义和/或未指定查询的有效策略。在本文中,我们对多样性评估进行了深入研究,为评估多元化检索系统的性能提供了见识。通过将广泛使用的多样性度量标准(例如ERR-IA,α-nDCG和D#-nDCG)转换为基于边际效用的统一框架,我们分析了这些度量标准如何捕获外部多样性和内在多样性。我们的分析表明,如果我们仅以传统方式(例如,没有细粒度的相关性)将一组子主题提供给它们,则先前的度量标准(ERR-IA,α-nDCG和D#-nDCG)将无法精确地测量内在多样性。每个子主题的知识)。由于随后无法检测和解决有关每个特定信息需求(即子主题)的相关文档的冗余,因此总体多样性评估可能并不可靠。此外,针对黄金标准馆藏(英语和中文)和一组提交的运行进行了一系列实验,其中通过合并分层子主题扩展多样性指标的意图平方指标被用作参考。实验结果表明,意图平方度量与排名靠前的运行中以传统方式使用的多样性度量(ERR-IA和α-nDCG)不一致,并且意图平方度量与度量之间的平均精度相关性得分先前的多样性指标(ERR-IA和α-nDCG)相当低。这些结果证明了我们的分析是正确的,并且揭示了内在多样性对整体多样性评估的先前未知的重要性。

著录项

  • 来源
    《Information Processing & Management》 |2017年第4期|799-813|共15页
  • 作者单位

    Faculty of Library, Information and Media Science, University of Tsukuba, Tsukuba, Japan;

    Department of Social informatics, Graduate School of Informatics, Kyoto University, Kyoto, Japan;

    IRLab, Computer Science Department, University of A Coruna, Spain;

    Research Center for Knowledge Communities, Faculty of Library, Information and Media Science, University of Tsukuba, Tsukuba, Japan;

    School of Computing Science, University of Glasgow, Glasgow, UK;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Extrinsic diversity; Intrinsic diversity; Marginal utility;

    机译:外在多样性;内在多样性;边际效用;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号