首页> 外文OA文献 >Assessors agreement: A case study across assessor type, payment levels, query variations and relevance dimensions
【2h】

Assessors agreement: A case study across assessor type, payment levels, query variations and relevance dimensions

机译:评估者协议:横跨评估者类型,付款水平,查询变化和相关性维度的案例研究

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Relevance assessments are the cornerstone of Information Retrieval evaluation. Yet, there is only limited understanding of how assessment disagreement influences the reliability of the evaluation in terms of systems rankings. In this paper we examine the role of assessor type (expert vs. layperson), payment levels (paid vs. unpaid), query variations and relevance dimensions (topicality and understandability) and their influence on system evaluation in the presence of disagreements across assessments obtained in the different settings. The analysis is carried out in the context of the CLEF 2015 eHealth Task 2 collection and shows that disagreements between assessors belonging to the same group have little impact on evaluation. It also shows, however, that assessment disagreement found across settings has major impact on evaluation when topical relevance is considered, while it has no impact when understandability assessments are considered.
机译:相关性评估是信息检索评估的基石。但是,对于评估分歧如何根据系统排名影响评估可靠性的认识有限。在本文中,我们研究了评估者类型(专家与非专业人员),付款水平(有偿与无偿),查询变量和相关性维度(主题和可理解性)的作用,以及在评估之间存在分歧时它们对系统评估的影响。在不同的设置中。该分析是在CLEF 2015 eHealth Task 2集合的背景下进行的,表明属于同一组的评估者之间的分歧对评估影响很小。但是,它也表明,在考虑主题相关性的情况下,在不同环境中发现的评估分歧对评估有重大影响,而在考虑可理解性评估时则没有影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号