ACM International Conference on Information and Knowledge Management

An Analysis of Systematic Judging Errors in Information Retrieval



Abstract

Test collections are powerful mechanisms for evaluation and optimization of information retrieval systems. There is reported evidence that experiment outcomes can be affected by changes in the judge population or in judging guidelines. We examine such effects in a web search setting, comparing the judgments of four groups of judges: NIST Web Track judges, untrained crowd workers, and two groups of trained judges of a commercial search engine. Our goal is to identify systematic judging errors by comparing the labels contributed by the different groups. In particular, we focus on detecting systematic differences in judging depending on specific characteristics of the queries and URLs. For example, we ask whether a given population of judges, working under a given set of judging guidelines, is more likely to overrate Wikipedia pages than another group judging under the same instructions. Our approach is to identify judging errors with respect to a consensus set, a judged gold set, and a set of user clicks. We further demonstrate how such biases can affect the training of retrieval systems.
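
To make the comparison concrete, here is a minimal illustrative sketch in Python (not the paper's code): it measures each judge group's mean signed deviation from a majority-vote consensus, restricted to URLs with a given property such as being a Wikipedia page, so a positive score suggests the group systematically overrates such pages. All data, group names, and the tie-breaking rule are hypothetical; the paper also uses a judged gold set and user clicks as alternative reference points.

```python
# Illustrative sketch: per-group "overrating" of a URL property
# relative to a majority-vote consensus across all groups.
# All labels and identifiers below are hypothetical.

from collections import Counter
from statistics import mean

# judgments[(query, url)][group] -> relevance label on an ordinal 0-4 scale
judgments = {
    ("jaguar speed", "en.wikipedia.org/wiki/Jaguar"): {
        "nist": 2, "crowd": 4, "trained_a": 3, "trained_b": 3},
    ("jaguar speed", "example.com/jaguar-facts"): {
        "nist": 3, "crowd": 2, "trained_a": 3, "trained_b": 2},
}

def consensus(labels):
    """Majority label across groups; ties broken toward the lower label."""
    counts = Counter(labels)
    top = max(counts.values())
    return min(label for label, c in counts.items() if c == top)

def mean_signed_error(group, predicate):
    """Mean (group label - consensus label) over items matching predicate.
    Positive values indicate the group tends to overrate those items."""
    errors = []
    for (query, url), by_group in judgments.items():
        if not predicate(url):
            continue
        errors.append(by_group[group] - consensus(list(by_group.values())))
    return mean(errors) if errors else 0.0

is_wikipedia = lambda url: "wikipedia.org" in url
for group in ("nist", "crowd", "trained_a", "trained_b"):
    print(group, mean_signed_error(group, is_wikipedia))
```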
