Pooling-based continuous evaluation of information retrieval systems

Abstract

The dominant approach to evaluating the effectiveness of information retrieval (IR) systems is by means of reusable test collections built following the Cranfield paradigm. In this paper, we propose a new IR evaluation methodology based on pooled test collections and on the continuous use of either crowdsourcing or professional editors to obtain relevance judgements. Instead of building a static collection for a finite set of systems known a priori, we propose an IR evaluation paradigm where retrieval approaches are evaluated iteratively on the same collection. Each new retrieval technique obtains its own missing relevance judgements and hence augments the overall set of relevance judgements of the collection. We also propose two metrics, Fairness Score and opportunistic number of relevant documents, which we then use to define new pooling strategies. The goal of this work is to study the behavior of standard IR metrics, of IR system rankings, and of several pooling techniques in a continuous evaluation context by comparing continuous and non-continuous evaluation results on classic test collections. We use both standard and crowdsourced relevance judgements, and we run an actual continuous evaluation campaign over several existing IR systems.