【24h】

How reliable are the results of large-scale information retrieval experiments?

机译:大规模信息检索实验的结果有多可靠?

获取原文

摘要

Two stages in measurement of techniques for informationretrieval are gathering of documents for relevance assessment anduse of the assessments to numerically evaluate effectiveness. Weconsider both of these stages in the context of the TRECexperiments, to determine whether they lead to measurements thatare trustworthy and fair. Our detailed empirical investigation ofthe TREC results shows that the measured relative performance ofsystems appears to be reliable, but that recall is overestimated:it is likely that many relevant documents have not been found. Wepropose a new pooling strategy that can significantly in- creasethe number of relevant documents found for given effort, withoutcompromising fairness.

机译:

信息检索技术的度量的两个阶段是收集文档以进行相关性评估,并使用评估以数字方式评估有效性。我们在TREC实验的背景下考虑这两个阶段,以确定它们是否导致可信赖且公平的测量。我们对TREC结果的详细实证研究表明,测得的系统相对性能似乎是可靠的,但是召回率被高估了:很可能没有找到许多相关的文档。我们提出了一种新的合并策略,该策略可以在不影响公平性的前提下,显着增加在给定工作量下发现的相关文档的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号