On Per-topic Variance in IR Evaluation

机译：投资者关系评估中的主题差异

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We explore the notion, put forward by Cormack & Lynam and Robertson, that we should consider a. document collection used for Cranfield-style experiments as a sample from some larger population of documents. In this view, any per-topic metric (such as average precision) should be regarded as an estimate of that metric's true value for that topic in the full population, and therefore as carrying its own per-topic variance or estimate precision or noise. As in the two mentioned papers, we explore this notion by simulating other samples from the same large population. We investigate different ways of performing this simulation. One use of this analysis is to refine the notion of statistical significance of a difference between two systems (in most such analyses, each per-topic measurement is treated as equally precise). We propose a mixed-effects model method to measure significance, and compare it experimentally with the traditional t-fest.

机译：我们探讨了Cormack＆Lynam和Robertson提出的概念，我们应该考虑a。克兰菲尔德（Cranfield）风格的实验所使用的文档集合，是来自大量文档的样本。按照这种观点，任何按主题进行的度量（例如平均精度）都应视为该主题在整个人群中该主题的真实值的估计，因此应视为带有其按主题的自身变化或估计精度或噪声。就像在上面提到的两篇论文中一样，我们通过模拟来自相同人口的其他样本来探索这一概念。我们研究了执行此模拟的不同方法。该分析的一种用途是完善两个系统之间差异的统计显着性的概念（在大多数此类分析中，每个按主题进行的度量均被视为同等精确）。我们提出了一种混合效果模型方法来测量重要性，并将其与传统的t-fest进行实验比较。

著录项

来源
《International ACM SIGIR conference on research development in information retrieval》|2012年|891-900|共10页
会议地点 Portland OR(US)
作者
Stephen E. Robertson; Evangelos Kanoulas;
展开▼
作者单位

Microsoft Research 7 JJ Thomson Avenue Cambridge CB3 0FB UK;

Information School University of Sheffield Sheffield UK;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
information retrieval; evaluation; statistical precision; significance testing; mixed-effects model; simulation;

机译：信息检索；评估;统计精度；重要性测试；混合效应模型；模拟;

相似文献

外文文献
中文文献
专利

1. The derivation of the NPV variance of a risky capital investment project with first-order autoregressive cash flows and autoregressive conditional heteroscedastic variances [J] . Jean-Paul Paquin, Alain Charbonneau, David Tessier Applied Economics . 2015,第10a12期

机译：具有一阶自回归现金流量和自回归条件异方差的风险资本投资项目的NPV方差的推导
2. Steady-state optimal discrete-time control of first-order systems with actuator noise variance linearly related to actuator signal variance [J] . Krolikowski A. IEEE Transactions on Automatic Control . 1997,第2期

机译：具有与执行器信号方差线性相关的执行器噪声方差的一阶系统的稳态最优离散时间控制
3. Graphical evaluation of robust parameter designs based on extended scaled prediction variance and extended spherical average prediction variance [J] . Oh Jin H., Park Sung H., Kwon Soon S. Communications in Statistics . 2018,第13a15期

机译：基于扩展的比例预测方差和扩展的球面平均预测方差的鲁棒参数设计的图形评估
4. On Per-topic Variance in IR Evaluation [C] . Stephen E. Robertson, Evangelos Kanoulas International ACM SIGIR conference on research development in information retrieval . 2012

机译：关于IR评估的每个主题方差
5. Implementing Different Approaches for Image Receptor Performance Characterization for Digital Radiography Systems: Evaluating the Use of Pixel Variance and Non-Uniformity Analyses [D] . Finley, Caitlin. 2019

机译：实现数字放射成像系统的图像受体性能表征的不同方法：评估像素方差和非均匀性分析的使用
6. Optimization of PCR Condition: The First Study of High Resolution Melting Technique for Screening of APOA1 Variance [O] . Hesty Wahyuningsih, Ferdy K Cayami, Udin Bahrudin, 2017

机译：PCR条件的优化：高分辨率熔解技术筛选APOA1方差的第一个研究
7. Comparative evaluation in the measurement of the radial height, radial inclination, and ulnar variance in fracture distal end radius treated conservatively by closed reduction and cast and closed reduction, Kirschner wire and cast [O] . Rahul R Bagul, Ashwin Deshmukh, Anil Salgia, 2014

机译：通过闭合复位，铸造和闭合复位，克氏针和铸造保守治疗的骨折远端半径径向高度，径向倾斜和尺骨方差测量的对比评价

On Per-topic Variance in IR Evaluation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅