Conference: SPIE Conference on Image Perception, Observer Performance, and Technology Assessment

Semi-parametric Estimation of the Area Under the Precision-Recall Curve


Abstract

Precision and recall are two common metrics used in the evaluation of information retrieval systems. By changing the number of retrieved documents, one can obtain a precision-recall curve. The area under the precision-recall curve (AUCPR) has been suggested as a performance measure for information retrieval systems, in a manner similar to the use of the area under the receiver operating characteristic curve in binary classification. Limited work has been performed in the literature to investigate the bias and variance of AUCPR estimators. The goal of our study was to investigate the bias and variability of a semi-parametric binormal method for estimating the AUCPR, and to compare it to other techniques, such as average precision (AP) and lower trapezoid (LT) approximation. We show how AUCPR can be obtained given the binormal model parameters, and how its variance can be estimated using the delta method. We performed simulation experiments with normal and non-normal data, and investigated the effect of sample size and prevalence. Our results indicated that the semi-parametric binormal approach provided AUCPR estimates with small bias and confidence intervals with acceptable coverage when the sample size was large, and the performance of the binormal model was comparable to or better than alternative methods evaluated in this study when the sample size was small. We conclude that the semi-parametric binormal model can be used to accurately estimate the AUCPR, and that the confidence intervals derived from the model can be at least as accurate as from other alternatives, even for non-normal decision variable distributions.
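
To make the relationship between binormal parameters and AUCPR concrete, the following is a minimal sketch, not the authors' implementation. It assumes the decision variable is N(0, 1) for negatives and N(mu, sigma^2) for positives, takes those parameters as given (the semi-parametric fitting step and the delta-method variance from the abstract are not reproduced), and integrates precision over recall numerically. It also computes average precision (AP) from a ranked sample for comparison. The function names `binormal_aucpr`, `average_precision`, and `simulate_ap` are hypothetical.

```python
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # assumed distribution of negative-class scores


def binormal_aucpr(mu, sigma, prevalence, n_steps=20000):
    """AUCPR when negatives ~ N(0, 1) and positives ~ N(mu, sigma^2).

    Precision is integrated over recall with a midpoint rule.
    """
    positives = NormalDist(mu, sigma)
    total = 0.0
    for i in range(n_steps):
        recall = (i + 0.5) / n_steps
        # Threshold at which a fraction `recall` of positives scores higher.
        threshold = positives.inv_cdf(1.0 - recall)
        fpr = 1.0 - STD_NORMAL.cdf(threshold)
        true_pos = prevalence * recall
        false_pos = (1.0 - prevalence) * fpr
        total += true_pos / (true_pos + false_pos)
    return total / n_steps


def average_precision(scores, labels):
    """Mean of precision@k over the ranks k at which a positive is retrieved."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            precision_sum += hits / rank
    if hits == 0:
        raise ValueError("average precision is undefined without positives")
    return precision_sum / hits


def simulate_ap(n, prevalence, mu, sigma, seed=0):
    """Draw one binormal sample of size n and return its AP estimate."""
    rng = random.Random(seed)
    n_pos = max(1, round(n * prevalence))
    scores = [rng.gauss(mu, sigma) for _ in range(n_pos)]
    scores += [rng.gauss(0.0, 1.0) for _ in range(n - n_pos)]
    labels = [1] * n_pos + [0] * (n - n_pos)
    return average_precision(scores, labels)
```

A quick sanity check on the sketch: with `mu = 0` and `sigma = 1` the two classes are indistinguishable, so precision equals the prevalence at every recall level and `binormal_aucpr` returns the prevalence. Calling `simulate_ap` with a large `n` should give a value close to `binormal_aucpr` for the same parameters, while smaller `n` or lower prevalence lets one observe the bias and variability that the abstract discusses.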
