ROC and AUC with a Binary Predictor: a Potentially Misleading Metric

Muschelli John III

Abstract

In the analysis of binary outcomes, the receiver operator characteristic (ROC) curve is heavily used to show the performance of a model or algorithm. The ROC curve is informative about performance over a series of thresholds and can be summarized by a single number, the area under the curve (AUC). When a predictor is categorical, the number of potential thresholds on the ROC curve is one less than the number of categories; when the predictor is binary, there is only one threshold. As the AUC may be used in decision-making processes to determine the best model, it is important to discuss how it agrees with the intuition from the ROC curve. We discuss how the interpolation of the curve between thresholds with binary predictors can largely change the AUC. Overall, we show that the estimated AUC for a binary predictor corresponds to a linear interpolation of the ROC curve, which is what software most commonly does and which we believe can lead to misleading results. We compare R, Python, Stata, and SAS software implementations. We recommend reporting the interpolation used and discuss the merit of the step function interpolator, also referred to as the "pessimistic" approach by Fawcett (2006).
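The gap between the two interpolations can be illustrated with a minimal sketch. It assumes a 0/1 predictor and 0/1 outcomes, so the ROC curve has the single interior point (FPR, TPR) besides (0, 0) and (1, 1). The function names and the data are hypothetical and are not taken from the paper or from any of the software packages it compares.

```python
def roc_point(y_true, y_pred):
    """Return (fpr, tpr) for a 0/1 predictor scored against 0/1 outcomes."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    positives = sum(1 for t in y_true if t == 1)
    negatives = len(y_true) - positives
    return fp / negatives, tp / positives


def auc_linear(fpr, tpr):
    """Area under straight lines joining (0, 0), (fpr, tpr), and (1, 1)."""
    return 0.5 * fpr * tpr + 0.5 * (1.0 - fpr) * (tpr + 1.0)


def auc_step(fpr, tpr):
    """Area under the step function: 0 until fpr, then tpr until 1."""
    return tpr * (1.0 - fpr)


# Made-up data: 4 positives, 4 negatives, one error in each class.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]

fpr, tpr = roc_point(y_true, y_pred)
print(fpr, tpr)              # 0.25 0.75
print(auc_linear(fpr, tpr))  # 0.75
print(auc_step(fpr, tpr))    # 0.5625
```

With one threshold, the linear-interpolation area reduces to (sensitivity + specificity) / 2, while the step-function area reduces to sensitivity × specificity. The same predictor therefore scores 0.75 under one convention and 0.5625 under the other, which is why stating the interpolation matters.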

Bibliographic details

Source
Journal of Classification | 2020, No. 3 | 13 pages
Author
Muschelli John III
Author affiliation

Johns Hopkins Bloomberg Sch Publ Hlth Dept Biostat Baltimore MD 21205 USA
Original format: PDF
Language of text: English
Chinese Library Classification: Theory and methodology of natural science
Keywords
ROC; AUC; Area under the curve; R

Similar literature

1. A BINARY INFINITESIMAL FORM OF TEICHMÜLLER METRIC AND ANGLES IN AN ASYMPTOTIC TEICHMÜLLER SPACE [J] . Yan WU, Yi QI Acta Mathematica Scientia (English edition) . 2016, No. 002
2. Estimating the Forest Above-ground Biomass Based on Extracted LiDAR Metrics and Predicted Diameter at Breast Height [J] . Petar DONEV, Hong WANG, Shuhong QIN, Journal of Geodesy and Geoinformation Science (English) . 2021, No. 003
3. Estimating the Forest Above-ground Biomass Based on Extracted LiDAR Metrics and Predicted Diameter at Breast Height [J] . Petar DONEV, Hong WANG, Shuhong QIN, Journal of Geodesy and Geoinformation Science (English edition) . 2021, No. 003
4. The use of ROC and AUC in the validation of objective image fusion evaluation metrics [J] . Xiaoli Zhang, Xiongfei Li, Yuncong Feng, Signal processing . 2015, No. octa

5. AUC: a misleading measure of the performance of predictive distribution models [J] . Lobo JM, Jimenez-Valverde A, Real R Global ecology and biogeography . 2008, No. 2

6. Using the area under an estimated ROC curve to test the adequacy of binary predictors [J] . Lieli Robert P., Hsu Yu-Chin Journal of nonparametric statistics . 2019, No. 1a2

7. The Optimization of a Page Rank Based Key Classes Classifier using Simulated Annealing with ROC-AUC and Recall Metrics [C] . Ciprian-Bogdan Chirila, Ioana Şora IEEE International Symposium on Applied Computational Intelligence and Informatics . 2019

8. Learning to rank by maximizing the AUC with linear programming for problems with binary output [D] . Ataman, Kaan 2007

9. Rocker: Open source easy-to-use tool for AUC and enrichment calculations and ROC visualization [O] . Sakari Lätti, Sanna Niinivehmas, Olli T. Pentikäinen 2016

10. ROC and AUC with a Binary Predictor: a Potentially Misleading Metric [O] . John Muschelli 2019

11. Traffic days at AUC. Conference report vol. 1. (Trafikdage paa AUC. Konferencerapport bind 1) [R] . Lahrmann, H. , Hald Pedersen, L. 1994
