A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval

Swamidass, S. Joshua; Azencott, Chloe-Agathe; Daily, Kenny; Baldi, Pierre

首页> 外文期刊>Bioinformatics >A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval

【24h】

A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval

机译：比ROC更强的CROC：测量，可视化和优化早期检索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivation: The performance of classifiers is often assessed using Receiver Operating Characteristic ROC [or (AC) accumulation curve or enrichment curve] curves and the corresponding areas under the curves (AUCs). However, in many fundamental problems ranging from information retrieval to drug discovery, only the very top of the ranked list of predictions is of any interest and ROCs and AUCs are not very useful. New metrics, visualizations and optimization tools are needed to address this 'early retrieval' problem.Results: To address the early retrieval problem, we develop the general concentrated ROC (CROC) framework. In this framework, any relevant portion of the ROC (or AC) curve is magnified smoothly by an appropriate continuous transformation of the coordinates with a corresponding magnification factor. Appropriate families of magnification functions confined to the unit square are derived and their properties are analyzed together with the resulting CROC curves. The area under the CROC curve (AUC[CROC]) can be used to assess early retrieval. The general framework is demonstrated on a drug discovery problem and used to discriminate more accurately the early retrieval performance of five different predictors. From this framework, we propose a novel metric and visualization-the CROC(exp), an exponential transform of the ROC curve-as an alternative to other methods. The CROC(exp) provides a principled, flexible and effective way for measuring and visualizing early retrieval performance with excellent statistical power. Corresponding methods for optimizing early retrieval are also described in the Appendix.Availability: Datasets are publicly available. Python code and command-line utilities implementing CROC curves and metrics are available at http://pypi.python.org/pypi/CROC/Contact: pfbaldi@ics.uci.edu

机译：动机：通常使用接收器工作特性ROC [或（AC）累积曲线或富集曲线]曲线和曲线下的相应区域（AUC）来评估分类器的性能。但是，在从信息检索到药物发现等许多基本问题中，只有排名最高的预测列表才有意义，ROC和AUC并不是很有用。需要新的度量，可视化和优化工具来解决此“早期检索”问题。结果：为了解决早期检索问题，我们开发了通用的集中式ROC（CROC）框架。在此框架中，通过使用相应的放大系数对坐标进行适当的连续变换，可以平滑地放大ROC（或AC）曲线的任何相关部分。导出了限制在单位平方内的适当放大函数族，并分析了它们的特性以及所得的CROC曲线。 CROC曲线下的面积（AUC [CROC]）可用于评估早期检索。该通用框架针对药物发现问题进行了演示，可用于更准确地区分五个不同预测变量的早期检索性能。从这个框架中，我们提出了一种新颖的度量和可视化-CROC（exp），ROC曲线的指数变换，作为其他方法的替代方法。 CROC（exp）提供了一种原则上，灵活而有效的方式来以出色的统计能力来测量和可视化早期检索性能。附录中还描述了优化早期检索的相应方法。可用性：数据集可公开获得。实现CROC曲线和度量的Python代码和命令行实用程序可在http://pypi.python.org/pypi/CROC/Contact：pfbaldi@ics.uci.edu获得

著录项

来源
《Bioinformatics》 |2010年第10期|共9页
作者
Swamidass, S. Joshua; Azencott, Chloe-Agathe; Daily, Kenny; Baldi, Pierre;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物工程学（生物技术）;
关键词
information retrieval; Receiver Operating Characteristic; areas under curve; concentrated ROC; drug discovery problem;

机译：信息检索;接收器工作特性;曲线下面积;集中ROC;药物发现问题;

相似文献

外文文献
中文文献
专利

1. A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval [J] . Swamidass, S. Joshua, Azencott, Chloe-Agathe, Daily, Kenny, Bioinformatics . 2010,第10期

机译：比ROC更强的CROC：测量，可视化和优化早期检索
2. Measuring and Visualizing Strong Magnetic Fields by Means of Indicators Based on Garnet Ferrite Films [J] . E. I. Ilyashenko, L. Z. Lubyanyi, V. N. Samofalov Instruments and Experimental Techniques . 2005,第4期

机译：基于石榴石铁氧体薄膜的指示器测量和可视化强磁场
3. Visualization blackboard-visualizing optimization by multiple processors [J] . Herman G.T., Odhner D. IEEE Computer Graphics and Applications . 1991,第6期

机译：可视化黑板-多个处理器的可视化优化
4. Optimizing EEG Visualization Through Remote Data Retrieval [C] . N. Capp, C. Campbell, T. Elseify, IEEE Signal Processing in Medicine and Biology Symposium . 2018

机译：通过远程数据检索优化EEG可视化
5. Making Difference with Optimization and Big Data: Topics in Power Grid Visualization, Airline Fleet Assignment and Sports Play Retrieval. [D] . Di, Mingyang. 2017

机译：通过优化和大数据实现差异：电网可视化，航空公司机队分配和体育比赛检索等主题。
6. A CROC stronger than ROC: measuring visualizing and optimizing early retrieval [O] . S. Joshua Swamidass, Chloé-Agathe Azencott, Kenny Daily, -1

机译：比ROC更强的CROC：测量可视化和优化早期检索
7. A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval [O] . S. Joshua Swamidass, Kenny Daily, Pierre Baldi 2015

机译：比ROC更强大的CROC：测量，可视化和优化早期检索

A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval

摘要

著录项

相似文献

相关主题

期刊订阅