...
首页> 外文期刊>Computational statistics & data analysis >On the scalability of ordered multi-class ROC analysis
【24h】

On the scalability of ordered multi-class ROC analysis

机译:关于有序多类ROC分析的可伸缩性

获取原文
获取原文并翻译 | 示例
           

摘要

Receiver operating characteristics (ROC) analysis provides a way to select possibly optimal models for discriminating two kinds of objects without the need of specifying the cost or class distribution. It is nowadays established as a standard analysis tool in different domains, including medical decision making, pattern recognition and machine learning. Recently, an extension to the ordered multi-class case has been proposed, in which the concept of a ROC curve is generalized to an r-dimensional surface for r ordered categories, and the volume under this ROC surface (VUS) measures the overall power of a model to classify objects of the various categories. However, the computation of this criterion as well as the U-statistics estimators of its variance and covariance for two models is believed to be complex. New algorithms to compute VUS and its (co)variance estimator are presented. In particular, the volume under the ROC surface can be found very efficiently with a simple dynamic program dominated by a single sorting operation on the data set. For the variance and covariance, the respective estimators are reformulated as a series of recurrent functions over layered data graphs and subsequently these functions are rapidly evaluated with a dynamic program. Simulation experiments confirm that the presented algorithms scale well with respect to the size of the data set and the number of categories. For example, the volume under the ROC surface could be rapidly computed on very large data sets of more than 500 000 instances, while a naive implementation spent much more time on data sets of size less than 1000.
机译:接收机工作特性(ROC)分析提供了一种方法,可以选择可能的最佳模型来区分两种对象,而无需指定成本或类别分布。如今,它已被确立为包括医疗决策,模式识别和机器学习在内的不同领域的标准分析工具。最近,有人提出了对有序多类情况的扩展,其中将ROC曲线的概念推广到r个有序类别的r维表面,并且该ROC面下的体积(VUS)测量了总功率模型以对各种类别的对象进行分类。但是,对于两个模型,此标准的计算以及其方差和协方差的U统计估计量被认为是复杂的。提出了用于计算VUS及其(协)方差估计量的新算法。特别是,可以通过简单的动态程序非常有效地找到ROC表面下的体积,该程序由对数据集的单个排序操作控制。对于方差和协方差,将各个估计量重新构造为分层数据图上的一系列递归函数,然后使用动态程序快速评估这些函数。仿真实验证实,提出的算法相对于数据集的大小和类别的数量可以很好地缩放。例如,可以在超过500 000个实例的非常大的数据集上快速计算ROC面下的体积,而天真的实现在大小小于1000的数据集上花费更多的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号