首页> 外文期刊>Journal of nonparametric statistics >The TreeRank Tournament algorithm for multipartite ranking
【24h】

The TreeRank Tournament algorithm for multipartite ranking

机译:用于多方排名的TreeRank锦标赛算法

获取原文
获取原文并翻译 | 示例
       

摘要

Whereas various efficient learning algorithms have been recently proposed to perform bipartite ranking tasks, cast as receiver operating characteristic (ROC) curve optimisation, no method fully tailored to K-partite ranking when K >= 3 has been documented in the statistical learning literature yet. The goal is to optimise the ROC manifold, or summary criteria such as its volume, the gold standard for assessing performance in K-partite ranking. It is the main purpose of this paper to describe at length an efficient approach to recursive maximisation of the ROC surface, extending the TreeRank methodology originally tailored for the bipartite situation (i.e. when K=2). The main barrier arises from the fact that, in contrast to the bipartite case, the volume under the ROC surface criterion of any scoring rule taking K >= 3 values cannot be interpreted as a cost-sensitive misclassification error and no method is readily available to perform the recursive optimisation stage. The learning algorithm we propose, called TreeRank Tournament (referred to as 'TRT' in the tables), breaks it and builds recursively an ordered partition of the feature space. It defines a piecewise scoring function whose ROC manifold can be remarkably interpreted as a statistical version of an adaptive piecewise linear approximant of the optimal ROC manifold. Rate bounds in sup norm describing the generalisation ability of the scoring rule thus built are established and numerical results illustrating the performance of the TRT approach, compared to that of natural competitors such as aggregation methods, are also displayed.
机译:尽管最近已经提出了各种有效的学习算法来执行两方排名任务,并将其转换为接收器工作特性(ROC)曲线优化,但在统计学习文献中还没有文献记载当K> = 3时完全适合K部分排名的方法。目标是优化ROC集成块或汇总标准(例如其体积),这是评估K零件排名中性能的金标准。本文的主要目的是详细描述ROC曲面的递归最大化的有效方法,以扩展最初为二元情况(即,当K = 2时)量身定制的TreeRank方法。主要障碍来自于以下事实:与二分情况不同,任何采用K> = 3值的评分规则在ROC表面准则下的体积都不能解释为对成本敏感的错误分类错误,并且没有任何方法可用于执行递归优化阶段。我们提出的学习算法称为TreeRank锦标赛(在表中称为“ TRT”),将其分解并递归构建特征空间的有序分区。它定义了一个分段计分函数,其ROC流形可以明显地解释为最优ROC流形的自适应分段线性近似的统计形式。建立了描述这样建立的评分规则的泛化能力的准则范本中的速率界限,并显示了与自然竞争者(如聚合方法)相比,说明TRT方法性能的数值结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号