首页> 外文期刊>JMLR: Workshop and Conference Proceedings >A Probabilistic Theory of Supervised Similarity Learning for Pointwise ROC Curve Optimization
【24h】

A Probabilistic Theory of Supervised Similarity Learning for Pointwise ROC Curve Optimization

机译:逐点ROC曲线优化的监督相似学习概率理论。

获取原文
           

摘要

The performance of many machine learning techniques depends on the choice of an appropriate similarity or distance measure on the input space. Similarity learning (or metric learning) aims at building such a measure from training data so that observations with the same (resp. different) label are as close (resp. far) as possible. In this paper, similarity learning is investigated from the perspective of pairwise bipartite ranking, where the goal is to rank the elements of a database by decreasing order of the probability that they share the same label with some query data point, based on the similarity scores. A natural performance criterion in this setting is pointwise ROC optimization: maximize the true positive rate under a fixed false positive rate. We study this novel perspective on similarity learning through a rigorous probabilistic framework. The empirical version of the problem gives rise to a constrained optimization formulation involving U-statistics, for which we derive universal learning rates as well as faster rates under a noise assumption on the data distribution. We also address the large-scale setting by analyzing the effect of sampling-based approximations. Our theoretical results are supported by illustrative numerical experiments.
机译:许多机器学习技术的性能取决于对输入空间的适当相似度或距离度量的选择。相似性学习(或度量学习)旨在根据训练数据构建这样的度量,以使具有相同(分别为不同)标签的观察结果尽可能接近(分别为远)。在本文中,从成对两部分排名的角度研究了相似性学习,其目的是根据相似性得分,通过降低数据库元素与某些查询数据点共享同一标签的概率来对数据库元素进行排名。在这种情况下,自然的性能标准是逐点ROC优化:在固定的假阳性率下最大化真实阳性率。我们通过严格的概率框架研究这种关于相似性学习的新颖观点。问题的经验形式引起了涉及U统计量的约束优化公式,为此我们可以得出通用学习率以及在数据分布的噪声假设下的更快速率。我们还通过分析基于采样的近似值的影响来解决大规模设置问题。我们的理论结果得到了说明性数值实验的支持。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号