On Tree-Based Methods for Similarity Learning

Abstract

In many situations, the choice of an adequate similarity measure or metric on the feature space largely determines the performance of machine learning methods. Building such measures automatically is the specific purpose of metric/similarity learning. In [21], similarity learning is formulated as a pairwise bipartite ranking problem: ideally, the larger the probability that two observations in the feature space belong to the same class (or share the same label), the higher the similarity measure between them. From this perspective, the ROC curve is an appropriate performance criterion, and the goal of this article is to extend recursive tree-based ROC optimization techniques in order to propose efficient similarity learning algorithms. The validity of such iterative partitioning procedures in the pairwise setting is established by means of results from the theory of U-processes. From a practical angle, the article discusses at length how to implement them by means of splitting rules specifically tailored to the similarity learning task. Beyond these theoretical and methodological contributions, numerical experiments are presented that provide strong empirical evidence of the performance of the proposed algorithmic approaches.
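The pairwise bipartite ranking view described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the helper names (`pairwise_dataset`, `neg_distance`, `pairwise_auc`), the toy data, and the use of negative Euclidean distance as the similarity are all assumptions made for illustration. Each pair of observations is labeled positive when the two labels agree, and a candidate similarity function is scored by the area under its ROC curve over those pairs.

```python
from itertools import combinations
import math


def pairwise_dataset(X, y):
    """Build all unordered pairs (x, x', z), with z = 1 when the labels match."""
    return [
        (X[i], X[j], int(y[i] == y[j]))
        for i, j in combinations(range(len(X)), 2)
    ]


def neg_distance(a, b):
    """Hypothetical similarity: the closer two points, the higher the score."""
    return -math.dist(a, b)


def pairwise_auc(pairs, sim):
    """Empirical AUC of `sim` for separating same-label from different-label pairs.

    Computed as the fraction of (positive pair, negative pair) comparisons in
    which the positive pair receives the higher similarity; ties count 1/2.
    """
    pos = [sim(a, b) for a, b, z in pairs if z == 1]
    neg = [sim(a, b) for a, b, z in pairs if z == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative pair")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (assumed): two well-separated classes in the plane.
X = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (1.1, 1.0)]
y = [0, 0, 1, 1]

pairs = pairwise_dataset(X, y)
auc = pairwise_auc(pairs, neg_distance)
print(f"{len(pairs)} pairs, pairwise AUC = {auc:.2f}")
```

A tree-based method in this setting would learn the similarity function itself by recursively partitioning the space of pairs; the sketch only shows how a given similarity is evaluated under the ROC criterion.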

Bibliographic details

Source
International Conference on Machine Learning, Optimization, and Data Science | 2019 | 772p | 13 pages
Authors
Stephan Clemencon; Robin Vogel
Original format: PDF
Chinese Library Classification: TP181-53
Keywords
Metric-learning; Rate bound analysis; Similarity learning; Tree-based algorithms; U-processes


