【24h】

Similarity Component Analysis

机译:相似性分量分析

获取原文
获取外文期刊封面目录资料

摘要

Measuring similarity is crucial to many learning tasks. To this end, metric learning has been the dominant paradigm. However, similarity is a richer and broader notion than what metrics entail. For example, similarity can arise from the process of aggregating the decisions of multiple latent components, where each latent component compares data in its own way by focusing on a different subset of features. In this paper, we propose Similarity Component Analysis (SCA), a probabilistic graphical model that discovers those latent components from data. In SCA, a latent component generates a local similarity value, computed with its own metric, independently of other components. The final similarity measure is then obtained by combining the local similarity values with a (noisy-)OR gate. We derive an EM-based algorithm for fitting the model parameters with similarity-annotated data from pairwise comparisons. We validate the SCA model on synthetic datasets where SCA discovers the ground-truth about the latent components. We also apply SCA to a multiway classification task and a link prediction task. For both tasks, SCA attains significantly better prediction accuracies than competing methods. Moreover, we show how SCA can be instrumental in exploratory analysis of data, where we gain insights about the data by examining patterns hidden in its latent components' local similarity values.
机译:测量相似性是许多学习任务至关重要。为此,度量学习一直占据主导范式。然而,相似性比指标意味着更丰富和更广泛的概念。例如,相似性可以从聚集多个潜部件,其中每个部件潜以自己的方式通过关注的特征的不同子集进行比较的数据的决定的过程中出现。在本文中,我们提出了相似成分分析(SCA),概率图模型,发现的从数据中那些潜在的组件。在SCA,潜组件生成局部相似值,具有自己的度量来计算独立的其它部件。最终的相似性度量,然后通过局部相似性值与(noisy-)OR门组合而获得。我们推导基于EM的算法拟合从两两比较相似标注的数据模型参数。我们确认在哪里SCA发现地面真相潜成分合成的数据集SCA模型。我们也适用于SCA多路分类任务和链接预测任务。对于这两个任务,SCA无所获显著较好的预测精度最高比竞争对手的方法。此外,我们将展示SCA如何能够在数据,我们通过检查隐藏在其潜在的组件的局部相似值模式获得有关数据洞察的探索性分析工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号