首页> 外文学位 >Beyond Affinity Propagation: Message Passing Algorithms for Clustering.
【24h】

Beyond Affinity Propagation: Message Passing Algorithms for Clustering.

机译:超越亲和力传播:用于群集的消息传递算法。

获取原文
获取原文并翻译 | 示例

摘要

Affinity propagation is an exemplar-based clustering method that takes as input similarities between data points. It outputs a set of data points that best represent the data (exemplars), and assignments of each non-exemplar point to its most appropriate exemplar, thereby partitioning the data set into clusters. The objective of affinity propagation is to maximize the sum of similarities between the data points and their exemplars.;In this thesis, we develop several extensions of affinity propagation. The extensions provide clustering tools that go beyond the capabilities of the basic affinity propagation algorithm, and generalize it to various problems of interest in machine learning. We also investigate alternative approaches to the underlying mechanism of affinity propagation using recent inference techniques that are based on optimization theory.;Affinity propagation was first described using a particular graphical model for the exemplar-based clustering problem. We first provide an alternative graphical model and derivation of affinity propagation, which are more amenable to model manipulation. Building on this representation, we develop capacitated affinity propagation, semi-supervised affinity propagation, and the hierarchical affinity propagation algorithms. We also discuss the relationship of affinity propagation to some canonical problems in combinatorial optimization.;The underlying mechanism of affinity propagation is an approximate inference procedure known as max-product belief propagation. We provide a comparison of affinity propagation to alternative inference techniques such as max-product linear programming, and dual decomposition. We show that for a collection of benchmark data sets, affinity propagation outperforms these more theoretically justified approaches.;We conclude by discussing the contributions and findings of this thesis, and how they relate to current research themes in more general inference problems. We point to several interesting avenues for future research.
机译:相似性传播是一种基于示例的聚类方法,将数据点之间的输入相似性作为输入。它输出一组最能代表数据(示例性)的数据点,并将每个非示例性点分配给其最合适的示例,从而将数据集划分为多个簇。亲和力传播的目的是最大程度地提高数据点与其示例之间的相似度之和。这些扩展提供了超越基本亲和力传播算法功能的聚类工具,并将其推广到机器学习中感兴趣的各种问题。我们还研究了使用基于优化理论的最新推理技术对亲和力传播的潜在机制进行替代的方法。亲和力传播首先使用特定的图形模型描述了基于示例的聚类问题。我们首先提供替代的图形模型和亲和力传播的派生,它们更适合于模型操作。在此表示的基础上,我们开发了容量亲和传播,半监督亲和传播以及分层亲和传播算法。我们还讨论了组合优化中亲和力传播与一些规范问题之间的关系。亲和力传播的基本机制是称为最大乘积置信度传播的近似推理过程。我们将相似性传播与替代推理技术(例如最大乘积线性规划和对偶分解)进行了比较。我们表明,对于一组基准数据集而言,亲和力传播优于这些在理论上更合理的方法。我们通过讨论本论文的贡献和发现以及它们如何与更广泛的推理问题与当前研究主题相关联来得出结论。我们为将来的研究指出了一些有趣的途径。

著录项

  • 作者

    Givoni, Inmar-Ella.;

  • 作者单位

    University of Toronto (Canada).;

  • 授予单位 University of Toronto (Canada).;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 212 p.
  • 总页数 212
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号