首页> 外文会议>Annual Conference on Neural Information Processing Systems >Convex Clustering with Exemplar-Based Models
【24h】

Convex Clustering with Exemplar-Based Models

机译:基于示例模型的凸聚类

获取原文

摘要

Clustering is often formulated as the maximum likelihood estimation of a mixture model that explains the data. The EM algorithm widely used to solve the resulting optimization problem is inherently a gradient-descent method and is sensitive to initialization. The resulting solution is a local optimum in the neighborhood of the initial guess. This sensitivity to initialization presents a significant challenge in clustering large data sets into many clusters. In this paper, we present a different approach to approximate mixture fitting for clustering. We introduce an exemplar-based likelihood function that approximates the exact likelihood. This formulation leads to a convex minimization problem and an efficient algorithm with guaranteed convergence to the globally optimal solution. The resulting clustering can be thought of as a probabilistic mapping of the data points to the set of exemplars that minimizes the average distance and the information-theoretic cost of mapping. We present experimental results illustrating the performance of our algorithm and its comparison with the conventional approach to mixture model clustering.
机译:聚类通常被公式化为解释数据的混合模型的最大似然估计。广泛用于解决最终优化问题的EM算法本质上是梯度下降方法,并且对初始化敏感。所得的解决方案是初始猜测附近的局部最优值。这种对初始化的敏感性在将大型数据集聚集成许多聚类中提出了巨大的挑战。在本文中,我们提出了一种不同的方法来近似用于聚类的混合拟合。我们引入了一个基于样本的似然函数,该函数近似精确的似然。这种表述导致凸最小化问题和有效算法,并保证收敛到全局最优解。可以将所得的聚类视为数据点到示例集的概率映射,该映射将平均距离和映射的信息理论成本最小化。我们提供的实验结果说明了我们算法的性能,并将其与传统的混合模型聚类方法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号