首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Clustering by Sum of Norms: Stochastic Incremental Algorithm, Convergence and Cluster Recovery
【24h】

Clustering by Sum of Norms: Stochastic Incremental Algorithm, Convergence and Cluster Recovery

机译:按规范总和聚类:随机增量算法,收敛性和聚类恢复

获取原文
           

摘要

Standard clustering methods such as K-means, Gaussian mixture models, and hierarchical clustering are beset by local minima, which are sometimes drastically suboptimal. Moreover the number of clusters K must be known in advance. The recently introduced the sum-of-norms (SON) or Clusterpath convex relaxation of k-means and hierarchical clustering shrinks cluster centroids toward one another and ensure a unique global minimizer. We give a scalable stochastic incremental algorithm based on proximal iterations to solve the SON problem with convergence guarantees. We also show that the algorithm recovers clusters under quite general conditions which have a similar form to the unifying proximity condition introduced in the approximation algorithms community (that covers paradigm cases such as Gaussian mixtures and planted partition models). We give experimental results to confirm that our algorithm scales much better than previous methods while producing clusters of comparable quality.
机译:标准聚类方法(例如K均值,高斯混合模型和分层聚类)由局部极小值设置,有时这些局部极小值是次优的。而且,簇数K必须事先知道。最近推出的k均值的范数和(SON)或Clusterpath凸松弛和层次聚类使聚类质心彼此缩近,并确保了唯一的全局最小化器。我们给出了一种基于近端迭代的可扩展的随机增量算法,以解决具有收敛性保证的SON问题。我们还表明,该算法可在相当普遍的条件下恢复群集,这些条件的形式与逼近算法社区中引入的统一接近度条件相似(涵盖了高斯混合和种植分区模型等范例情况)。我们给出实验结果以确认我们的算法在生成质量相当的簇时比以前的方法具有更好的伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号