...
首页> 外文期刊>Journal of machine learning research >Exact Clustering of Weighted Graphs via Semidefinite Programming
【24h】

Exact Clustering of Weighted Graphs via Semidefinite Programming

机译:通过SEMIDEFINITE编程精确聚类加权图

获取原文
           

摘要

As a model problem for clustering, we consider the densest $k$-disjoint-clique problem of partitioning a weighted complete graph into $k$ disjoint subgraphs such that the sum of the densities of these subgraphs is maximized. We establish that such subgraphs can be recovered from the solution of a particular semidefinite relaxation with high probability if the input graph is sampled from a distribution of clusterable graphs. Specifically, the semidefinite relaxation is exact if the graph consists of (k) large disjoint subgraphs, corresponding to clusters, with weight concentrated within these subgraphs, plus a moderate number of nodes not belonging to any cluster. Further, we establish that if noise is weakly obscuring these clusters, i.e, the between-cluster edges are assigned very small weights, then we can recover significantly smaller clusters. For example, we show that in approximately sparse graphs, where the between-cluster weights tend to zero as the size $n$ of the graph tends to infinity, we can recover clusters of size polylogarithmic in $n$ under certain conditions on the distribution of edge weights. Empirical evidence from numerical simulations is also provided to support these theoretical phase transitions to perfect recovery of the cluster structure.
机译:作为群集的模型问题,我们考虑将加权完整图分区为$ k $不相交的子图的密度为k $ -disjoint-clique问题,以便最大化这些子图的密度之和。我们确定这些子图可以通过具有高概率的特定半纤维弛豫的溶液来恢复,如果输入图是从聚类图的分布中采样的。具体地,如果图表由对应于簇的(k )大的脱节子图,则SEMIDEFINITE弛豫是精确的。此外,我们确定如果噪声弱遮挡这些簇,即簇之间的边缘被分配非常小的权重,那么我们可以恢复明显更小的簇。例如,我们表明,在大约稀疏的图形中,当尺寸$ N $倾向于无穷大时,群集权重趋于为零,我们可以在分布的某些条件下以$ n $恢复大小的波动力学群体边缘重量。还提供了来自数值模拟的经验证据,以支持这些理论相位过渡以完善群集结构的恢复。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号