首页> 外文会议>Annual Allerton Conference on Communication, Control, and Computing >Phase transitions and optimal algorithms in high-dimensional Gaussian mixture clustering
【24h】

Phase transitions and optimal algorithms in high-dimensional Gaussian mixture clustering

机译:高维高斯混合聚类中的相变和最优算法

获取原文

摘要

We consider the problem of Gaussian mixture clustering in the high-dimensional limit where the data consists of m points in n dimensions, n,m → ∞ and α = m stays finite. Using exact but non-rigorous methods from statistical physics, we determine the critical value of α and the distance between the clusters at which it becomes information-theoretically possible to reconstruct the membership into clusters better than chance. We also determine the accuracy achievable by the Bayes-optimal estimation algorithm. In particular, we find that when the number of clusters is sufficiently large, r > 4+2√α, there is a gap between the threshold for information-theoretically optimal performance and the threshold at which known algorithms succeed.
机译:我们考虑高维混合聚类的问题,在高维极限中,数据由n个维中的m个点组成,n,m→∞,α= m / n保持有限。使用统计物理学中的精确但非严格的方法,我们确定了α的临界值以及在簇之间的距离,在该距离上信息成为理论上理论上比隶属度更好地将成员重建为簇的可能性。我们还确定了贝叶斯最佳估计算法可达到的精度。尤其是,我们发现,当集群数量足够大时,r> 4 +2√α,信息理论上最佳性能的阈值与已知算法成功的阈值之间存在差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号