首页> 外文会议>Annual ACM-SIAM Symposium on Discrete Algorithms >Approximate Clustering without the Approximation
【24h】

Approximate Clustering without the Approximation

机译:近似聚类,无需近似

获取原文
获取外文期刊封面目录资料

摘要

Approximation algorithms for clustering points in metric spaces is a flourishing area of research, with much research effort spent on getting a better understanding of the approximation guarantees possible for many objective functions such as k-median, k-means, and min-sum clustering. This quest for better approximation algorithms is further fueled by the implicit hope that these better approximations also yield more accurate clusterings. E.g., for many problems such as clustering proteins by function, or clustering images by subject, there is some unknown correct "target" clustering and the implicit hope is that approximately optimizing these objective functions will in fact produce a clustering that is close pointwise to the truth. In this paper, we show that if we make this implicit assumption explicit - that is, if we assume that any capproximation to the given clustering objective Φ is ε-close to the target - then we can produce clusterings that are O(ε)- close to the target, even for values c for which obtaining a c-approximation is NP-hard. In particular, for k-median and k-means objectives, we show that we can achieve this guarantee for any constant c > 1, and for the min-sum objective we can do this for any constant c > 2. Our results also highlight a surprising conceptual difference between assuming that the optimal solution to, say, the k-median objective is ε-close to the target, and assuming that any approximately optimal solution is ε-close to the target, even for approximation factor say c = 1.01. In the former case, the problem of finding a solution that is O(ε)-close to the target remains computationally hard, and yet for the latter we have an efficient algorithm.
机译:在度量空间聚类点逼近算法是研究的一个繁华地段,与花在得到一个更好的理解近似的许多目标函数,如K-位数,K-均值,和最小和聚类,保证可以大量的研究工作。这种追求更好的近似算法是由隐希望这些更好的近似值也能产生更准确的聚类进一步加剧。例如,对于许多问题,如按功能聚类的蛋白质,或群集的被摄体图像,还有一些未知的正确的“目标”集群和隐含的希望是,大约优化这些目标函数实际上将产生集聚接近于逐点的真相。在本文中,我们证明,如果我们把这个隐含的假设明确 - 那就是,如果我们假设任何capproximation给定的聚类目标Φ是ε-接近目标 - 那么我们就可以产生聚类是O(ε) - 接近目标,即使是值c,对于它获得的c-近似是NP难题。特别是,对于k中值和K-手段目标,我们证明了我们能够做到这一点保证了任何常数c> 1,以及最小和目标,我们可以为任何常数c做到这一点> 2.我们也导致亮点一个令人惊讶的假设来,最优解说,第k位的目标是ε-接近目标,并且假设任何近似最优解是ε-接近目标,即使对于近似因子例如c = 1.01之间概念上的差异。在前者的情况下,找到一个解决方案,是O(ε)-close到目标的问题仍然存在计算硬,然而对于后者,我们有一个有效的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号