Approximate Clustering without the Approximation

机译：近似聚类，无需近似

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Approximation algorithms for clustering points in metric spaces is a flourishing area of research, with much research effort spent on getting a better understanding of the approximation guarantees possible for many objective functions such as k-median, k-means, and min-sum clustering. This quest for better approximation algorithms is further fueled by the implicit hope that these better approximations also yield more accurate clusterings. E.g., for many problems such as clustering proteins by function, or clustering images by subject, there is some unknown correct "target" clustering and the implicit hope is that approximately optimizing these objective functions will in fact produce a clustering that is close pointwise to the truth. In this paper, we show that if we make this implicit assumption explicit - that is, if we assume that any capproximation to the given clustering objective Φ is ε-close to the target - then we can produce clusterings that are O(ε)- close to the target, even for values c for which obtaining a c-approximation is NP-hard. In particular, for k-median and k-means objectives, we show that we can achieve this guarantee for any constant c > 1, and for the min-sum objective we can do this for any constant c > 2. Our results also highlight a surprising conceptual difference between assuming that the optimal solution to, say, the k-median objective is ε-close to the target, and assuming that any approximately optimal solution is ε-close to the target, even for approximation factor say c = 1.01. In the former case, the problem of finding a solution that is O(ε)-close to the target remains computationally hard, and yet for the latter we have an efficient algorithm.

机译：在度量空间聚类点逼近算法是研究的一个繁华地段，与花在得到一个更好的理解近似的许多目标函数，如K-位数，K-均值，和最小和聚类，保证可以大量的研究工作。这种追求更好的近似算法是由隐希望这些更好的近似值也能产生更准确的聚类进一步加剧。例如，对于许多问题，如按功能聚类的蛋白质，或群集的被摄体图像，还有一些未知的正确的“目标”集群和隐含的希望是，大约优化这些目标函数实际上将产生集聚接近于逐点的真相。在本文中，我们证明，如果我们把这个隐含的假设明确 - 那就是，如果我们假设任何capproximation给定的聚类目标Φ是ε-接近目标 - 那么我们就可以产生聚类是O（ε） - 接近目标，即使是值c，对于它获得的c-近似是NP难题。特别是，对于k中值和K-手段目标，我们证明了我们能够做到这一点保证了任何常数c> 1，以及最小和目标，我们可以为任何常数c做到这一点> 2.我们也导致亮点一个令人惊讶的假设来，最优解说，第k位的目标是ε-接近目标，并且假设任何近似最优解是ε-接近目标，即使对于近似因子例如c = 1.01之间概念上的差异。在前者的情况下，找到一个解决方案，是O（ε）-close到目标的问题仍然存在计算硬，然而对于后者，我们有一个有效的算法。

著录项

来源
《Annual ACM-SIAM Symposium on Discrete Algorithms》|2009年||共10页
会议地点
作者
Maria-Florina Balcan; Avrim Blum; Anupam Gupta;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301.6-53;
关键词

相似文献

外文文献
中文文献
专利

1. Communication: Random-phase approximation excitation energies from approximate equation-of-motion coupled-cluster doubles [J] . Berkelbach Timothy C. The Journal of Chemical Physics . 2018,第4期

机译：通信：来自近似运动耦合簇双打的随机相位近似激励能量
2. Optical rotation calculations on large molecules using the approximate coupled cluster model CC2 and the resolution-of-the-identity approximation [J] . Daniel H. Friese, Christof Hattig Physical chemistry chemical physics: PCCP . 2014,第13期

机译：使用近似耦合簇模型CC2和恒等分辨率近似对大分子进行旋光计算
3. Calculation of two-photon absorption strengths with the approximate coupled cluster singles and doubles model CC2 using the resolution-of-identity approximation [J] . Daniel H. Friese, Christof Hattig, Kenneth Ruu Physical chemistry chemical physics: PCCP . 2012,第3期

机译：用恒等式近似法计算带有近似耦合簇单双模型CC2的双光子吸收强度
4. Approximate clustering without the approximation [C] . Maria-Florina Balcan, Avrim Blum, Anupam Gupta Annual ACM-SIAM Symposium on Discrete Algorithms;ACM-SIAM Symposium on Discrete Algorithms . 2009

机译：没有近似的近似聚类
5. Best rank-1 approximations without orthogonal invariance for the 1-norm [D] . Vasudevan, Varun A. 2016

机译：1-范数的无正交不变性的最佳秩1近似
6. False Approximations of the Approximate Number System? [O] . Titia Gebuis, Maarten J. van der Smagt 2008

机译：近似数系统的虚假近似？
7. Approximate Clustering without the Approximation [O] . Balcan, Maria-Florina, Blum, Avrim, Gupta, Anupam 2009

机译：没有近似的近似聚类

Approximate Clustering without the Approximation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅