We present new algorithms for the k-means clustering problem. They use a new kd-tree traversal algorithm, supplemented with a novel pruning test, to achieve cost sublinear in both the number of data points and the number of centers. The nodes of the kd-tree are decorated with 'cached sufficient statistics' as in (3). An analysis of the geometry of the current cluster centers then greatly reduces the work needed to update the centers. Our algorithms behave exactly like the traditional k-means algorithm, and proofs of correctness are included. The kd-tree can also be used to initialize the k-means starting centers efficiently. Our algorithms extend easily to fast computation of the error of a given cluster assignment, regardless of the method by which those clusters were obtained. We also show how to use them in a setting that allows approximate clustering results, with the benefit of faster running times.
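To make the idea concrete, here is a minimal sketch (not the paper's implementation) of a kd-tree whose nodes cache sufficient statistics: the point count and the vector sum of the points below the node. During an assignment pass, a conservative ownership test is applied at each node: if one center is provably closer than every other center to the node's entire bounding box, the node's cached statistics are absorbed by that center in O(1), without visiting the individual points. The pruning test below uses simple min/max box-to-center distances and is cruder than the geometric test in the paper; the names `Node` and `update` and the leaf size of 16 are illustrative assumptions.

```python
import numpy as np

class Node:
    """kd-tree node decorated with cached sufficient statistics."""
    def __init__(self, points):
        self.n = len(points)                  # count of points below this node
        self.s = points.sum(axis=0)           # vector sum of those points
        self.lo = points.min(axis=0)          # bounding hyperrectangle
        self.hi = points.max(axis=0)
        self.left = self.right = self.points = None
        if self.n <= 16:                      # leaf: keep the raw points
            self.points = points
        else:                                 # split on the widest dimension
            d = np.argmax(self.hi - self.lo)
            order = points[:, d].argsort()
            mid = self.n // 2
            self.left = Node(points[order[:mid]])
            self.right = Node(points[order[mid:]])

def min_dist2(node, c):
    """Squared distance from center c to the nearest point of the box."""
    d = np.maximum(node.lo - c, 0) + np.maximum(c - node.hi, 0)
    return (d * d).sum()

def max_dist2(node, c):
    """Squared distance from center c to the farthest corner of the box."""
    d = np.maximum(np.abs(c - node.lo), np.abs(c - node.hi))
    return (d * d).sum()

def update(node, centers, sums, counts):
    """One k-means assignment pass, accumulating per-center sums and counts."""
    mins = np.array([min_dist2(node, c) for c in centers])
    maxs = np.array([max_dist2(node, c) for c in centers])
    owner = mins.argmin()
    others = np.delete(mins, owner)
    # Pruning test: if the owner's farthest distance to the box beats every
    # other center's nearest distance, it owns every point inside, so the
    # cached statistics absorb the whole node without a scan.
    if others.size == 0 or maxs[owner] <= others.min():
        sums[owner] += node.s
        counts[owner] += node.n
    elif node.points is not None:             # leaf: fall back to the points
        for p in node.points:
            j = ((centers - p) ** 2).sum(axis=1).argmin()
            sums[j] += p
            counts[j] += 1
    else:
        update(node.left, centers, sums, counts)
        update(node.right, centers, sums, counts)

# Usage: ten Lloyd iterations over 10,000 random 2-D points, k = 3.
rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 2))
centers = X[rng.choice(len(X), size=3, replace=False)].copy()
root = Node(X)
for _ in range(10):
    sums = np.zeros_like(centers)
    counts = np.zeros(len(centers))
    update(root, centers, sums, counts)
    centers = sums / np.maximum(counts, 1)[:, None]
```

Because the pruning test is conservative, the pass assigns every point to its true nearest center, so the iterations match plain k-means exactly; only the amount of work per pass changes.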