...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Fast modified global k-means algorithm for incremental cluster construction
【24h】

Fast modified global k-means algorithm for incremental cluster construction

机译:快速改进的全局k-means算法用于增量聚类构建

获取原文
获取原文并翻译 | 示例

摘要

The k-means algorithm and its variations are known to be fast clustering algorithms. However, they are sensitive to the choice of starting points and are inefficient for solving clustering problems in large datasets. Recently, incremental approaches have been developed to resolve difficulties with the choice of starting points. The global k-means and the modified global k-means algorithms are based on such an approach. They iteratively add one cluster center at a time. Numerical experiments show that these algorithms considerably improve the k-means algorithm. However, they require storing the whole affinity matrix or computing this matrix at each iteration. This makes both algorithms time consuming and memory demanding for clustering even moderately large datasets. In this paper, a new version of the modified global k-means algorithm is proposed. We introduce an auxiliary cluster function to generate a set of starting points lying in different parts of the dataset. We exploit information gathered in previous iterations of the incremental algorithm to eliminate the need of computing or storing the whole affinity matrix and thereby to reduce computational effort and memory usage. Results of numerical experiments on six standard datasets demonstrate that the new algorithm is more efficient than the global and the modified global k-means algorithms.
机译:已知k均值算法及其变体是快速聚类算法。但是,它们对起点的选择敏感,并且对于解决大型数据集中的聚类问题效率不高。最近,已经开发了渐进的方法来解决选择起点的困难。全局k均值和改进的全局k均值算法均基于这种方法。他们一次迭代地添加一个集群中心。数值实验表明,这些算法大大改善了k均值算法。但是,它们需要存储整个亲和力矩阵或在每次迭代时计算此矩阵。这对于群集甚至中等大小的数据集都既耗时又耗费内存。本文提出了一种新的改进的全局k均值算法。我们引入一个辅助聚类函数来生成一组位于数据集不同部分的起点。我们利用在增量算法的先前迭代中收集的信息来消除计算或存储整个亲和矩阵的需要,从而减少了计算工作量和内存使用量。在六个标准数据集上的数值实验结果表明,该新算法比全局算法和改进的全局k均值算法效率更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号