首页> 外文期刊>Expert Systems with Application >A comparative study of efficient initialization methods for the k-means clustering algorithm
【24h】

A comparative study of efficient initialization methods for the k-means clustering algorithm

机译:k均值聚类算法有效初始化方法的比较研究

获取原文
获取原文并翻译 | 示例
           

摘要

K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. In this paper, we first present an overview of these methods with an emphasis on their computational efficiency. We then compare eight commonly used linear time complexity initialization methods on a large and diverse collection of data sets using various performance criteria. Finally, we analyze the experimental results using non-parametric statistical tests and provide recommendations for practitioners. We demonstrate that popular initialization methods often perform poorly and that there are in fact strong alternatives to these methods.
机译:K-means无疑是使用最广泛的分区聚类算法。不幸的是,由于其梯度下降特性,该算法对聚类中心的初始位置非常敏感。已经提出了许多初始化方法来解决这个问题。在本文中,我们首先概述这些方法,重点是它们的计算效率。然后,我们使用各种性能标准,在大量不同的数据集上比较八种常用的线性时间复杂度初始化方法。最后,我们使用非参数统计检验分析实验结果,并为从业人员提供建议。我们证明了流行的初始化方法通常性能较差,并且实际上有很多替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号