A comparative study of efficient initialization methods for the k-means clustering algorithm

M. Emre Celebi; Hassan A. Kingravi; Patricio A. Vela

首页> 外文期刊>Expert Systems with Application >A comparative study of efficient initialization methods for the k-means clustering algorithm

【24h】

A comparative study of efficient initialization methods for the k-means clustering algorithm

机译：k均值聚类算法有效初始化方法的比较研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. In this paper, we first present an overview of these methods with an emphasis on their computational efficiency. We then compare eight commonly used linear time complexity initialization methods on a large and diverse collection of data sets using various performance criteria. Finally, we analyze the experimental results using non-parametric statistical tests and provide recommendations for practitioners. We demonstrate that popular initialization methods often perform poorly and that there are in fact strong alternatives to these methods.

机译：K-means无疑是使用最广泛的分区聚类算法。不幸的是，由于其梯度下降特性，该算法对聚类中心的初始位置非常敏感。已经提出了许多初始化方法来解决这个问题。在本文中，我们首先概述这些方法，重点是它们的计算效率。然后，我们使用各种性能标准，在大量不同的数据集上比较八种常用的线性时间复杂度初始化方法。最后，我们使用非参数统计检验分析实验结果，并为从业人员提供建议。我们证明了流行的初始化方法通常性能较差，并且实际上有很多替代方法。

著录项

来源
《Expert Systems with Application》 |2013年第1期|200-210|共11页
作者
M. Emre Celebi; Hassan A. Kingravi; Patricio A. Vela;
展开▼
作者单位

Department of Computer Science, Louisiana State University, Shrevepon, LA, USA;

School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA;

School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
partitional clustering; sum of squared error criterion; k-means; cluster center initialization;

机译：分区聚类平方误差标准之和;k均值集群中心初始化;

相似文献

外文文献
中文文献
专利

1. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, Computational and mathematical methods in medicine . 2020,第1期

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较
2. Comparative analysis of clustering algorithms comprising GESC, UDCA, and k-Mean methods for wireless sensor networks [J] . Fareeha Zafar, Zaigham Mahmood URSI Radio Science Bulletin . 2011,第4期

机译：包含GESC，UDCA和k-Mean方法的无线传感器网络聚类算法的比较分析
3. Efficient and Fast Initialization Algorithm for K-means Clustering [J] . Mohammed El Agha, Wesam M. Ashour International Journal of Intelligent Systems and Applications . 2012,第1期

机译：K均值聚类的高效快速初始化算法
4. Methods for voltage sag source location by Cluster Algorithm and Decision Rule Labeling with a Comparative Approach of K-means and DBSCAN Clustering Algorithms [C] . Jose Lima Filho, Fabbio Anderson da Silva Borges, Ricardo de Andrade Lira Rabelo, International Conference on Smart and Sustainable Technologies . 2020

机译：聚类算法与决策规则标注的K值均值与DBSCAN聚类算法比较的电压暂降源定位方法
5. Efficient genetic k-means clustering algorithm and its application to data mining on different domains. [D] . Alsayat, Ahmed Mosa. 2016

机译：高效的遗传k均值聚类算法及其在不同领域数据挖掘中的应用。
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. A Comparative Study of Efficient Initialization Methods for the K-Means Clustering Algorithm [O] . Celebi, M. Emre, Kingravi, Hassan A., Vela, Patricio A. 2012

机译：K-means高效初始化方法的比较研究聚类算法

A comparative study of efficient initialization methods for the k-means clustering algorithm

摘要

著录项

相似文献

相关主题

期刊订阅