首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Agglomerative Fuzzy K-Means Clustering Algorithm with Selection of Number of Clusters
【24h】

Agglomerative Fuzzy K-Means Clustering Algorithm with Selection of Number of Clusters

机译:选择簇数的聚集模糊K-均值聚类算法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we present an agglomerative fuzzy $k$-means clustering algorithm for numerical data, an extension to the standard fuzzy $k$-means algorithm by introducing a penalty term to the objective function to make the clustering process not sensitive to the initial cluster centers. The new algorithm can produce more consistent clustering results from different sets of initial clusters centers. Combined with cluster validation techniques, the new algorithm can determine the number of clusters in a data set, which is a well known problem in $k$-means clustering. Experimental results on synthetic data sets (2 to 5 dimensions, 500 to 5000 objects and 3 to 7 clusters), the BIRCH two-dimensional data set of 20000 objects and 100 clusters, and the WINE data set of 178 objects, 17 dimensions and 3 clusters from UCI, have demonstrated the effectiveness of the new algorithm in producing consistent clustering results and determining the correct number of clusters in different data sets, some with overlapping inherent clusters.
机译:在本文中,我们提出了一种用于数值数据的聚集模糊$ k $ -means聚类算法,它是对目标函数引入惩罚项以使聚类过程对数据不敏感的标准模糊$ k $ -means算法的扩展。最初的集群中心。新算法可以从不同的初始聚类中心集产生更一致的聚类结果。结合聚类验证技术,新算法可以确定数据集中的聚类数量,这是$ k $ -means聚类中众所周知的问题。关于合成数据集(2到5个维度,500到5000个对象和3到7个聚类),BIRCH二维数据集(20000个对象和100个聚类)以及WINE数据集(178个对象,17个维度和3个)的实验结果UCI的聚类证明了新算法在产生一致的聚类结果和确定不同数据集中正确数目的聚类(某些具有重叠的固有聚类)方面的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号