首页> 外文期刊>Journal of Global Optimization >A Decision Criterion for the Optimal Number of Clusters in Hierarchical Clustering
【24h】

A Decision Criterion for the Optimal Number of Clusters in Hierarchical Clustering

机译:分层聚类中最优聚类数的决策标准

获取原文
获取原文并翻译 | 示例
       

摘要

Clustering has been widely used to partition data into groups so that the degree of association is high among members of the same group and low among members of different groups. Though many effective and efficient clustering algorithms have been developed and deployed, most of them still suffer from the lack of automatic or online decision for optimal number of clusters. In this paper, we define clustering gain as a measure for clustering optimality, which is based on the squared error sum as a clustering algorithm proceeds. When the measure is applied to a hierarchical clustering algorithm, an optimal number of clusters can be found. Our clustering measure shows good performance producing intuitively reasonable clustering configurations in Euclidean space according to the evidence from experimental results. Furthermore, the measure can be utilized to estimate the desired number of clusters for partitional clustering methods as well. Therefore, the clustering gain measure provides a promising technique for achieving a higher level of quality for a wide range of clustering methods.
机译:聚类已广泛用于将数据划分为组,因此关联程度在同一组成员之间较高,而在不同组成员之间较低。尽管已经开发和部署了许多有效的聚类算法,但大多数算法仍然缺乏针对最佳数量的群集的自动或在线决策的功能。在本文中,我们将聚类增益定义为聚类最佳性的一种度量,它是基于聚类算法进行时的平方误差总和。当将度量应用于分层聚类算法时,可以找到最佳数目的聚类。根据实验结果,我们的聚类度量显示出良好的性能,可以在欧几里得空间中产生直观合理的聚类配置。此外,该方法也可用于估计分区聚类方法所需的聚类数量。因此,聚类增益量度提供了一种有前途的技术,可用于广泛的聚类方法,以达到更高的质量水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号