首页> 外文会议>International Conference on Advanced Computational Intelligence >A validity index method for clusters with different degrees of dispersion and overlap
【24h】

A validity index method for clusters with different degrees of dispersion and overlap

机译:具有不同分散度和重叠度的聚类的有效性指标方法

获取原文

摘要

Cluster validity index Is used for estimating the quality of partitions to a dataset by clustering algorithms, and finding the optimal number of clusters to be partitioned. In this paper, we propose a new validity index, which is based on a dispersion measure and an overlap measure. The dispersion measure estimates the overall data density of the clusters in the dataset; whereas the overlap measure estimates the degree of isolation among all clusters. Low degree of dispersion means that the overall clusters are densely distributed and hence are compact; and low degree of overlap means that clusters are overall well separated. Thus, a good clustering result is expected to have a lower dispersion measure and a lower overlap measure. We conducted several experiments to validate the effectiveness of our validity indexing method, including artificial datasets and public real datasets. Experimental results show that our validity indexing method has superior effectiveness and reliability for estimating the optimal number of clusters that widely differ in degrees of dispersion and overlap, when compared to nine other indices proposed in the literature.
机译:聚类有效性指数用于通过聚类算法评估数据集的分区质量,并找到要分区的最佳聚类数。在本文中,我们提出了一种新的有效性指标,该指标基于分散测度和重叠测度。离散度度量估计数据集中聚类的总体数据密度;而重叠量度则估计所有群集之间的隔离度。分散度低意味着整个群集密集分布,因此紧凑。重叠程度低意味着群集整体上很好地分开了。因此,期望良好的聚类结果具有较低的分散度和较低的重叠度。我们进行了一些实验来验证有效性索引方法的有效性,其中包括人工数据集和公共真实数据集。实验结果表明,与文献中提出的其他九种指标相比,我们的有效性索引方法在估计散布和重叠程度差异很大的最优簇数方面具有优异的有效性和可靠性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号