...
【24h】

Robust cluster validity indexes

机译:稳健的集群有效性指标

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Cluster validity indexes can be used to evaluate the fitness of data partitions produced by a Clustering algorithm. Validity indexes are usually independent of Clustering algorithms. However, the values of validity indexes may be heavily influenced by noise and Outliers. These noise and Outliers may not influence the results from clustering algorithms, but they may affect the values of validity indexes. In the literature, there is little discussion about the robustness of Cluster validity indexes. In this paper, we analyze the robustness of a validity index using the phi function of M-estimate and then propose several robust-type validity indexes. Firstly, we discuss the validity measure oil a single data point and focus on those validity indexes that can be categorized as the mean type of validity indexes. We then propose median-type validity indexes that are robust to noise and Outliers. Comparative examples with numerical and real data sets show that the proposed median-type validity indexes work better than the mean-type validity indexes.
机译:聚类有效性指标可用于评估由聚类算法产生的数据分区的适用性。有效性索引通常独立于聚类算法。但是,有效性指标的值可能会受到噪声和异常值的严重影响。这些噪声和异常值可能不会影响聚类算法的结果,但可能会影响有效性指标的值。在文献中,很少有关于聚类有效性指标的鲁棒性的讨论。在本文中,我们使用M估计的phi函数分析有效性指标的鲁棒性,然后提出几种鲁棒型有效性指标。首先,我们讨论单个数据点上的有效性度量,并集中于那些可以归类为有效性指标的平均类型的有效性指标。然后,我们提出对噪声和异常值具有鲁棒性的中值类型有效性指标。具有数值和实际数据集的比较示例表明,所提出的中位类型有效性指标比均值类型有效性指标更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号