...
首页> 外文期刊>Informatica >Fuzzifier Selection in Fuzzy C-Means from Cluster Size Distribution Perspective
【24h】

Fuzzifier Selection in Fuzzy C-Means from Cluster Size Distribution Perspective

机译:簇大小分布视角下模糊C型方式的模糊选择

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Fuzzy c-means (FCM) is a well-known and widely applied fuzzy clustering method. Although there have been considerable studies which focused on the selection of better fuzzifier values in FCM, there is still not one widely accepted criterion. Also, in practical applications, the distributions of many data sets are not uniform. Hence, it is necessary to understand the impact of cluster size distribution on the selection of fuzzifier value. In this paper, the coefficient of variation (CV) is used to measure the variation of cluster sizes in a data set, and the difference of coefficient of variation (DCV) is the change of variation in cluster sizes after FCM clustering. Then, considering that the fuzzifier value with which FCM clustering produces minor change in cluster variation is better, a criterion for fuzzifier selection in FCM is presented from cluster size distribution perspective, followed by a fuzzifier selection algorithm called CSD-m (cluster size distribution for fuzzifier selection) algorithm. Also, we developed an indicator called Influence Coefficient of Fuzzifier (ICF) to measure the influence of fuzzifier values on FCM clustering results. Finally, experimental results on 8 synthetic data sets and 4 real-world data sets illustrate the effectiveness of the proposed criterion and CSD-m algorithm. The results also demonstrate that the widely used fuzzifier value m = 2 is not optimal for many data sets with large variation in cluster sizes. Based on the relationship between CV_0 and ICF, we further found that there is a linear correlation between the extent of fuzzifier value influence and the original cluster size distributions.
机译:模糊C型方式(FCM)是一种众所周知的广泛应用的模糊聚类方法。虽然已经有相当大的研究,其专注于在FCM中选择更好的模糊值,但仍然没有一个广泛接受的标准。此外,在实际应用中,许多数据集的分布不均匀。因此,有必要了解集群大小分布对模糊值的选择的影响。在本文中,使用变化系数(CV)来测量数据集中的簇大小的变化,并且变化系数(DCV)的差异是在FCM聚类之后簇大小的变化的变化。考虑到,考虑到FCM聚类产生群集变化的次要变化的模糊值更好,FCM中的模糊选择的标准从簇大小分布透视中提出,然后是一个名为CSD-M的模糊选择算法(群集大小分布模糊选择)算法。此外,我们开发了一种称为影响模糊系数(ICF)的指示器,以测量模糊值对FCM聚类结果的影响。最后,在8个合成数据集和4个现实数据集上的实验结果说明了所提出的标准和CSD-M算法的有效性。结果还证明广泛使用的模糊值M = 2对于许多具有群集尺寸的大变化的许多数据集不是最佳的。基于CV_0和ICF之间的关系,我们进一步发现,模糊值影响和原始簇大小分布之间存在线性相关性。

著录项

  • 来源
    《Informatica》 |2019年第3期|613-628|共16页
  • 作者

    Kaile ZHOU; Shanlin YANG;

  • 作者单位

    School of Management Hefei University of Technology Hefei 230009 China Key Laboratory of Process Optimization and Intelligent Decision-Making Ministry of Education Hefei University of Technology Hefei 230009 China;

    School of Management Hefei University of Technology Hefei 230009 China Key Laboratory of Process Optimization and Intelligent Decision-Making Ministry of Education Hefei University of Technology Hefei 230009 China;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    fuzzy c-means; fuzzifier; CSD-m algorithm; cluster size distribution;

    机译:模糊C型;模糊;CSD-M算法;群集大小分布;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号