首页> 外文期刊>Computing and informatics >A COMPARATIVE STUDY OF FUZZY C-MEANS ALGORITHM AND ENTROPY-BASED FUZZY CLUSTERING ALGORITHMS
【24h】

A COMPARATIVE STUDY OF FUZZY C-MEANS ALGORITHM AND ENTROPY-BASED FUZZY CLUSTERING ALGORITHMS

机译:模糊C均值算法与基于熵的模糊聚类算法的比较研究

获取原文
获取原文并翻译 | 示例
           

摘要

Fuzzy clustering is useful to mine complex and multi-dimensional data sets, where the members have partial or fuzzy relations. Among the various deve- loped techniques, fuzzy-C-means (FCM) algorithm is the most popular one, where a piece of data has partial membership with each of the pre-defined cluster centers. Moreover, in FCM, the cluster centers are virtual, that is, they are chosen at random and thus might be out of the data set. The cluster centers and membership values of the data points with them are updated through some iterations. On the other hand, entropy-based fuzzy clustering (EFC) algorithm works based on a similarity-threshold value. Contrary to FCM, in EFC, the cluster centers are real, that is, they are chosen from the data points. In the present paper, the performances of these algorithms have been compared on four data sets, such as IRIS, WINES, OLITOS and psychosis (collected with the help of forty doctors), in terms of the quality of the clusters (that is, discrepancy factor, compactness, distinctness) obtained and their computational time. Moreover, the best set of clusters has been mapped into 2-D for visualization using a self-organizing map (SOM).
机译:模糊聚类对于挖掘成员具有部分或模糊关系的复杂和多维数据集很有用。在各种开发技术中,模糊C均值(FCM)算法是最受欢迎的算法,其中一条数据与每个预定义的聚类中心具有部分隶属关系。此外,在FCM中,群集中心是虚拟的,也就是说,它们是随机选择的,因此可能不在数据集中。群集点和数据点的隶属度值通过一些迭代来更新。另一方面,基于熵的模糊聚类(EFC)算法基于相似度阈值工作。与FCM相反,在EFC中,群集中心是真实的,也就是说,它们是从数据点中选择的。在本文中,就聚类的质量(即差异)而言,已在IRIS,WINES,OLITOS和psychosis(由四十名医生协助收集)这四个数据集上比较了这些算法的性能。系数,紧密度,鲜明度)及其计算时间。此外,已使用自组织映射(SOM)将最佳的群集集映射到2-D以进行可视化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号