首页> 外文期刊>Expert Systems with Application >Applications of an enhanced cluster validity index method based on the Fuzzy C-means and rough set theories to partition and classification
【24h】

Applications of an enhanced cluster validity index method based on the Fuzzy C-means and rough set theories to partition and classification

机译:基于模糊C-均值和粗糙集理论的增强聚类有效性指标方法在分类中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

This study proposes a method of cluster validity index that simultaneously provide the measurements of goodness of clustering on clustered data and of classification accuracy for complicated information sys-tems based upon the PBMF-index method and rough set (RS) theory. The maximum value of this index, called the Huang-index, not only provides the best partitioning, but also obtains the optimal accuracy of classification for the approximation sets. The traditional PBMF-index method is only used to ensure the formation of a small number of compact clusters with large separation between at least two clusters. In contrast to the traditional PBMF-index method, the Huang-index method extends the applications of unsupervised optimal cluster to the fields of classification. In the proposed algorithm, all the attributes of the data are first clustered into groups using the Fuzzy C-means (FCM) method. The clustered data are then used to identify approximate regions and classification accuracy and to calculate centroids of clusters for decision attribute based on the RS theory. Finally, all those calculated data are put into the proposed index method to find the cluster validity index. The validity of the proposed approach is dem-onstrated using the data derived from a hypothetical function of two independent variables and elec-tronic stock data extracted from the financial database maintained by the Taiwan Economic Journal (TEJ). The clustering results obtained using the proposed method are compared with the results obtained using the traditional PBMF-index partition method. The effects of the number of clusters on the partitions of clusters and the RS regions are systematically examined and compared. The results show that the pro-posed Huang-index method not only yields a superior clustering capability than the traditional clustering algorithm, but also yields a reliable classification and obtains a set of suitable decision rules extracted from the RS theory.
机译:这项研究提出了一种聚类有效性指数的方法,该方法同时提供了基于PBMF指数方法和粗糙集(RS)理论的聚类数据在聚类数据上的优良性和复杂信息系统分类精度的度量。该索引的最大值称为Huang索引,不仅可以提供最佳的划分,而且可以为近似集获得最佳的分类精度。传统的PBMF指数方法仅用于确保形成至少两个簇之间间隔较大的少量紧凑簇。与传统的PBMF指数方法相比,Huang指数方法将无监督最优聚类的应用扩展到了分类领域。在提出的算法中,首先使用模糊C均值(FCM)方法将数据的所有属性聚类为组。然后,基于RS理论,将聚类的数据用于识别近似区域和分类精度,并为决策属性计算聚类的质心。最后,将所有这些计算数据放入建议的索引方法中,以找到聚类有效性指标。使用从两个自变量的假设函数得出的数据以及从《台湾经济日报》(TEJ)维护的金融数据库中提取的电子股票数据,证明了该方法的有效性。将使用建议方法获得的聚类结果与使用传统PBMF索引分配方法获得的结果进行比较。系统地检查和比较了群集数对群集分区和RS区域的影响。结果表明,提出的Huang-index方法不仅比传统的聚类算法具有更好的聚类能力,而且分类可靠,并从RS理论中提取了一套合适的决策规则。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号