...
首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Rough subspace-based clustering ensemble for categorical data
【24h】

Rough subspace-based clustering ensemble for categorical data

机译:分类数据的基于子空间的粗糙聚类集成

获取原文
获取原文并翻译 | 示例

摘要

Clustering categorical data arising as an important problem of data mining has recently attracted much attention. In this paper, the problem of unsupervised dimensionality reduction for categorical data is first studied. Based on the theory of rough sets, the attributes of categorical data are decomposed into a number of rough subspaces. A novel clustering ensemble algorithm based on rough subspaces is then proposed to deal with categorical data. The algorithm employs some of rough subspaces with high quality to cluster the data and yields a robust and stable solution by exploiting the resulting partitions. We also introduce a cluster index to evaluate the solution of clustering algorithm for categorical data. Experimental results for selected UCI data sets show that the proposed method produces better results than those obtained by other methods when being evaluated in terms of cluster validity indexes.
机译:对作为数据挖掘的重要问题的分类数据进行聚类最近引起了很多关注。本文首先研究了分类数据的无监督降维问题。基于粗糙集理论,将分类数据的属性分解为多个粗糙子空间。提出了一种基于粗糙子空间的新型聚类集成算法来处理分类数据。该算法使用了一些高质量的粗糙子空间来对数据进行聚类,并通过利用所得的分区来产生鲁棒且稳定的解决方案。我们还引入了聚类索引以评估分类数据聚类算法的解决方案。针对选定的UCI数据集的实验结果表明,在对聚类有效性指标进行评估时,所提出的方法比其他方法获得的结果更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号