...
首页> 外文期刊>Neurocomputing >A weighting k-modes algorithm for subspace clustering of categorical data
【24h】

A weighting k-modes algorithm for subspace clustering of categorical data

机译:用于分类数据子空间聚类的加权k模式算法

获取原文
获取原文并翻译 | 示例
           

摘要

Traditional clustering algorithms consider all of the dimensions of an input data set equally. However, in the high dimensional data, a common property is that data points are highly clustered in subspaces, which means classes of objects are categorized in subspaces rather than the entire space. Subspace clustering is an extension of traditional clustering that seeks to find clusters in different subspaces within a data set. In this paper, a weighting k-modes algorithm is presented for subspace clustering of categorical data and its corresponding time complexity is analyzed as well. In the proposed algorithm, an additional step is added to the k-modes clustering process to automatically compute the weight of all dimensions in each cluster by using complement entropy. Furthermore, the attribute weight can be used to identify the subsets of important dimensions that categorize different clusters. The effectiveness of the proposed algorithm is demonstrated with real data sets and synthetic data sets.
机译:传统的聚类算法会平等地考虑输入数据集的所有维度。但是,在高维数据中,一个共同的特性是数据点在子空间中高度聚类,这意味着对象类在子空间中而不是整个空间中分类。子空间聚类是传统聚类的扩展,它试图在数据集中的不同子空间中找到聚类。针对分类数据的子空间聚类,提出了一种加权k模式算法,并分析了其对应的时间复杂度。在提出的算法中,在k模式聚类过程中增加了一个附加步骤,以通过使用互补熵自动计算每个聚类中所有维度的权重。此外,属性权重可用于识别对不同聚类进行分类的重要维子集。实际数据集和综合数据集证明了该算法的有效性。

著录项

  • 来源
    《Neurocomputing》 |2013年第2期|23-30|共8页
  • 作者单位

    School of Computer and Information Technology, Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University,Taiyuan, 030006 Shanxi, China;

    School of Computer and Information Technology, Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University,Taiyuan, 030006 Shanxi, China;

    School of Computer and Information Technology, Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University,Taiyuan, 030006 Shanxi, China;

    School of Computer and Information Technology, Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University,Taiyuan, 030006 Shanxi, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    subspace clustering; weight; k-modes algorithm; categorical data;

    机译:子空间聚类;重量;k模式算法;分类数据;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号