首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
【24h】

Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data

机译:基于子空间随机化和图融合的高维数据谱聚类

获取原文

摘要

Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for spectral clustering), which often lack the ability to go beyond a single graph to explore multiple graphs built in various sub-spaces in high-dimensional space. To address this, this paper presents a new spectral clustering approach based on subspace randomization and graph fusion (SC-SRGF) for high-dimensional data. In particular, a set of random subspaces are first generated by performing random sampling on the original feature space. Then, multiple K-nearest neighbor (K-NN) affinity graphs are constructed to capture the local structures in the generated subspaces. To fuse the multiple affinity graphs from multiple subspaces, an iterative similarity network fusion scheme is utilized to achieve a unified graph for the final spectral clustering. Experiments on twelve real-world high-dimensional datasets demonstrate the superiority of the proposed approach.
机译:子空间聚类近年来因其在处理高维数据方面的潜力而受到越来越多的关注。但是,大多数现有的子空间聚类方法倾向于仅利用子空间信息来构建单个亲和图(通常用于频谱聚类),而这些亲和图通常缺乏超越单个图来探索在各种子空间中构建的多个图的能力。在高维空间中。为了解决这个问题,本文提出了一种基于子空间随机化和图融合(SC-SRGF)的高维数据谱聚类新方法。特别地,首先通过对原始特征空间执行随机采样来生成一组随机子空间。然后,构造多个K最近邻(K-NN)亲和图以捕获生成的子空间中的局部结构。为了融合来自多个子空间的多个亲和度图,使用迭代相似性网络融合方案来实现用于最终频谱聚类的统一图。在十二个现实世界的高维数据集上进行的实验证明了该方法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号