Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data

机译：基于子空间随机化和图融合的高维数据谱聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for spectral clustering), which often lack the ability to go beyond a single graph to explore multiple graphs built in various sub-spaces in high-dimensional space. To address this, this paper presents a new spectral clustering approach based on subspace randomization and graph fusion (SC-SRGF) for high-dimensional data. In particular, a set of random subspaces are first generated by performing random sampling on the original feature space. Then, multiple K-nearest neighbor (K-NN) affinity graphs are constructed to capture the local structures in the generated subspaces. To fuse the multiple affinity graphs from multiple subspaces, an iterative similarity network fusion scheme is utilized to achieve a unified graph for the final spectral clustering. Experiments on twelve real-world high-dimensional datasets demonstrate the superiority of the proposed approach.

机译：子空间聚类近年来因其在处理高维数据方面的潜力而受到越来越多的关注。但是，大多数现有的子空间聚类方法倾向于仅利用子空间信息来构建单个亲和图（通常用于频谱聚类），而这些亲和图通常缺乏超越单个图来探索在各种子空间中构建的多个图的能力。在高维空间中。为了解决这个问题，本文提出了一种基于子空间随机化和图融合（SC-SRGF）的高维数据谱聚类新方法。特别地，首先通过对原始特征空间执行随机采样来生成一组随机子空间。然后，构造多个K最近邻（K-NN）亲和图以捕获生成的子空间中的局部结构。为了融合来自多个子空间的多个亲和度图，使用迭代相似性网络融合方案来实现用于最终频谱聚类的统一图。在十二个现实世界的高维数据集上进行的实验证明了该方法的优越性。

著录项

来源
《Pacific-Asia Conference on Knowledge Discovery and Data Mining》|2020年|330-342|共13页
会议地点
作者
Xiaosha Cai; Dong Huang; Chang-Dong Wang; Chee-Keong Kwoh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data clustering; Spectral clustering; Subspace clustering; High-dimensional data; Random subspaces; Graph fusion;

机译：数据聚类;光谱聚类;子空间聚类;高维数据;随机子空间;图融合;

相似文献

外文文献
中文文献
专利

1. Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications (Advanced Science, Engineering and Medicine, Vol. 8(9), pp. 749–757 (2016)) [J] . Baghernia Ali, Pavin Hamid, Mirnabibaboli Miresmail, Advanced Science, Engineering and Medicine . 2017,第7期

机译：聚类高维数据流：生物信息学应用中预计集群的子空间聚类调查（高级科学，工程和医学，Vol.8（9），PP。749-757（2016））
2. ERRATUM: Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications [J] . Ali Baghernia, Hamid Pavin, Miresmail Mirnabibaboli, Advanced Science, Engineering and Medicine . 2017,第7期

机译：erratum：群集高维数据流：生物信息学应用中的子空间聚类调查，投影群集
3. Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications [J] . Ali Baghernia, Hamid Pavin, Miresmail Mirnabibaboli, Advanced Science, Engineering and Medicine . 2016,第9期

机译：聚类高维数据流：子空间聚类调查，生物信息学应用的预测聚类调查
4. Large scale hyperspectral data segmentation by random spatial subspace clustering [C] . Yi Guo, Junbin Gao, Feng Li IEEE International Geoscience and Remote Sensing Symposium . 2013

机译：随机空间子空间聚类的大规模高光谱数据分割
5. High-dimensional data mining: Subspace clustering, outlier detection and applications to classification. [D] . Foss, Andrew Philip Ogilvie. 2010

机译：高维数据挖掘：子空间聚类，离群值检测和分类应用。
6. Dimensionality Reduction and Subspace Clustering in Mixed Reality for Condition Monitoring of High-Dimensional Production Data [O] . Burkhard Hoppenstedt, Manfred Reichert, Klaus Kammerer, 2019

机译：混合现实中的降维和子空间聚类用于高维生产数据的状态监测
7. Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data [O] . Xiaosha Cai, Dong Huang, Chang-Dong Wang, 2020

机译：通过子空间随机化和曲线图融合的光谱簇，用于高维数据

Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data

摘要

著录项

相似文献

相关主题

期刊订阅