Subspace Clustering for High-Dimensional Data Using Cluster Structure Similarity

Kavan Fatehi; Mohsen Rezvani; Mansoor Fateh; Mohammad-Reza Pajoohan

Abstract

Because of the curse of dimensionality in high-dimensional data, a significant amount of recent research has addressed subspace clustering, which aims to discover clusters embedded in any possible combination of attributes. The main goal of subspace clustering algorithms is to find all clusters in all subspaces. Previous approaches have mostly generated redundant subspace clusters, which reduces clustering accuracy and increases running time. This article proposes a bottom-up, density-based approach in which the cluster structure serves as a similarity measure for generating optimal subspaces, which raises the accuracy of the subspace clustering. Based on this idea, the algorithm discovers similar subspaces by comparing their cluster structures, combines them, and clusters the data again in the new subspaces. Finally, the algorithm determines all the subspaces and finds all clusters within them. Experiments on various synthetic and real datasets show that the proposed approach yields significantly better quality and runtime than state-of-the-art methods for clustering high-dimensional data.
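The bottom-up idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the density criterion or the similarity measure. The sketch assumes a DBSCAN-style clustering with a per-attribute radius `eps`, and it assumes that "cluster structure similarity" can be approximated by the Jaccard overlap of co-clustered point pairs. All function names, parameters, and the merge threshold are hypothetical.

```python
from itertools import combinations

def density_cluster(points, dims, eps, min_pts):
    """DBSCAN-style clustering using only the attributes in `dims`.
    Two points are neighbours if they differ by at most `eps` in every
    attribute of `dims` (an assumed distance). Returns labels, -1 = noise."""
    n = len(points)

    def neighbors(i):
        return [j for j in range(n)
                if all(abs(points[i][d] - points[j][d]) <= eps for d in dims)]

    labels = [None] * n
    cluster_id = 0
    for i in range(n):
        if labels[i] is not None:
            continue
        nbrs = neighbors(i)
        if len(nbrs) < min_pts:
            labels[i] = -1          # provisionally noise
            continue
        labels[i] = cluster_id
        queue = [j for j in nbrs if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster_id   # noise becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster_id
            j_nbrs = neighbors(j)
            if len(j_nbrs) >= min_pts:   # core point: keep expanding
                queue.extend(j_nbrs)
        cluster_id += 1
    return labels

def co_clustered_pairs(labels):
    """Set of point pairs that share a (non-noise) cluster."""
    return {(i, j) for i, j in combinations(range(len(labels)), 2)
            if labels[i] != -1 and labels[i] == labels[j]}

def structure_similarity(labels_a, labels_b):
    """Assumed similarity: Jaccard overlap of co-clustered pairs."""
    a, b = co_clustered_pairs(labels_a), co_clustered_pairs(labels_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def merge_similar_subspaces(points, eps, min_pts, threshold):
    """Start from single attributes, repeatedly merge the first pair of
    subspaces whose cluster structures are similar, and re-cluster the
    data in the merged subspace. Returns {subspace: labels}."""
    n_dims = len(points[0])
    subspaces = {(d,): density_cluster(points, (d,), eps, min_pts)
                 for d in range(n_dims)}
    merged = True
    while merged:
        merged = False
        for s, t in combinations(sorted(subspaces), 2):
            if structure_similarity(subspaces[s], subspaces[t]) >= threshold:
                u = tuple(sorted(set(s) | set(t)))
                labels = density_cluster(points, u, eps, min_pts)
                del subspaces[s], subspaces[t]
                subspaces[u] = labels
                merged = True
                break   # restart the scan over the updated subspaces
    return subspaces
```

Merging only subspaces whose clusterings already agree is one way to avoid enumerating every attribute combination, which is where redundant subspace clusters tend to arise. The pairwise comparison here is quadratic in the number of points, so the sketch is only suitable for small illustrative inputs.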

Bibliographic details

Source
International Journal of Intelligent Information Technologies | 2018, Issue 3 | 18 pages
Authors
Kavan Fatehi; Mohsen Rezvani; Mansoor Fateh; Mohammad-Reza Pajoohan
Author affiliations

Yazd University, Department of Computer Engineering, Yazd, Islamic Republic of Iran;

Shahrood University of Technology, Department of Computer Engineering, Shahrood, Islamic Republic of Iran;

Shahrood University of Technology, Department of Computer Engineering, Shahrood, Islamic Republic of Iran;

Yazd University, Department of Computer Engineering, Yazd, Islamic Republic of Iran;

Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory
Keywords
Algorithm; Cluster Similarity; High Dimensional Data; Subspace Clustering;

Similar literature

1. Subspace Clustering for High-Dimensional Data Using Cluster Structure Similarity [J]. Kavan Fatehi, Mohsen Rezvani, Mansoor Fateh. International Journal of Intelligent Information Technologies. 2018, Issue 3

2. ERRATUM: Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications [J]. Ali Baghernia, Hamid Pavin, Miresmail Mirnabibaboli. Advanced Science, Engineering and Medicine. 2017, Issue 7

3. Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications [J]. Ali Baghernia, Hamid Pavin, Miresmail Mirnabibaboli. Advanced Science, Engineering and Medicine. 2016, Issue 9

4. Subspace search and visualization to make sense of alternative clusterings in high-dimensional data [C]. Tatu Andrada, Maas Fabian, Farber Ines. IEEE Conference on Visual Analytics Science & Technology 2012. 2012

5. High-dimensional data mining: Subspace clustering, outlier detection and applications to classification. [D]. Foss, Andrew Philip Ogilvie. 2010

6. Dimensionality Reduction and Subspace Clustering in Mixed Reality for Condition Monitoring of High-Dimensional Production Data [O]. Burkhard Hoppenstedt, Manfred Reichert, Klaus Kammerer. 2019

7. Subspace Search and Visualization to Make Sense of Alternative Clusterings in High-Dimensional Data [O]. Tatu Andrada, Maaß Fabian, Färber Ines. 2012
