首页> 外文会议>2018 IEEE Data Science Workshop >SPARSE SUBSPACE CLUSTERING WITH MISSING AND CORRUPTED DATA
【24h】

SPARSE SUBSPACE CLUSTERING WITH MISSING AND CORRUPTED DATA

机译:缺少和损坏的数据的稀疏子集群

获取原文
获取原文并翻译 | 示例

摘要

In many settings, we can accurately model high-dimensional data as lying in a union of subspaces. Subspace clustering is the process of inferring the subspaces and determining which point belongs to each subspace. In this paper we study a robust variant of sparse subspace clustering (SSC) [1]. While SSC is well-understood when there is little or no noise, less is known about SSC under significant noise or missing entries. We establish clustering guarantees in the presence of corrupted or missing entries. We give explicit bounds on the amount of additive noise and the number of missing entries the algorithm can tolerate, both in deterministic settings and in a random generative model. Our analysis shows that this method can tolerate up to$O(n/d)$ missing entries per column instead of$O(n/d^{2})$ as previous analyses show, where we have$d$-dimensional subspaces in an$n$-dimensional ambient space. Moreover, our method and analysis work by simply filling in the missing entries with zeros and do not need to know the location of missing entries.
机译:在许多情况下,我们可以将高维数据准确地建模为位于子空间的并集中。子空间聚类是推断子空间并确定哪个点属于每个子空间的过程。在本文中,我们研究了稀疏子空间聚类(SSC)的强大变体[1]。尽管很少或根本没有噪声时,对SSC的理解是很好的,但是在噪声很大或条目丢失的情况下,对SSC的了解却很少。我们在存在损坏或丢失的条目的情况下建立聚类保证。在确定性设置和随机生成模型中,我们对加性噪声的数量和算法可以容忍的缺失项的数量给出明确的界限。我们的分析表明,该方法最多可以容忍每列$ O(n / d)$个丢失的条目,而不是像以前的分析所示的那样容忍$ O(n / d ^ {2})$,其中我们有$ d $维子空间在一个n维空间中。而且,我们的方法和分析通过简单地用零填充缺失条目而工作,而无需知道缺失条目的位置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号