首页> 外文会议>IEEE Data Science Workshop >SPARSE SUBSPACE CLUSTERING WITH MISSING AND CORRUPTED DATA
【24h】

SPARSE SUBSPACE CLUSTERING WITH MISSING AND CORRUPTED DATA

机译:稀疏的子空间群集与缺失和损坏的数据

获取原文

摘要

In many settings, we can accurately model high-dimensional data as lying in a union of subspaces. Subspace clustering is the process of inferring the subspaces and determining which point belongs to each subspace. In this paper we study a robust variant of sparse subspace clustering (SSC) [1]. While SSC is well-understood when there is little or no noise, less is known about SSC under significant noise or missing entries. We establish clustering guarantees in the presence of corrupted or missing entries. We give explicit bounds on the amount of additive noise and the number of missing entries the algorithm can tolerate, both in deterministic settings and in a random generative model. Our analysis shows that this method can tolerate up to$O(n/d)$ missing entries per column instead of$O(n/d^{2})$ as previous analyses show, where we have$d$-dimensional subspaces in an$n$-dimensional ambient space. Moreover, our method and analysis work by simply filling in the missing entries with zeros and do not need to know the location of missing entries.
机译:在许多设置中,我们可以将高维数据准确地显示为位于子空间的联合中。子空间群集是推断子空间并确定属于每个子空间的哪个点的过程。在本文中,我们研究了一种稀疏子空间聚类的强大变体(SSC)[1]。当SSC很清楚时,当噪音很少或没有噪声时,在显着的噪声或缺失的条目下,关于SSC的较少。我们在存在损坏或遗失的条目存在时建立聚类保证。我们在附加噪声的数量和算法可以在确定性设置和随机生成模型中提供缺失条目数量的显式界限。我们的分析表明,此方法可以容忍最多$ O(n / d)$每列的$丢失条目而不是$ o(n / d ^ {2})$作为之前的分析显示,我们拥有$ -dimensional子空间在$ n $ -dimimentional环境空间。此外,我们的方法和分析工作通过简单地填写零零的缺失条目,不需要知道丢失条目的位置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号