首页> 外文会议>Annual Allerton Conference on Communication, Control, and Computing >Necessary and sufficient conditions for sketched subspace clustering
【24h】

Necessary and sufficient conditions for sketched subspace clustering

机译:速写子空间聚类的必要和充分条件

获取原文

摘要

This paper is about an interesting phenomenon: two r-dimensional subspaces, even if they are orthogonal to one an other, can appear identical if they are only observed on a subset of coordinates. Understanding this phenomenon is of particular importance for many modern applications of subspace clustering where one would like to subsample in order to improve computational efficiency. Examples include real-time video surveillance and datasets so large that cannot even be stored in memory. In this paper we introduce a new metric between subspaces, which we call partial coordinate discrepancy. This metric captures a notion of similarity between subsampled subspaces that is not captured by other distance measures between subspaces. With this, we are able to show that subspace clustering is theoretically possible in lieu of coherence assumptions using only r + 1 rows of the dataset at hand. This gives precise information-theoretic necessary and sufficient conditions for sketched subspace clustering. This can greatly improve computational efficiency without compromising performance. We complement our theoretical analysis with synthetic and real data experiments.
机译:本文是关于一个有趣的现象:两个R维子空间,即使它们与另一个垂直,如果仅在坐标的子集上观察到它们,则可以看起来相同。了解这种现象对子空间聚类的许多现代应用特别重要,其中一个人想要提高计算效率。示例包括实时视频监控和数据集,如此之大,甚至无法存储在内存中。在本文中,我们在子空间之间介绍了一个新的度量,我们调用部分坐标差异。该度量标准捕获子页面不被子空间之间的其他距离测量捕获的子空间之间的相似性的概念。有了这个,我们能够理论上表明子空间群集是可能的,代替手头的数据集只使用R + 1行的连贯假设。这为草图子空间聚类提供了精确的信息 - 理论必要和充分条件。这可以大大提高计算效率而不会影响性能。我们通过合成和实际数据实验补充我们的理论分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号