首页> 外文会议>Annual Allerton Conference on Communication, Control, and Computing >Necessary and sufficient conditions for sketched subspace clustering
【24h】

Necessary and sufficient conditions for sketched subspace clustering

机译:草绘的子空间聚类的充要条件

获取原文

摘要

This paper is about an interesting phenomenon: two r-dimensional subspaces, even if they are orthogonal to one an other, can appear identical if they are only observed on a subset of coordinates. Understanding this phenomenon is of particular importance for many modern applications of subspace clustering where one would like to subsample in order to improve computational efficiency. Examples include real-time video surveillance and datasets so large that cannot even be stored in memory. In this paper we introduce a new metric between subspaces, which we call partial coordinate discrepancy. This metric captures a notion of similarity between subsampled subspaces that is not captured by other distance measures between subspaces. With this, we are able to show that subspace clustering is theoretically possible in lieu of coherence assumptions using only r + 1 rows of the dataset at hand. This gives precise information-theoretic necessary and sufficient conditions for sketched subspace clustering. This can greatly improve computational efficiency without compromising performance. We complement our theoretical analysis with synthetic and real data experiments.
机译:本文是关于一个有趣的现象的:两个r维子空间,即使它们彼此正交,即使仅在一个坐标子集上观察也可以看起来相同。了解这种现象对于子空间聚类的许多现代应用特别重要,在这些应用中,人们希望对子空间进行子采样以提高计算效率。示例包括实时视频监视和如此庞大的数据集,甚至无法将其存储在内存中。在本文中,我们介绍了子空间之间的新度量标准,我们称其为部分坐标差异。该度量捕获子采样的子空间之间的相似性的概念,而子空间之间的其他距离度量未捕获该相似性的概念。这样,我们就能证明,仅使用手头的数据集的r + 1行,就可以在理论上实现子空间聚类来代替相干性假设。这为草绘的子空间聚类提供了精确的信息理论必要条件和充分条件。这可以极大地提高计算效率,而不会影响性能。我们通过合成和真实数据实验来补充理论分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号