首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Oracle Based Active Set Algorithm for Scalable Elastic Net Subspace Clustering
【24h】

Oracle Based Active Set Algorithm for Scalable Elastic Net Subspace Clustering

机译:基于Oracle的可扩展弹性网子空间聚类主动集算法。

获取原文

摘要

State-of-the-art subspace clustering methods are based on expressing each data point as a linear combination of other data points while regularizing the matrix of coefficients with ℓ1, ℓ2 or nuclear norms. ℓ1 regularization is guaranteed to give a subspace-preserving affinity (i.e., there are no connections between points from different subspaces) under broad theoretical conditions, but the clusters may not be connected. ℓ2 and nuclear norm regularization often improve connectivity, but give a subspace-preserving affinity only for independent subspaces. Mixed ℓ1, ℓ2 and nuclear norm regularizations offer a balance between the subspace-preserving and connectedness properties, but this comes at the cost of increased computational complexity. This paper studies the geometry of the elastic net regularizer (a mixture of the ℓ1 and ℓ2 norms) and uses it to derive a provably correct and scalable active set method for finding the optimal coefficients. Our geometric analysis also provides a theoretical justification and a geometric interpretation for the balance between the connectedness (due to ℓ2 regularization) and subspace-preserving (due to ℓ1 regularization) properties for elastic net subspace clustering. Our experiments show that the proposed active set method not only achieves state-of-the-art clustering performance, but also efficiently handles large-scale datasets.
机译:最新的子空间聚类方法基于将每个数据点表示为其他数据点的线性组合,同时用ℓ1,,2或核范数对系数矩阵进行正则化。 broad1正则化可确保在宽泛的理论条件下提供保留子空间的亲和力(即,来自不同子空间的点之间没有连接),但可能无法连接聚类。 and2和核规范正则化通常会改善连通性,但仅对独立子空间提供保留子空间的亲和力。 ℓ1,,2和核规范的混合正则化在子空间保留和连通性之间取得了平衡,但这是以增加计算复杂性为代价的。本文研究了弹性净正则器(the1和ℓ2范数的混合体)的几何形状,并使用它来推导可证明正确和可扩展的主动集方法,以找到最佳系数。我们的几何分析还为弹性净子空间聚类的连通性(由于ℓ2正则化)和保留子空间(由于ℓ1正则化)性质之间的平衡提供了理论依据和几何解释。我们的实验表明,提出的主动集方法不仅可以实现最新的聚类性能,而且可以有效地处理大规模数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号