【24h】

Orthogonal NMF through Subspace Exploration

机译:通过子空间探索的正交NMF

获取原文

摘要

Orthogonal Nonnegative Matrix Factorization (ONMF) aims to approximate a nonnegative matrix as the product of two k-dimensional nonnegative factors, one of which has orthonormal columns. It yields potentially useful data representations as superposition of disjoint parts, while it has been shown to work well for clustering tasks where traditional methods underperform. Existing algorithms rely mostly on heuristics, which despite their good empirical performance, lack provable performance guarantees. We present a new ONMF algorithm with provable approximation guarantees. For any constant dimension k, we obtain an additive EPTAS without any assumptions on the input. Our algorithm relies on a novel approximation to the related Non-negative Principal Component Analysis (NNPCA) problem; given an arbitrary data matrix, NNPCA seeks k nonnegative components that jointly capture most of the variance. Our NNPCA algorithm is of independent interest and generalizes previous work that could only obtain guarantees for a single component. We evaluate our algorithms on several real and synthetic datasets and show that their performance matches or outperforms the state of the art.
机译:正交非负矩阵分解(ONMF)旨在将非负矩阵近似为两个k维非负因子的乘积,其中两个具有正交列。它产生了可能有用的数据表示形式,即不相交部分的叠加,而事实证明,它可以很好地用于传统方法表现不佳的聚类任务。现有算法主要依靠启发式算法,尽管它们具有良好的经验性能,但缺乏可证明的性能保证。我们提出了具有可证明的近似保证的新ONMF算法。对于任何恒定的尺寸k,我们无需输入就可以得到加法的EPTAS。我们的算法基于对相关的非负主成分分析(NNPCA)问题的新颖近似;在给定任意数据矩阵的情况下,NNPCA会寻找k个非负分量,以共同捕获大部分方差。我们的NNPCA算法具有独立的利益,它概括了以前只能为单个组件获得保证的工作。我们在几个真实的和综合的数据集上评估了我们的算法,并证明了它们的性能与现有技术相匹配或优于现有技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号