首页> 外文期刊>Bernoulli: official journal of the Bernoulli Society for Mathematical Statistics and Probability >Subspace perspective on canonical correlation analysis: Dimension reduction and minimax rates
【24h】

Subspace perspective on canonical correlation analysis: Dimension reduction and minimax rates

机译:典型相关分析的子空间透视:尺寸减少和最小值率

获取原文
获取原文并翻译 | 示例
           

摘要

Canonical correlation analysis (CCA) is a fundamental statistical tool for exploring the correlation structure between two sets of random variables. In this paper, motivated by the recent success of applying CCA to learn low dimensional representations of high dimensional objects, we propose two losses based on the principal angles between the model spaces spanned by the sample canonical variates and their population correspondents, respectively. We further characterize the non-asymptotic error bounds for the estimation risks under the proposed error metrics, which reveal how the performance of sample CCA depends adaptively on key quantities including the dimensions, the sample size, the condition number of the covariance matrices and particularly the population canonical correlation coefficients. The optimality of our uniform upper bounds is also justified by lower-bound analysis based on stringent and localized parameter spaces. To the best of our knowledge, for the first time our paper separates p(1) and p(2) for the first order term in the upper bounds without assuming the residual correlations are zeros. More significantly, our paper derives (1 - lambda(2)(k))( 1 - lambda(2)(k+1))/(lambda(k) - lambda(k+1))(2) for the first time in the non-asymptotic CCA estimation convergence k+1 rates, which is essential to understand the behavior of CCA when the leading canonical correlation coefficients are close to 1.
机译:规范相关性分析(CCA)是一种用于探索两组随机变量之间的相关结构的基本统计工具。在本文中,通过初步施加CCA来学习高尺寸物体的低尺寸表示的成功的动机,我们提出了基于示例规范变异的模型空间与其人口对应者之间的模型空间之间的主要角度。我们进一步表征了在所提出的误差指标下的估计风险的非渐近误差界限,这揭示了样本CCA的性能如何自适应地取决于包括尺寸,样本大小,协方差矩阵的条件数量的关键量,特别是人口规范相关系数。通过基于严格和局部参数空间的较低分析,我们均匀上限的最优性也是合理的。据我们所知,首次我们的纸张将P(1)和P(2)分离在上限的第一阶项而不假设残余相关是零。更重要的是,我们的纸张衍生(1 - λ(2)(k))(1 - λ(2)(k + 1))/(Lambda(k) - lambda(k + 1))(2)是第一个非渐近CCA估计会聚K + 1率的时间,对于理解CCA的行为是必要的,当领先的规范相关系数接近1时是必不可少的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号