首页> 外文期刊>Markov Processes and Related Fields >Analytical Formula for Large Eigenvalues of Sample Covariance Matrix in Infinite Dimensional Case
【24h】

Analytical Formula for Large Eigenvalues of Sample Covariance Matrix in Infinite Dimensional Case

机译:Analytical Formula for Large Eigenvalues of Sample Covariance Matrix in Infinite Dimensional Case

获取原文
获取原文并翻译 | 示例
       

摘要

In Modern Data Science the estimated covariance matrix or Sample Covariance, plays an important role. Indeed many of the main Machine Learning algorithms, take it as input. In that context, it is often referred to as Statistical Kernel. With high dimensional data, there is often a large discrepancy between the covariance matrix and its estimate. This error often causes machine learning algorithms to fail, in what is referred to as "the curse of dimensionality" . We present a simple analytical formula for the distortion between the spectrum of the covariance matrix and estimated covariance matrix, which we prove for eigenvalues with size above a certain order. The traditional approach is based on Free Probability Theory and often performs poorly with real-life high-dimensional data. Also, in real-life data the spectrum of the covariance matrix usually contains eigenvalues with different orders of magnitudes at the same time: Free Probability Theory does a priori not apply in that situation. This is the first of a series of ongoing articles, each for a different cases in which our simple analytic formula holds.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号