...
首页> 外文期刊>Journal of Agricultural, Biological, and Environmental Statistics >Hypothesis Tests for Principal Component Analysis When Variables are Standardized
【24h】

Hypothesis Tests for Principal Component Analysis When Variables are Standardized

机译:当变量标准化时主成分分析的假设试验

获取原文
获取原文并翻译 | 示例
           

摘要

In principal component analysis (PCA), the first few principal components possibly reveal interesting systematic patterns in the data, whereas the last may reflect random noise. The researcher may wonder how many principal components are statistically significant. Many methods have been proposed for determining how many principal components to retain in the model, but most of these assume non-standardized data. In agricultural, biological and environmental applications, however, standardization is often required. This article proposes parametric bootstrap methods for hypothesis testing of principal components when variables are standardized. Unlike previously proposed methods, the proposed parametric bootstrap methods do not rely on any asymptotic results requiring large dimensions. In a simulation study, the proposed parametric bootstrap methods for standardized data were compared with parallel analysis for PCA and methods using the Tracy-Widom distribution. Parallel analysis performed well when testing the first principal component, but was much too conservative when testing higher-order principal components not reflecting random noise. When variables are standardized, the Tracy-Widom distribution may not approximate the distribution of the largest eigenvalue. The proposed parametric bootstrap methods maintained the level of significance approximately and were up to twice as powerful as the methods using the Tracy-Widom distribution. SAS and R computer code is provided for the recommended methods.
机译:在主成分分析(PCA)中,最前几个主要成分可能会发现数据中有趣的系统模式,而最后可能反映随机噪声。研究人员可能想知道有多少主成分是有统计学意义的。已经提出了许多方法来确定要在模型中保留的主要成分数量,但大多数这些假设非标准化数据。然而,在农业,生物和环境应用中,通常需要标准化。本文提出了在变量标准化时主组件的假设检测的参数引导方法。与先前提出的方法不同,所提出的参数释放方法不依赖于需要大维度的任何渐近结果。在仿真研究中,将所提出的标准化数据的参数自引导方法与PCA和方法的平行分析进行了比较,使用TRACY-WIDOM分布。在测试第一主成分时,并行分析良好,但在测试不反映随机噪声的高阶主成分时,太保守了太保守。当变量标准化时,特雷西 - 拓展分布可能不会近似于最大的特征值的分布。所提出的参数释放方法大致保持了重要性的级别,并且达到了使用Tracy-Widom分布的方法的两倍。为推荐的方法提供了SAS和R计算机代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号