【24h】

Optimal rank-based tests for Common Principal Components

机译:通用主成分的最佳基于等级的测试

获取原文
获取原文并翻译 | 示例
           

摘要

This paper provides optimal testing procedures for the m-sample null hypothesis of Common Principal Components (CPC) under possibly non-Gaussian and heterogeneous elliptical densities. We first establish, under very mild assumptions that do not require finite moments of order four, the local asymptotic normality (LAN) of the model. Based on that result, we show that the pseudo-Gaussian test proposed in Hallin et al. (J. Nonparametr. Stat. 22 (2010) 879-895) is locally and asymptotically optimal under Gaussian densities, and show how to compute its local powers. A numerical evaluation of those powers, however, reveals that, while remaining valid, this test is poorly efficient away from the Gaussian. Moreover, it still requires finite moments of order four. We therefore propose rank-based procedures that remain valid under any possibly heterogeneous m-tuple of elliptical densities, irrespective of the existence of any moments. In elliptical families, indeed, principal components naturally can be based on the scatter matrices characterizing the density contours, hence do not require finite variances. Those rank-based tests, as usual, involve score functions, which may or may not be associated with a reference density at which they achieve optimality. A major advantage of our rank tests is that they are not only validity-robust, in the sense of surviving arbitrary elliptical population densities: unlike their pseudo-Gaussian counterparts, they also are efficiency-robust, in the sense that their local powers do not deteriorate away from the reference density at which they are optimal. We show, in particular, that in the homokurtic case, their normal-score version uniformly dominates, in the Pitman sense, the aforementioned pseudo-Gaussian generalization of Flury's test. Theoretical results are obtained via a nonstandard application of Le Cam's methodology in the context of curved LAN experiments. The finite-sample properties of the proposed tests are investigated via simulations.
机译:本文为可能存在非高斯和非均质椭圆密度的公共主成分(CPC)的m样本零假设提供了最佳测试程序。我们首先在不需要四阶有限矩的非常温和的假设下,建立模型的局部渐近正态性(LAN)。基于该结果,我们证明了Hallin等人提出的伪高斯检验。 (J. Nonparametr。Stat。22(2010)879-895)在高斯密度下是局部和渐近最优的,并展示了如何计算其局部幂。然而,对这些能力的数值评估表明,尽管保持有效,但该测试在远离高斯的情况下效率很低。而且,它仍然需要四阶的有限矩。因此,我们提出了一种基于等级的规程,该规程在任何可能的椭圆密度m元组下都有效,而与任何时刻的存在无关。实际上,在椭圆族中,主成分自然可以基于表征密度轮廓的散布矩阵,因此不需要有限的方差。通常,那些基于等级的测试涉及得分函数,这些得分函数可能会或可能不会与达到最佳状态的参考密度相关联。我们的等级测试的主要优势在于,它们在生存于任意椭圆人口密度的意义上不仅具有有效性,而且具有鲁棒性:与伪高斯模型不同的是,它们在效率上也具有鲁棒性,因为它们的本地力量不会偏离最佳值的参考密度会变差。我们特别表明,在同调的情况下,在Pitman的意义上,它们的正态分数版本统一主导了Flury检验的上述伪高斯泛化。理论结果是通过在弯曲的局域网实验中通过非标准应用Le Cam方法获得的。拟议的测试的有限样本属性通过仿真进行了研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号