首页> 外文期刊>Statistics and computing >Large-scale kernel methods for independence testing
【24h】

Large-scale kernel methods for independence testing

机译:用于独立测试的大规模内核方法

获取原文
获取原文并翻译 | 示例
           

摘要

Representations of probability measures in reproducing kernel Hilbert spaces provide a flexible framework for fully nonparametric hypothesis tests of independence, which can capture any type of departure from independence, including nonlinear associations and multivariate interactions. However, these approaches come with an at least quadratic computational cost in the number of observations, which can be prohibitive in many applications. Arguably, it is exactly in such large-scale datasets that capturing any type of dependence is of interest, so striking a favourable trade-off between computational efficiency and test performance for kernel independence tests would have a direct impact on their applicability in practice. In this contribution, we provide an extensive study of the use of large-scale kernel approximations in the context of independence testing, contrasting block-based, Nystrom and random Fourier feature approaches. Through a variety of synthetic data experiments, it is demonstrated that our large-scale methods give comparable performance with existing methods while using significantly less computation time and memory.
机译:再现内核希尔伯特空间中的概率测度表示为独立性的完全非参数假设检验提供了灵活的框架,该检验可以捕获独立性的任何类型的偏离,包括非线性关联和多元交互。但是,这些方法在观察数量上至少要有二次计算量,这在许多应用中是无法实现的。可以说,正是在这样的大规模数据集中,捕获任何类型的依存关系才是令人感兴趣的,因此,在内核独立性测试的计算效率和测试性能之间达成有利的折衷将直接影响其在实践中的适用性。在此贡献中,我们提供了在独立性测试,对比基于块的Nystrom和随机傅立叶特征方法的背景下使用大规模核近似的广泛研究。通过各种综合数据实验,证明了我们的大规模方法可以提供与现有方法相当的性能,同时使用更少的计算时间和内存。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号