Large-scale kernel methods for independence testing

Zhang Qinyi; Filippi Sarah; Gretton Arthur; Sejdinovic Dino

首页> 外文期刊>Statistics and computing >Large-scale kernel methods for independence testing

【24h】

Large-scale kernel methods for independence testing

机译：用于独立测试的大规模内核方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Representations of probability measures in reproducing kernel Hilbert spaces provide a flexible framework for fully nonparametric hypothesis tests of independence, which can capture any type of departure from independence, including nonlinear associations and multivariate interactions. However, these approaches come with an at least quadratic computational cost in the number of observations, which can be prohibitive in many applications. Arguably, it is exactly in such large-scale datasets that capturing any type of dependence is of interest, so striking a favourable trade-off between computational efficiency and test performance for kernel independence tests would have a direct impact on their applicability in practice. In this contribution, we provide an extensive study of the use of large-scale kernel approximations in the context of independence testing, contrasting block-based, Nystrom and random Fourier feature approaches. Through a variety of synthetic data experiments, it is demonstrated that our large-scale methods give comparable performance with existing methods while using significantly less computation time and memory.

机译：再现内核希尔伯特空间中的概率测度表示为独立性的完全非参数假设检验提供了灵活的框架，该检验可以捕获独立性的任何类型的偏离，包括非线性关联和多元交互。但是，这些方法在观察数量上至少要有二次计算量，这在许多应用中是无法实现的。可以说，正是在这样的大规模数据集中，捕获任何类型的依存关系才是令人感兴趣的，因此，在内核独立性测试的计算效率和测试性能之间达成有利的折衷将直接影响其在实践中的适用性。在此贡献中，我们提供了在独立性测试，对比基于块的Nystrom和随机傅立叶特征方法的背景下使用大规模核近似的广泛研究。通过各种综合数据实验，证明了我们的大规模方法可以提供与现有方法相当的性能，同时使用更少的计算时间和内存。

著录项

来源
《Statistics and computing》 |2018年第1期|113-130|共18页
作者
Zhang Qinyi; Filippi Sarah; Gretton Arthur; Sejdinovic Dino;
展开▼
作者单位

Univ Oxford, Dept Stat, Oxford, England;

Imperial Coll, Dept Math, London, England|Imperial Coll, Dept Epidemiol & Biostat, London, England;

UCL, Gatsby Computat Neurosci Unit, London, England;

Univ Oxford, Dept Stat, Oxford, England;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Independence testing; Large-scale kernel method; Hilbert-Schmidt independence criteria; Random Fourier features; Nystrom method;

机译：独立性测试;大规模核方法;希尔伯特-施密特独立性准则;随机傅里叶特征;奈斯特罗姆方法;

相似文献

外文文献
中文文献
专利

1. Rare variant testing across methods and thresholds using the multi-kernel sequence kernel association test (MK-SKAT) [J] . Urrutia Eugene, Lee Seunggeun, Maity Arnab, Statistics and Its Interface . 2015,第4期

机译：使用多内核序列内核关联测试（MK-SKAT）跨方法和阈值进行的罕见变体测试
2. Rare variant testing across methods and thresholds using the multi-kernel sequence kernel association test (MK-SKAT) [J] . Eugene Urrutia, Seunggeun Lee, Arnab Maity, Statistics and Its Interface . 2015,第4期

机译：使用多内核序列内核关联测试（MK-SKAT）跨方法和阈值进行稀有变异测试
3. KERNEL METHODS FOR INDEPENDENCE MEASUREMENT WITH COEFFICIENT CONSTRAINTS [J] . HENG CHEN, JITAO WU International Journal of Wavelets, Multiresolution and Information Processing . 2014,第1期

机译：具有约束条件的独立性测量的核方法
4. Nonparametric Independence Tests: Space Partitioning and Kernel Approaches [C] . Arthur Gretton, Laszlo Gyoerfi Algorithmic learning theory . 2008

机译：非参数独立测试：空间划分和内核方法
5. Large-scale machine learning using kernel methods. [D] . Wu, Gang. 2006

机译：使用内核方法的大规模机器学习。
6. Rare variant testing across methods and thresholds using the multi-kernel sequence kernel association test (MK-SKAT) [O] . Eugene Urrutia, Seunggeun Lee, Arnab Maity, -1

机译：使用多内核序列内核关联测试（MK-SKAT）跨方法和阈值进行稀有变异测试
7. Large-scale kernel methods for independence testing [O] . Zhang, Q, Filippi, S, Gretton, A, 2017

机译：用于独立测试的大规模内核方法

Large-scale kernel methods for independence testing

摘要

著录项

相似文献

相关主题

期刊订阅