首页> 外文会议>Annual conference on Neural Information Processing Systems >Fast Randomized Kernel Ridge Regression with Statistical Guarantees
【24h】

Fast Randomized Kernel Ridge Regression with Statistical Guarantees

机译:具有统计保证的快速随机核岭回归

获取原文

摘要

One approach to improving the running time of kernel-based methods is to build a small sketch of the kernel matrix and use it in lieu of the full matrix in the machine learning task of interest. Here, we describe a version of this approach that comes with running time guarantees as well as improved guarantees on its statistical performance. By extending the notion of statistical leverage scores to the setting of kernel ridge regression, we are able to identify a sampling distribution that reduces the size of the sketch (i.e., the required number of columns to be sampled) to the effective dimensionality of the problem. This latter quantity is often much smaller than previous bounds that depend on the maximal degrees of freedom. We give an empirical evidence supporting this fact. Our second contribution is to present a fast algorithm to quickly compute coarse approximations to these scores in time linear in the number of samples. More precisely, the running time of the algorithm is O(np~2) with p only depending on the trace of the kernel matrix and the regularization parameter. This is obtained via a variant of squared length sampling that we adapt to the kernel setting. Lastly, we discuss how this new notion of the leverage of a data point captures a fine notion of the difficulty of the learning problem.
机译:改善基于内核的方法的运行时间的一种方法是构建内核矩阵的小图,并用它代替感兴趣的机器学习任务中的完整矩阵。在这里,我们描述了这种方法的一种版本,该版本附带运行时间保证以及对其统计性能的改进保证。通过将统计杠杆分数的概念扩展到内核岭回归的设置,我们能够确定一种抽样分布,从而将草图的大小(即,需要抽样的列数)减小到问题的有效维度。 。后一个数量通常比依赖于最大自由度的先前范围小得多。我们给出了支持这一事实的经验证据。我们的第二个贡献是提出了一种快速算法,可以在样本数量的时间线性范围内快速计算出这些分数的粗略近似值。更准确地说,该算法的运行时间为O(np〜2),其中p仅取决于核矩阵的轨迹和正则化参数。这是通过我们适应内核设置的平方长度采样的一种变体获得的。最后,我们讨论这个新的关于数据点杠杆作用的概念如何捕获学习问题难度的一个很好的概念。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号