首页> 外文会议>International Conference on Machine Learning >Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging
【24h】

Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging

机译:速写脊回归:优化透视,统计视角和型号平均

获取原文

摘要

We address the statistical and optimization impacts of using classical sketch versus Hessian sketch to solve approximately the Matrix Ridge Regression (MRR) problem. Prior research has considered the effects of classical sketch on least squares regression (LSR), a strictly simpler problem. We establish that classical sketch has a similar effect upon the optimization properties of MRR as it does on those of LSR-namely, it recovers nearly optimal solutions. In contrast, Hessian sketch does not have this guarantee; instead, the approximation error is governed by a subtle interplay between the "mass" in the responses and the optimal objective value. For both types of approximations, the regularization in the sketched MRR problem gives it significantly different statistical properties from the sketched LSR problem. In particular, there is a bias-variance trade-off in sketched MRR that is not present in sketched LSR. We provide upper and lower bounds on the biases and variances of sketched MRR; these establish that the variance is significantly increased when classical sketches are used, while the bias is significantly increased when using Hessian sketches. Empirically, sketched MRR solutions can have risks that are higher by an order-of-magnitude than those of the optimal MRR solutions. We establish theoretically and empirically that model averaging greatly decreases this gap. Thus, in the distributed setting, sketching combined with model averaging is a powerful technique that quickly obtains near-optimal solutions to the MRR problem while greatly mitigating the statistical risks incurred by sketching.
机译:我们解决了使用古典草图与Hessian草图的统计和优化影响,以解决大致矩阵脊回归(MRR)问题。现有研究考虑了古典草图对最小二乘回归(LSR)的影响,严格更简单的问题。我们建立了古典草图对MRR的优化特性的效果类似,因为它在LSR的那些时,它恢复了几乎最佳的解决方案。相比之下,黑森州素描没有这种保证;相反,近似误差由响应中的“质量”与最佳目标值之间的微妙相互作用。对于这两种类型的近似,草图MRR问题中的正则化从草图的LSR问题中提供了显着不同的统计特性。特别是,在草图的MRR中,在草图中的MRR中有一个偏差差异。我们为草图MRR的偏差和差异提供上下界限;当使用经典草图时,这些方案明显增加,虽然使用黑森州草图时偏差显着增加。经验上,速写的MRR解决方案可能具有比最佳MRR解决方案更高的风险幅度较高。我们理论上和经验地制定了模型平均值大大降低了这种差距。因此,在分布式设置中,与模型平均相结合的速写是一种强大的技术,可以快速获得MRR问题的近最佳解决方案,同时大大减轻了草图所产生的统计风险。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号