Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging

机译：速写脊回归：优化透视，统计视角和型号平均

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the statistical and optimization impacts of using classical sketch versus Hessian sketch to solve approximately the Matrix Ridge Regression (MRR) problem. Prior research has considered the effects of classical sketch on least squares regression (LSR), a strictly simpler problem. We establish that classical sketch has a similar effect upon the optimization properties of MRR as it does on those of LSR-namely, it recovers nearly optimal solutions. In contrast, Hessian sketch does not have this guarantee; instead, the approximation error is governed by a subtle interplay between the "mass" in the responses and the optimal objective value. For both types of approximations, the regularization in the sketched MRR problem gives it significantly different statistical properties from the sketched LSR problem. In particular, there is a bias-variance trade-off in sketched MRR that is not present in sketched LSR. We provide upper and lower bounds on the biases and variances of sketched MRR; these establish that the variance is significantly increased when classical sketches are used, while the bias is significantly increased when using Hessian sketches. Empirically, sketched MRR solutions can have risks that are higher by an order-of-magnitude than those of the optimal MRR solutions. We establish theoretically and empirically that model averaging greatly decreases this gap. Thus, in the distributed setting, sketching combined with model averaging is a powerful technique that quickly obtains near-optimal solutions to the MRR problem while greatly mitigating the statistical risks incurred by sketching.

机译：我们解决了使用古典草图与Hessian草图的统计和优化影响，以解决大致矩阵脊回归（MRR）问题。现有研究考虑了古典草图对最小二乘回归（LSR）的影响，严格更简单的问题。我们建立了古典草图对MRR的优化特性的效果类似，因为它在LSR的那些时，它恢复了几乎最佳的解决方案。相比之下，黑森州素描没有这种保证;相反，近似误差由响应中的“质量”与最佳目标值之间的微妙相互作用。对于这两种类型的近似，草图MRR问题中的正则化从草图的LSR问题中提供了显着不同的统计特性。特别是，在草图的MRR中，在草图中的MRR中有一个偏差差异。我们为草图MRR的偏差和差异提供上下界限;当使用经典草图时，这些方案明显增加，虽然使用黑森州草图时偏差显着增加。经验上，速写的MRR解决方案可能具有比最佳MRR解决方案更高的风险幅度较高。我们理论上和经验地制定了模型平均值大大降低了这种差距。因此，在分布式设置中，与模型平均相结合的速写是一种强大的技术，可以快速获得MRR问题的近最佳解决方案，同时大大减轻了草图所产生的统计风险。

著录项

来源
《International Conference on Machine Learning》|2018年|4774-5589p|共9页
会议地点
作者
Shusen Wang; Alex Gittens; Michael W. Mahoney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP181-53;
关键词

相似文献

外文文献
中文文献
专利

1. Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging [J] . Shusen Wang, Alex Gittens, Michael W. Mahoney JMLR: Workshop and Conference Proceedings . 2017,第1期

机译：草绘的岭回归：优化角度，统计角度和模型平均
2. On the statistical and economic performance of stock return predictive regression models: An international perspective [J] . Giot P., Petitjean M. Quantitative finance . 2011,第2期

机译：股票收益预测回归模型的统计和经济绩效：国际视角
3. On the statistical and economic performance of stock return predictive regression models: an international perspective [J] . PierreGiotaMikaelPetitjeanb* Quantitative Finance . 2011,第2期

机译：股票收益预测回归模型的统计和经济绩效：国际视角
4. Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging [C] . Shusen Wang, Alex Gittens, Michael W. Mahoney International Conference on Machine Learning . 2018

机译：速写脊回归：优化透视，统计视角和型号平均
5. Modeling differential item functioning (DIF) using multilevel logistic regression models: A Bayesian perspective. [D] . Chaimongkol, Saengla. 2005

机译：使用多级逻辑回归模型对差异项功能（DIF）进行建模：贝叶斯观点。
6. The Stream Algorithm: Computationally Efficient Ridge-Regression via Bayesian Model Averaging and Applications to Pharmacogenomic Prediction of Cancer Cell Line Sensitivity [O] . Elias Chaibub Neto, In Sock Jang, Stephen H. Friend, -1

机译：流算法：通过贝叶斯模型平均计算有效的岭回归及其在癌细胞系敏感性药物基因组学预测中的应用
7. Table 3: Discriminative power of the ReproQ based on statistical power (significance perspective) and the ability to detect 1.0 and 0.5 MID difference (relevance perspective) for all ReproQ outcomes during pregnancy (npu = 42), childbirth and postnatal period (npu = 55). (A) The mean best-practices is the pooled average of the 10 best performing units. [O] . -1

机译：表3：基于统计权力（重要性观点）的Reproq的辨别力和检测妊娠期间所有Reproq结果的1.0和0.5个中差（相关性观点）的能力（NPU = 42），分娩和产后期（NPU = 55 ）。（a）平均最佳实践是10％最佳表现单位的汇总平均值。

Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging

摘要

著录项

相似文献

相关主题

期刊订阅