【24h】

Estimating LASSO Risk and Noise Level

机译:估算套索风险和噪声水平

获取原文

摘要

We study the fundamental problems of variance and risk estimation in high dimensional statistical modeling. In particular, we consider the problem of learning a coefficient vector θ_0 ∈ R~p from noisy linear observations y = Xθ_0 +w ∈R~n (p > n) and the popular estimation procedure of solving the ?_1 -penalized least squares objective known as the LASSO or Basis Pursuit DeNoising (BPDN). In this context, we develop new estimators for the ?_2 estimation risk ‖θ-circumfles-θ_0‖_2 and the variance of the noise when distributions of θ_0 and w are unknown. These can be used to select the regularization parameter optimally. Our approach combines Stein's unbiased risk estimate [Ste81] and the recent results of [BM12a][BM12b] on the analysis of approximate message passing and the risk of LASSO. We establish high-dimensional consistency of our estimators for sequences of matrices X of increasing dimensions, with independent Gaussian entries. We establish validity for a broader class of Gaussian designs, conditional on a certain conjecture from statistical physics. To the best of our knowledge, this result is the first that provides an asymptotically consistent risk estimator for the LASSO solely based on data. In addition, we demonstrate through simulations that our variance estimation outperforms several existing methods in the literature.
机译:我们研究在高维统计建模方差和风险评估的基本问题。特别是,我们考虑学习系数矢量θ_0∈的R〜从嘈杂线性观测P Y [=Xθ_0+ W∈R〜N(P> n)和解决的流行估计过程?_1 -penalized最小二乘目标问题被称为LASSO或基追踪去噪(BPDN)。在此背景下,我们开发了?_2估计风险‖θ-circumfles-θ_0‖_2和噪声方差估计新时θ_0的和W的分布是未知的。这些可以用来选择最佳的调整参数。我们的方法结合了斯坦因的无偏估计风险[Ste81]和最近的结果[BM12a] [BM12b]在经过近似消息的分析和套索风险。我们建立增加维度的矩阵X的序列,我们估计高维的一致性,具有独立的高斯项。我们为更广泛的类高斯设计,从统计物理学中的某些猜想条件的确立有效性。据我们所知,这样的结果就是提供完全基于数据套索渐近一致的风险估计第一。此外,我们通过模拟证明,我们的方差估计优于文献中现有的几种方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号