Consider the standard linear regression model y = Xβ^* + w, where y ∈ R^n is an observation vector, X ∈ R^{n×d} is a measurement matrix, β^* ∈ R^d is the unknown regression vector, and w ∼ N(0, σ^2 I) is additive Gaussian noise. This paper determines sharp minimax rates of convergence for estimation of β^* in ℓ_2-norm, assuming that β^* belongs to a weak ℓ_q-ball B_q(R_q) for some q ∈ [0, 1]. We show that under suitable regularity conditions on the design matrix X, the minimax error in squared ℓ_2-norm scales as R_q (log d / n)^{1 − q/2}. In addition, we provide lower bounds on rates of convergence for general ℓ_p-norms (for all p ∈ [1, +∞], p ≠ q). Our proofs of the lower bounds are information-theoretic in nature, based on Fano's inequality and results on the metric entropy of the balls B_q(R_q). Matching upper bounds are derived by direct analysis of the solution of an optimization problem over B_q(R_q). We prove that the conditions on X required by optimal algorithms are satisfied with high probability by broad classes of non-i.i.d. Gaussian random matrices, for which RIP or other sparse eigenvalue conditions are violated. For q = 0, ℓ_1-based methods (the Lasso and the Dantzig selector) achieve the minimax-optimal rate in ℓ_2 error, but require stronger regularity conditions on the design than the non-convex optimization algorithm used to establish the minimax upper bounds.
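As a rough numerical illustration (not part of the paper), recall that the ℓ_q-ball is B_q(R_q) = {β ∈ R^d : Σ_j |β_j|^q ≤ R_q}, with q = 0 counting nonzero coordinates, so R_0 = s gives the hard-sparsity rate s log(d)/n. The sketch below simulates this q = 0 case with an i.i.d. Gaussian design and fits the Lasso (one of the ℓ_1-based methods named above) with the standard regularization scaling λ ∝ σ √(log d / n); the dimensions, signal magnitudes, and constant in λ are illustrative choices of ours, not values from the paper.

import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
d, s, sigma = 400, 5, 1.0          # ambient dimension, sparsity (q = 0), noise level

for n in (100, 200, 400, 800):
    # Hard-sparse ground truth: s nonzero coordinates of magnitude 1.
    beta_star = np.zeros(d)
    beta_star[rng.choice(d, size=s, replace=False)] = 1.0

    X = rng.standard_normal((n, d))            # i.i.d. Gaussian design
    y = X @ beta_star + sigma * rng.standard_normal(n)

    # Standard theoretical scaling lambda ~ sigma * sqrt(log d / n);
    # the factor 2 is an illustrative constant.
    lam = 2.0 * sigma * np.sqrt(np.log(d) / n)
    beta_hat = Lasso(alpha=lam, fit_intercept=False, max_iter=50_000).fit(X, y).coef_

    err = np.sum((beta_hat - beta_star) ** 2)  # squared l2 estimation error
    rate = s * np.log(d) / n                   # predicted minimax scaling for q = 0
    print(f"n={n:4d}  squared l2 error={err:.4f}  s*log(d)/n={rate:.4f}")

Running this, the observed squared ℓ_2 error should shrink roughly in proportion to s log(d)/n as n grows, consistent with the q = 0 specialization of the R_q (log d / n)^{1 − q/2} rate; it is a sanity check on the scaling, not a verification of the minimax constants.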