Corrupted and missing predictors: Minimax bounds for high-dimensional linear regression

机译：预测变量已损坏和缺失：高维线性回归的Minimax界

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Missing and corrupted data are ubiquitous in many science and engineering domains. We analyze the information-theoretic limits of recovering sparse vectors under various models of corrupted and missing data. In particular, consider a high-dimensional linear regression model y = X β^∗ + є, where y Є Rⁿ is the response vector, X Є R^nXp is a random design matrix with p ≫ n and rows distributed i.i.d. as Ν(0, Σx), β^∗ є R^p is the unknown regression vector, and є ∼ Ν(0,σ²єΙ) is independent additive noise. Whereas a traditional approach assumes that the covariates X are fully observed, we assume only that a corrupted version Z is observed. Our main contribution is to establish minimax rates of convergence for estimating β^∗ in squared ℓ2-loss, assuming β^∗ is k-sparse. Our upper and lower bounds in both additive noise and missing data cases scale as k log(p/k), with prefactors depending only on the corruption and/or missing pattern of the data.

机译：丢失和损坏的数据在许多科学和工程领域无处不在。我们分析了在损坏和丢失数据的各种模型下恢复稀疏向量的信息理论极限。特别是，考虑一个高维线性回归模型y = Xβ^{∗ +є，其中yЄR ^{n 是响应向量XЄR ^{nXp 是一个随机设计矩阵，其中p≫ n和行分布为iid由于Ν（0，Σx），β^{∗ єR ^{p 是未知回归向量，є〜Ν（0，σ^{2 єΙ ）是独立的加性噪声。传统方法假定已完全观察到协变量X，而我们仅假定观察到了损坏的版本Z。假设β^{∗ 是k稀疏的，我们的主要贡献是建立最小最大收敛速率来估计平方ℓ2-损失中的β^{∗ 。我们在加性噪声和丢失数据情况下的上限和下限按k log（p / k）/ n缩放，其前置因子仅取决于数据的损坏和/或丢失模式。}}}}}}}}

著录项

来源
《2012 IEEE International Symposium on Information Theory Proceedings》|2012年|p.2601- 2605|共5页
会议地点 Cambridge MA(US)
作者
Loh Po-Ling; Wainwright Martin J.;
展开▼
作者单位

Department of Statistics, University of California, Berkeley, Berkeley, CA 94720;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类通信理论;
关键词
入库时间 2022-08-26 13:54:36

相似文献

外文文献
中文文献
专利

1. NEARLY OPTIMAL MINIMAX ESTIMATOR FOR HIGH-DIMENSIONAL SPARSE LINEAR REGRESSION [J] . LI ZHANG The Annals of Statistics: An Official Journal of the Institute of Mathematical Statistics . 2013,第4期

机译：高维稀疏线性回归的近似最优MINMAX估计
2. Minimax Rates of Estimation for High-Dimensional Linear Regression Over $ell_q$ -Balls [J] . Raskutti G., Wainwright M. J., Yu B. Information Theory, IEEE Transactions on . 2011,第10期

机译：$ ell_q $ -Balls上的高维线性回归的最小估计速率
3. Missing Values in Linear Regression: Imputations Using An Error-Contaminated Linear Predictor [J] . Guria Sibnarayan, Sen Roy Sugata Communications in Statistics . 2015,第7a9期

机译：线性回归中缺少值：使用错误污染的线性预测器的避免
4. Corrupted and missing predictors: Minimax bounds for high-dimensional linear regression [C] . Loh Po-Ling, Wainwright Martin J. IEEE International Symposium on Information Theory . 2012

机译：损坏且缺少的预测器：高维线性回归的Minimax界限
5. A COMPARISON OF SIX MODELS FOR PREDICTING CORPORATE BANKRUPTCY: MULTIPLE LINEAR REGRESSION ANALYSIS, MULTIPLE LINEAR DISCRIMINANT ANALYSIS, STEPWISE REGRESSION ANALYSIS, STEPWISE DISCRIMINANT ANALYSIS, MULTIPLE LINEAR REGRESSION ANALYSIS WITH RIDGE REGRESSION, AND MULTIPLE LINEAR DISCRIMINANT ANALYSIS WITH BIASED MINIMUM CHI-SQUARE RULE [D] . MAPP, JOHNNIE ALBERT. 1981

机译：六种预测公司破产的模型的比较：多个线性回归分析，多个线性判别分析，逐步回归分析，逐步判别分析，多个带岭点回归的线性回归分析，以及多个线性离散
6. MINIMAX BOUNDS FOR SPARSE PCA WITH NOISY HIGH-DIMENSIONAL DATA [O] . Aharon Birnbaum, Iain M. Johnstone, Boaz Nadler, -1

机译：具有嘈杂高维数据的稀疏PCA的MINIMAX界线
7. Confidence Intervals for High-Dimensional Linear Regression: Minimax Rates and Adaptivity [O] . Cai, Tony, Guo, Zijian 2017

机译：高维线性回归的置信区间：Minimax速率和适应性
8. Minimax Linear Predictor under Lipschitz' Type Conditions for the Regression Function [R] . Takeuchi, K. 1968

机译：回归函数Lipschitz型条件下的minimax线性预测器

Corrupted and missing predictors: Minimax bounds for high-dimensional linear regression

摘要

著录项

相似文献

相关主题

期刊订阅