首页> 美国卫生研究院文献>other >Estimation of high dimensional mean regression in the absence of symmetry andlight tail assumptions
【2h】

Estimation of high dimensional mean regression in the absence of symmetry andlight tail assumptions

机译:在不存在对称性的情况下估计高维均值轻尾假设

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Data subject to heavy-tailed errors are commonly encountered in various scientific fields. To address this problem, procedures based on quantile regression and Least Absolute Deviation (LAD) regression have been developed in recent years. These methods essentially estimate the conditional median (or quantile) function. They can be very different from the conditional mean functions, especially when distributions are asymmetric and heteroscedastic. How can we efficiently estimate the mean regression functions in ultra-high dimensional setting with existence of only the second moment? To solve this problem, we propose a penalized Huber loss with diverging parameter to reduce biases created by the traditional Huber loss. Such a penalized robust approximate quadratic (RA-quadratic) loss will be called RA-Lasso. In the ultra-high dimensional setting, where the dimensionality can grow exponentially with the sample size, our results reveal that the RA-lasso estimator produces a consistent estimator at the same rate as the optimal rate under the light-tail situation. We further study the computational convergence of RA-Lasso and show that the composite gradient descent algorithm indeed produces a solution that admits the same optimal rate after sufficient iterations. As a byproduct, we also establish the concentration inequality for estimating population mean when there exists only the second moment. We compare RA-Lasso with other regularizedrobust estimators based on quantile regression and LAD regression. Extensive simulationstudies demonstrate the satisfactory finite-sample performance of RA-Lasso.
机译:遭受重尾错误的数据通常在各个科学领域中遇到。为了解决这个问题,近年来已经开发了基于分位数回归和最小绝对偏差(LAD)回归的程序。这些方法从本质上估计条件中值(或分位数)函数。它们可能与条件均值函数有很大不同,尤其是当分布是不对称和异方差时。仅存在第二矩,如何才能在超高维环境中有效地估计均值回归函数?为了解决这个问题,我们提出了一种带有发散参数的惩罚性Huber损失,以减少传统Huber损失产生的偏差。这样的惩罚性鲁棒近似二次(RA-二次)损失将被称为RA-套索。在超高维环境中,维数可以随样本大小呈指数增长,我们的结果表明,RA-lasso估计器在轻尾情况下以与最佳比率相同的比率生成一致的估计器。我们进一步研究了RA-Lasso的计算收敛性,并证明了复合梯度下降算法的确产生了一种解决方案,该解决方案在经过足够的迭代后即可获得相同的最优速率。作为副产品,当仅存在第二个矩时,我们还建立了浓度不等式以估计总体均值。我们将RA-Lasso与其他正规化进行比较基于分位数回归和LAD回归的稳健估计量。广泛的模拟研究表明,RA-Lasso具有令人满意的有限样本性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号