首页> 外文期刊>Computational statistics & data analysis >Variable selection in high-dimensional partially linear additive models for composite quantile regression
【24h】

Variable selection in high-dimensional partially linear additive models for composite quantile regression

机译:高维部分线性相加模型中用于复合分位数回归的变量选择

获取原文
获取原文并翻译 | 示例
           

摘要

A new estimation procedure based on the composite quantile regression is proposed for the semiparametric additive partial linear models, of which the nonparametric components are approximated by polynomial splines. The proposed estimation method can simultaneously estimate both the parametric regression coefficients and nonparametric components without any specification of the error distributions. The proposed estimation method is empirically shown to be much more efficient than the popular least-squares-based estimation method for non-normal random errors, especially for Cauchy error, and almost as efficient for normal random errors. To achieve sparsity in high-dimensional and sparse additive partial linear models, of which the number of linear covariates is much larger than the sample size but that of significant covariates is small relative to the sample size, a variable selection procedure based on adaptive Lasso is proposed to conduct estimation and variable selection simultaneously. The procedure is shown to possess the oracle property, and is much superior to the adaptive Lasso penalized least-squares-based method regardless of the random error distributions. In particular, two kinds of weights in the penalty are considered, namely the composite quantile regression estimates and Lasso penalized composite quantile regression estimates. Both types of weights perform very well with the latter performing especially well in terms of precisely selecting significant variables. The simulation results are consistent with the theoretical properties. A real data example is used to illustrate the application of the proposed methods.
机译:针对半参数加法部分线性模型,提出了一种基于复合分位数回归的新估计程序,该估计模型的非参数分量由多项式样条近似。所提出的估计方法可以同时估计参数回归系数和非参数分量,而无需任何误差分布的说明。实验证明,对于非正常随机误差,特别是对于柯西误差,拟议的估计方法比流行的基于最小二乘法的估计方法有效得多,对于正常随机误差,效率几乎相同。为了在高维稀疏的部分线性模型中获得稀疏性,其中线性协变量的数量远大于样本量,而重要协变量的数量相对于样本量较小,基于自适应Lasso的变量选择程序为建议同时进行估计和变量选择。该过程显示出具有oracle属性,并且无论随机误差分布如何,它都比基于自适应套索惩罚最小二乘法的方法优越得多。特别地,考虑了罚分中的两种权重,即复合分位数回归估计和拉索罚分复合分位数回归估计。两种权重的表现都非常好,后者在精确选择重要变量方面表现尤其出色。仿真结果与理论性质吻合。一个真实的数据示例用于说明所提出方法的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号