首页> 外文期刊>Statistical Methods and Applications >A group VISA algorithm for variable selection
【24h】

A group VISA algorithm for variable selection

机译:一组用于选择变量的VISA算法

获取原文
获取原文并翻译 | 示例
       

摘要

We consider the problem of selecting grouped variables in a linear regression model based on penalized least squares. The group-Lasso and the group-Lars procedures are designed for automatically performing both the shrinkage and the selection of important groups of variables. However, since the same tuning parameter is used (as in Lasso or Lars ) for both group variable selection and shrinkage coefficients, it can lead to over shrinkage the significant groups of variables or inclusion of many irrelevant groups of predictors. This situation occurs when the true number of non-zero groups of coefficients is small relative to the number of variables. We introduce a novel sparse regression method, called the Group-VISA (GVISA), which extends the VISA effect to grouped variables. It combines the idea of VISA algorithm which avoids the over shrinkage problem of regression coefficients and the idea of the GLars-type estimator which shrinks and selects the members of the group together. Hence, GVISA is able to select a sparse group model by avoiding the over shrinkage of GLars-type estimator. We distinguish two variants of the GVISA algorithm, each one is associated with each version of GLars (I and II). Moreover, we provide a path algorithm, similar to GLars, for efficiently computing the entire sample path of GVISA coefficients. We establish a theoretical property on sparsity inequality of GVISA estimator that is a non-asymptotic bound on the estimation error. A detailed simulation study in small and high dimensional settings is performed, which illustrates the advantages of the new approach in relation to several other possible methods. Finally, we apply GVISA on two real data sets.
机译:我们考虑在基于惩罚最小二乘的线性回归模型中选择分组变量的问题。 group-Lasso和group-Lars过程旨在自动执行收缩和重要变量组的选择。但是,由于对组变量选择和收缩系数使用了相同的调整参数(如在Lasso或Lars中),因此它可能导致大量变量过度收缩或包含许多无关的预测变量组。当系数的非零组的真实数目相对于变量数目较小时,会发生这种情况。我们介绍了一种新的稀疏回归方法,称为Group-VISA(GVISA),该方法将VISA效果扩展到分组变量。它结合了避免回归系数过度收缩问题的VISA算法的思想和GLars型估计器的思想,后者收缩并选择了组中的成员。因此,GVISA可以避免GLars型估计器的过度收缩,从而选择稀疏组模型。我们区分GVISA算法的两个变体,每个变体与GLars的每个版本(I和II)相关联。此外,我们提供了一种类似于GLars的路径算法,可以有效地计算GVISA系数的整个样本路径。我们建立了GVISA估计的稀疏不等式的理论性质,该性质是估计误差的非渐近界。在小尺寸和高尺寸环境下进行了详细的仿真研究,这说明了新方法相对于其他几种可能方法的优势。最后,我们将GVISA应用于两个真实数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号