A group VISA algorithm for variable selection

Mkhadri Abdallah; Ouhourane Mohamed

首页> 外文期刊>Statistical Methods and Applications >A group VISA algorithm for variable selection

【24h】

A group VISA algorithm for variable selection

机译：一组用于选择变量的VISA算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider the problem of selecting grouped variables in a linear regression model based on penalized least squares. The group-Lasso and the group-Lars procedures are designed for automatically performing both the shrinkage and the selection of important groups of variables. However, since the same tuning parameter is used (as in Lasso or Lars ) for both group variable selection and shrinkage coefficients, it can lead to over shrinkage the significant groups of variables or inclusion of many irrelevant groups of predictors. This situation occurs when the true number of non-zero groups of coefficients is small relative to the number of variables. We introduce a novel sparse regression method, called the Group-VISA (GVISA), which extends the VISA effect to grouped variables. It combines the idea of VISA algorithm which avoids the over shrinkage problem of regression coefficients and the idea of the GLars-type estimator which shrinks and selects the members of the group together. Hence, GVISA is able to select a sparse group model by avoiding the over shrinkage of GLars-type estimator. We distinguish two variants of the GVISA algorithm, each one is associated with each version of GLars (I and II). Moreover, we provide a path algorithm, similar to GLars, for efficiently computing the entire sample path of GVISA coefficients. We establish a theoretical property on sparsity inequality of GVISA estimator that is a non-asymptotic bound on the estimation error. A detailed simulation study in small and high dimensional settings is performed, which illustrates the advantages of the new approach in relation to several other possible methods. Finally, we apply GVISA on two real data sets.

机译：我们考虑在基于惩罚最小二乘的线性回归模型中选择分组变量的问题。 group-Lasso和group-Lars过程旨在自动执行收缩和重要变量组的选择。但是，由于对组变量选择和收缩系数使用了相同的调整参数（如在Lasso或Lars中），因此它可能导致大量变量过度收缩或包含许多无关的预测变量组。当系数的非零组的真实数目相对于变量数目较小时，会发生这种情况。我们介绍了一种新的稀疏回归方法，称为Group-VISA（GVISA），该方法将VISA效果扩展到分组变量。它结合了避免回归系数过度收缩问题的VISA算法的思想和GLars型估计器的思想，后者收缩并选择了组中的成员。因此，GVISA可以避免GLars型估计器的过度收缩，从而选择稀疏组模型。我们区分GVISA算法的两个变体，每个变体与GLars的每个版本（I和II）相关联。此外，我们提供了一种类似于GLars的路径算法，可以有效地计算GVISA系数的整个样本路径。我们建立了GVISA估计的稀疏不等式的理论性质，该性质是估计误差的非渐近界。在小尺寸和高尺寸环境下进行了详细的仿真研究，这说明了新方法相对于其他几种可能方法的优势。最后，我们将GVISA应用于两个真实数据集。

著录项

来源
《Statistical Methods and Applications》 |2015年第1期|41-60|共20页
作者
Mkhadri Abdallah; Ouhourane Mohamed;
展开▼
作者单位

Cadi Ayyad Univ, Fac Sci Semlalia, Marrakech, Morocco;

Cadi Ayyad Univ, Fac Sci Semlalia, Marrakech, Morocco;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Group variable selection; VISA; Grouped Lars; Linear regression;

机译：分组变量选择;VISA;分组Lars;线性回归;
入库时间 2022-08-18 02:28:04

相似文献

外文文献
中文文献
专利

1. Transdimensional Sampling Algorithms for Bayesian Variable Selection in Classification Problems With Many More Variables Than Observations [J] . Lamnisos D, Griffin JE, Steel MFJ Journal of computational and graphical statistics: A joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America . 2009,第3期

机译：分类问题比观察值更多的贝叶斯变量选择的多维采样算法
2. Uninformative variable elimination for improvement of successive projections algorithm on spectral multivariable selection with different calibration algorithms for the rapid and non-destructive determination of protein content in dried laver [J] . Di Wu, Xiaojing Chen, Xiangou Zhu Analytical methods . 2011,第8期

机译：无信息变量消除，用于连续投影算法对光谱多变量选择的改进，具有不同的校正算法，可以快速，无损地确定干紫菜中的蛋白质含量
3. Uninformative variable elimination for improvement of successive projections algorithm on spectral multivariable selection with different calibration algorithms for the rapid and non-destructive determination of protein content in dried laver [J] . Xiaojing Chen, Xiangou Zhu, Xiaochun Guan, Analytical methods . 2011,第8期

机译：无信息变量消除，用于连续投影算法对光谱多变量选择的改进，具有不同的校正算法，可以快速，无损地确定干紫菜中的蛋白质含量
4. SAR PROCESSING ALGORITHMS FOR ENVISAT ASAR IMAGE MODE AND ALTERNATING POLARIZATION MODE IN THE KSPT ENVISAT ASAR PROCESSOR [C] . Gunnar L. Rasmussen, Ole Morten Olsen, Marte Indregard Proceedings of the 22nd Asian Conference on Remote Sensing . 2001

机译：KSPT ENVISAT ASAR处理器中ENVISAT ASAR图像模式和交替极化模式的SAR处理算法
5. An Information Based Optimal Subdata Selection Algorithm for Big Data Linear Regression and a Suitable Variable Selection Algorithm. [D] . Zheng, Yi. 2017

机译：大数据线性回归的基于信息的最优子数据选择算法和合适的变量选择算法。
6. Hybrid Model Based on Genetic Algorithms and SVM Applied to Variable Selection within Fruit Juice Classification [O] . C. Fernandez-Lozano, C. Canto, M. Gestal, 2013

机译：基于遗传算法和支持向量机的混合模型在果汁分类中的变量选择
7. Transdimensional Sampling Algorithms for Bayesian Variable Selection in Classification Problems With Many More Variables Than Observations [O] . Lamnisos, Demetris, Griffin, Jim E., Steel, Mark F.J. 2009

机译：分类问题比观察值更多的贝叶斯变量选择的多维采样算法
8. Pattern Search Ranking and Selection Algorithms for Mixed-Variable Optimization of Stochastic Systems [R] . Iver, T. A. 2004

机译：随机系统混合变量优化模式搜索排序与选择算法

A group VISA algorithm for variable selection

摘要

著录项

相似文献

相关主题

期刊订阅