Early stopping aggregation in selective variable selection ensembles for high-dimensional linear regression models

Zhang Chun-Xia; Zhang Jiang-She; Yin Qing-Yan

首页> 外文期刊>Knowledge-Based Systems >Early stopping aggregation in selective variable selection ensembles for high-dimensional linear regression models

【24h】

Early stopping aggregation in selective variable selection ensembles for high-dimensional linear regression models

机译：高维线性回归模型的选择性变量选择集合中的提早停止聚集

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, variable selection has become the most popular and effective tool to analyze high-dimensional data. Among the existing approaches, variable selection ensembles (VSEs) have exhibited their great power in improving selection accuracy and stabilizing the results of a traditional selection method. The construction of a VSE generally consists of two phases, i.e., ensemble generation and ensemble aggregation. We study selective VSEs in this paper by inserting a pruning step before combining the generated members into a VSE. As a result, a smaller but more accurate subensemble can be obtained. By taking ST2E (stochastic stepwise ensemble) as our main example, we first extended it to handle high-dimensional data. On the basis of its individuals, the aggregation order is rearranged according to their corresponding RIC, (corrected risk inflation criterion) values. Then, only some members ranked ahead are averaged to estimate the importance measures for each candidate variable. In terms of several variable ranking and selection metrics, experiments conducted with simulated and real-world high-dimensional data show that pruned ST2E is superior to several other benchmark methods in most cases. By analyzing the accuracy-diversity patterns of VSEs, the pruning step is found to exclude less accurate members and lead the reserved members to more concentrate on the true importance vector. (C) 2018 Elsevier B.V. All rights reserved.

机译：如今，变量选择已成为分析高维数据的最流行和最有效的工具。在现有方法中，变量选择集成（VSE）在提高选择精度和稳定传统选择方法的结果方面表现出了巨大的威力。 VSE的构建通常包括两个阶段，即，合奏生成和合奏聚合。我们通过在将生成的成员合并到VSE中之前插入修剪步骤来研究选择性VSE。结果，可以获得较小但更准确的子集合。通过以ST2E（随机逐步集成）为主要示例，我们首先将其扩展为处理高维数据。根据其个体，聚合顺序将根据其相应的RIC（校正后的风险通胀标准）值进行重新排列。然后，仅对排名靠前的一些成员进行平均，以估计每个候选变量的重要性度量。就几种变量排名和选择指标而言，对模拟和真实高维数据进行的实验表明，在大多数情况下，修剪后的ST2E优于其他几种基准方法。通过分析VSE的准确性-多样性模式，发现修剪步骤排除了精度较低的成员，并使保留的成员更加专注于真实重要性向量。（C）2018 Elsevier B.V.保留所有权利。

著录项

来源
《Knowledge-Based Systems》 |2018年第1期|1-11|共11页
作者
Zhang Chun-Xia; Zhang Jiang-She; Yin Qing-Yan;
展开▼
作者单位

Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China;

Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China;

Xian Univ Architecture & Technol, Sch Sci, Xian 710055, Shaanxi, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Variable selection ensemble; Ensemble pruning; Variable selection; Selection accuracy; Aggregation order; Ranking accuracy;

机译：变量选择集合;集合修剪;变量选择;选择精度;聚合顺序;排序精度;

相似文献

外文文献
中文文献
专利

1. A one covariate at a time, multiple testing approach to variable selection in high-dimensional linear regression models: A replication in a narrow sense [J] . Nunez Hector M., Otero Jesus Journal of applied econometrics . 2021,第6期

机译：一次一个协变量，高维线性回归模型中的多次测试方法是在狭义中的复制
2. Variable selection in high-dimensional sparse multiresponse linear regression models [J] . Luo Shan Statistical papers . 2020,第3期

机译：高维稀疏Multiresponse线性回归模型的变量选择
3. VARIABLE SELECTION FOR SPARSE HIGH-DIMENSIONAL NONLINEAR REGRESSION MODELS BY COMBINING NONNEGATIVE GARROTE AND SURE INDEPENDENCE SCREENING [J] . Shuang Wu, Hongqi Xue, Yichao Wu, Statistica Sinica . 2014,第3期

机译：结合非线性负数和确定独立性筛选的稀疏高维非线性回归模型的变量选择
4. A Novel Bagging Ensemble Approach for Variable Ranking and Selection for Linear Regression Models [C] . Chun-Xia Zhang, Jiang-She Zhang, Guan-Wei Wang International workshop on multiple classifier systems . 2015

机译：线性回归模型的变量排序和选择的新型袋装集成方法
5. Shrinkage-based variable selection methods for linear regression and mixed-effects models. [D] . Krishna, Arun. 2009

机译：基于收缩的变量选择方法用于线性回归和混合效果模型。
6. Variable Selection for Sparse High-Dimensional Nonlinear Regression Models by Combining Nonnegative Garrote and Sure Independence Screening [O] . Shuang Wu, Hongqi Xue, Yichao Wu, -1

机译：非负Garrote和Sure独立筛选相结合的稀疏高维非线性回归模型变量选择
7. An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models [O] . Yanxi Xie, Yuewen Li, Zhijie Xia, 2020

机译：一种改进的高维线性回归模型的前向回归变量选择算法

Early stopping aggregation in selective variable selection ensembles for high-dimensional linear regression models

摘要

著录项

相似文献

相关主题

期刊订阅