首页> 外文期刊>Australian & New Zealand journal of statistics >Greedy forward regression for variable screening
【24h】

Greedy forward regression for variable screening

机译:贪婪前向回归用于变量筛选

获取原文
获取原文并翻译 | 示例
       

摘要

In the ultra-high dimensional setting, two popular variable screening methods with the desirable sure screening property are sure independence screening (SIS) and forward regression (FR). Both are classical variable screening methods, and recently have attracted greater attention under high-dimensional data analysis. We consider a new and simple screening method that incorporates multiple predictors at each step of forward regression, with decisions on which variables to incorporate based on the same criterion. If only one step is carried out, the new procedure reduces to SIS. Thus it can be regarded as a generalisation and unification of FR and SIS. More importantly, it preserves the sure screening property and has computational complexity similar to FR at each step, yet it can discover the relevant covariates in fewer steps. Thus it reduces the computational burden of FR drastically while retaining the advantages of the latter over SIS. Furthermore, we show that it can find all the true variables if the number of steps taken is the same as the correct model size, which is a new theoretical result even for the original FR. An extensive simulation study and application to two real data examples demonstrate excellent performance of the proposed method.
机译:在超高维环境中,具有理想的确定筛选属性的两种流行的变量筛选方法是确定独立筛选(SIS)和正向回归(FR)。两者都是经典的变量筛选方法,最近在高维数据分析下引起了更多关注。我们考虑一种新的简单筛选方法,该方法在正向回归的每个步骤都包含多个预测变量,并根据相同的标准来决定合并哪些变量。如果仅执行一个步骤,则新过程将简化为SIS。因此,可以将其视为FR和SIS的概括和统一。更重要的是,它保留了肯定的筛选属性,并且在每一步都具有类似于FR的计算复杂性,但是它可以在更少的步骤中发现相关的协变量。因此,它大大减少了FR的计算负担,同时保留了FR优于SIS的优势。此外,我们表明,如果采取的步骤数与正确的模型大小相同,它将找到所有真实变量,即使对于原始FR,这也是一个新的理论结果。广泛的仿真研究和对两个真实数据示例的应用证明了该方法的出色性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号