首页> 外文期刊>Computational statistics & data analysis >Using random subspace method for prediction and variable importance assessment in linear regression
【24h】

Using random subspace method for prediction and variable importance assessment in linear regression

机译:使用随机子空间方法进行线性回归的预测和变量重要性评估

获取原文
获取原文并翻译 | 示例
           

摘要

A random subset method (RSM) with a new weighting scheme is proposed and investigated for linear regression with a large number of features. Weights of variables are defined as averages of squared values of pertaining t-statistics over fitted models with randomly chosen features. It is argued that such weighting is advisable as it incorporates two factors: a measure of importance of the variable within the considered model and a measure of goodness-of-fit of the model itself. Asymptotic weights assigned by such a scheme are determined as well as assumptions under which the method leads to consistent choice of significant variables in the model. Numerical experiments indicate that the proposed method behaves promisingly when its prediction errors are compared with errors of penalty-based methods such as the lasso and it has much smaller false discovery rate than the other methods considered.
机译:提出了具有新加权方案的随机子集方法(RSM),并研究了具有大量特征的线性回归方法。变量的权重定义为具有随机选择特征的拟合模型上相关t统计量的平方值的平均值。有人认为,这种加权是可取的,因为它包含两个因素:衡量所考虑模型内变量的重要性和衡量模型本身的拟合优度。确定由这种方案分配的渐近权重以及在该方法导致模型中一致选择重要变量的假设。数值实验表明,该方法在将其预测误差与基于惩罚的方法(例如套索)的误差进行比较时表现良好,其误发现率比其他方法要小得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号