首页> 美国卫生研究院文献>other >Sparse Regression by Projection and Sparse Discriminant Analysis
【2h】

Sparse Regression by Projection and Sparse Discriminant Analysis

机译:投影的稀疏回归和稀疏判别分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Recent years have seen active developments of various penalized regression methods, such as LASSO and elastic net, to analyze high dimensional data. In these approaches, the direction and length of the regression coefficients are determined simultaneously. Due to the introduction of penalties, the length of the estimates can be far from being optimal for accurate predictions. We introduce a new framework, regression by projection, and its sparse version to analyze high dimensional data. The unique nature of this framework is that the directions of the regression coefficients are inferred first, and the lengths and the tuning parameters are determined by a cross validation procedure to achieve the largest prediction accuracy. We provide a theoretical result for simultaneous model selection consistency and parameter estimation consistency of our method in high dimension. This new framework is then generalized such that it can be applied to principal components analysis, partial least squares and canonical correlation analysis. We also adapt this framework for discriminant analysis. Compared to the existing methods, where there is relatively little control of the dependency among the sparse components, our method can control the relationships among the components. We present efficient algorithms and related theory for solving the sparse regression by projection problem. Based on extensive simulations and real data analysis, we demonstrate that our method achieves good predictive performance and variable selection in the regression setting, and the ability to control relationships between the sparse components leads to more accurate classification. In available online, the details of the algorithms and theoretical proofs, and R codes for all simulation studies are provided.
机译:近年来,各种惩罚性回归方法(例如LASSO和弹性网)用于分析高维数据的研究得到了积极发展。在这些方法中,同时确定回归系数的方向和长度。由于采用了惩罚措施,因此估算的长度可能远非理想的准确预测值。我们介绍了一个新的框架,通过投影回归,以及它的稀疏版本来分析高维数据。该框架的独特性质是,首先推断回归系数的方向,然后通过交叉验证过程确定长度和调整参数,以实现最大的预测精度。我们为高维方法的同时模型选择一致性和参数估计一致性提供了理论结果。然后对该新框架进行概括,以便可以将其应用于主成分分析,偏最小二乘和规范相关分析。我们还将这个框架用于判别分析。与现有方法相比,在稀疏组件之间对依赖关系的控制相对较少的情况下,我们的方法可以控制组件之间的关系。我们提出了通过投影问题解决稀疏回归的有效算法和相关理论。基于大量的模拟和真实的数据分析,我们证明了我们的方法在回归设置中实现了良好的预测性能和变量选择,并且控制稀疏组件之间的关系的能力导致了更准确的分类。在线提供了所有算法研究的算法和理论证明以及R代码的详细信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号