首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Sparse partial least-squares regression and its applications to high-throughput data analysis
【24h】

Sparse partial least-squares regression and its applications to high-throughput data analysis

机译:稀疏偏最小二乘回归及其在高通量数据分析中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

The partial least-squares (PLS) method is designed for prediction problems where the number of predictors is larger than the number of training samples. PLS is based on latent components that are linear combinations of all of the original predictors, so it automatically employs all predictors regardless of their relevance. This will potentially compromise its performance, but it will also make it difficult to interpret the result. In this paper, we propose a new formulation of the sparse PLS (SPLS) procedure to allow both sparse variable selection and dimension reduction. We use the standard L_(1)-penalty and the unbounded penalty of [1]. We develop a computing algorithm for SPLS by modifying the nonlinear iterative partial least-squares (NIPALS) algorithm, and illustrate the method with an analysis of a cancer dataset. Through the numerical studies we find that our SPLS method generally performs better than the standard PLS and other existing methods in variable selection and prediction.
机译:偏最小二乘(PLS)方法专为预测问题而设计,其中预测变量的数量大于训练样本的数量。 PLS基于潜在成分,这些潜在成分是所有原始预测变量的线性组合,因此,无论它们的相关性如何,它都会自动采用所有预测变量。这可能会损害其性能,但也会使结果难以解释。在本文中,我们提出了一种新的稀疏PLS(SPLS)程序公式,以允许稀疏变量选择和降维。我们使用标准的L_(1)-罚分和[1]的无限罚分。我们通过修改非线性迭代偏最小二乘(NIPALS)算法来开发SPLS的计算算法,并通过对癌症数据集的分析来说明该方法。通过数值研究,我们发现我们的SPLS方法在变量选择和预测方面通常比标准PLS和其他现有方法表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号