首页> 美国卫生研究院文献>Statistical Applications in Genetics and Molecular Biology >Sparse Partial Least Squares Classification for High Dimensional Data
【2h】

Sparse Partial Least Squares Classification for High Dimensional Data

机译:高维数据的稀疏偏最小二乘分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Partial least squares (PLS) is a well known dimension reduction method which has been recently adapted for high dimensional classification problems in genome biology. We develop sparse versions of the recently proposed two PLS-based classification methods using sparse partial least squares (SPLS). These sparse versions aim to achieve variable selection and dimension reduction simultaneously. We consider both binary and multicategory classification. We provide analytical and simulation-based insights about the variable selection properties of these approaches and benchmark them on well known publicly available datasets that involve tumor classification with high dimensional gene expression data. We show that incorporation of SPLS into a generalized linear model (GLM) framework provides higher sensitivity in variable selection for multicategory classification with unbalanced sample sizes between classes. As the sample size increases, the two-stage approach provides comparable sensitivity with better specificity in variable selection. In binary classification and multicategory classification with balanced sample sizes, the two-stage approach provides comparable variable selection and prediction accuracy as the GLM version and is computationally more efficient.
机译:偏最小二乘(PLS)是一种众所周知的降维方法,最近已针对基因组生物学中的高维分类问题进行了调整。我们使用稀疏偏最小二乘(SPLS)开发了最近提出的两种基于PLS的分类方法的稀疏版本。这些稀疏版本旨在同时实现变量选择和降维。我们考虑二进制和多类别分类。我们提供有关这些方法的变量选择属性的基于分析和模拟的见解,并在涉及肿瘤分类和高维基因表达数据的众所周知的公开数据集上对它们进行基准测试。我们表明,将SPLS合并到广义线性模型(GLM)框架中,可以在类别之间样本量不平衡的多类别分类的变量选择中提供更高的敏感性。随着样本量的增加,两步法可在可变选择中提供可比的灵敏度和更好的特异性。在具有均衡样本量的二进制分类和多类别分类中,两阶段方法可提供与GLM版本相当的变量选择和预测精度,并且计算效率更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号