首页> 美国卫生研究院文献>Cancer Informatics >Network-Constrained Group Lasso for High-Dimensional Multinomial Classification with Application to Cancer Subtype Prediction
【2h】

Network-Constrained Group Lasso for High-Dimensional Multinomial Classification with Application to Cancer Subtype Prediction

机译:高维多项式分类的网络受限群套索及其在癌症亚型预测中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

Classic multinomial logit model, commonly used in multiclass regression problem, is restricted to few predictors and does not take into account the relationship among variables. It has limited use for genomic data, where the number of genomic features far exceeds the sample size. Genomic features such as gene expressions are usually related by an underlying biological network. Efficient use of the network information is important to improve classification performance as well as the biological interpretability. We proposed a multinomial logit model that is capable of addressing both the high dimensionality of predictors and the underlying network information. Group lasso was used to induce model sparsity, and a network-constraint was imposed to induce the smoothness of the coefficients with respect to the underlying network structure. To deal with the non-smoothness of the objective function in optimization, we developed a proximal gradient algorithm for efficient computation. The proposed model was compared to models with no prior structure information in both simulations and a problem of cancer subtype prediction with real TCGA (the cancer genome atlas) gene expression data. The network-constrained mode outperformed the traditional ones in both cases.
机译:经典的多项式logit模型通常用于多类回归问题,限于少数预测变量,并且未考虑变量之间的关系。它在基因组特征数量远远超过样本数量的基因组数据中用途有限。基因表达等基因组特征通常与潜在的生物学网络相关。网络信息的有效利用对提高分类性能和生物学解释能力很重要。我们提出了一种多项式logit模型,该模型能够解决预测变量的高维和底层网络信息的问题。套索组用于引起模型稀疏性,并且施加网络约束以引起相对于底层网络结构的系数的平滑度。为了解决优化过程中目标函数的非平滑性,我们开发了一种用于高效计算的近端梯度算法。在模拟中,将提出的模型与没有先验结构信息的模型进行了比较,并使用真实的TCGA(癌症基因组图谱)基因表达数据预测了癌症亚型。在两种情况下,网络约束模式均优于传统模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号