首页> 美国卫生研究院文献>Computational and Mathematical Methods in Medicine >IPF-LASSO: Integrative L1-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data
【2h】

IPF-LASSO: Integrative L1-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data

机译:IPF-LASSO:集成L基于多组数据的带罚因子的1-惩罚回归预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

As modern biotechnologies advance, it has become increasingly frequent that different modalities of high-dimensional molecular data (termed “omics” data in this paper), such as gene expression, methylation, and copy number, are collected from the same patient cohort to predict the clinical outcome. While prediction based on omics data has been widely studied in the last fifteen years, little has been done in the statistical literature on the integration of multiple omics modalities to select a subset of variables for prediction, which is a critical task in personalized medicine. In this paper, we propose a simple penalized regression method to address this problem by assigning different penalty factors to different data modalities for feature selection and prediction. The penalty factors can be chosen in a fully data-driven fashion by cross-validation or by taking practical considerations into account. In simulation studies, we compare the prediction performance of our approach, called IPF-LASSO (Integrative LASSO with Penalty Factors) and implemented in the R package ipflasso, with the standard LASSO and sparse group LASSO. The use of IPF-LASSO is also illustrated through applications to two real-life cancer datasets. All data and codes are available on the companion website to ensure reproducibility.
机译:随着现代生物技术的发展,越来越多的人从同一患者队列中收集不同形式的高维分子数据(在本文中称为“组学”数据),例如基因表达,甲基化和拷贝数,以进行预测临床结果。尽管在过去的15年中已经广泛研究了基于组学数据的预测,但是在统计文献中很少有关于集成多种组学方法以选择变量子集进行预测的研究,这在个性化医学中是一项至关重要的任务。在本文中,我们提出了一种简单的惩罚回归方法,通过为特征选择和预测的不同数据模式分配不同的惩罚因子来解决此问题。可以通过交叉验证或通过考虑实际因素,以完全数据驱动的方式选择惩罚因子。在模拟研究中,我们将我们的方法IPF-LASSO(具有惩罚因子的综合LASSO)和R包 ipflasso 中实现的预测性能与标准LASSO和稀疏组LASSO进行了比较。 IPF-LASSO的使用还通过应用于两个现实癌症数据集进行了说明。所有数据和代码都可以在随附的网站上找到,以确保可重复性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号