首页> 外文期刊>Bioinformatics >Gradient lasso for Cox proportional hazards model
【24h】

Gradient lasso for Cox proportional hazards model

机译:Cox比例风险模型的梯度套索

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: There has been an increasing interest in expressing a survival phenotype (e.g. time to cancer recurrence or death) or its distribution in terms of a subset of the expression data of a subset of genes. Due to high dimensionality of gene expression data, however, there is a serious problem of collinearity in fitting a prediction model, e.g. Cox's proportional hazards model. To avoid the collinearity problem, several methods based on penalized Cox proportional hazards models have been proposed. However, those methods suffer from severe computational problems, such as slow or even failed convergence, because of high-dimensional matrix inversions required for model fitting. We propose to implement the penalized Cox regression with a lasso penalty via the gradient lasso algorithm that yields faster convergence to the global optimum than do other algorithms. Moreover the gradient lasso algorithm is guaranteed to converge to the optimum under mild regularity conditions. Hence, our gradient lasso algorithm can be a useful tool in developing a prediction model based on high-dimensional covariates including gene expression data.Results: Results from simulation studies showed that the prediction model by gradient lasso recovers the prognostic genes. Also results from diffuse large B-cell lymphoma datasets and Norway/Stanford breast cancer dataset indicate that our method is very competitive compared with popular existing methods by Park and Hastie and Goeman in its computational time, prediction and selectivity.
机译:动机:人们越来越感兴趣地表达生存表型(例如癌症复发或死亡的时间)或其以子集的一部分基因表达数据的形式表达。然而,由于基因表达数据的高维性,在拟合预测模型(例如预测模型)中存在共线性的严重问题。考克斯比例风险模型。为了避免共线性问题,提出了几种基于惩罚Cox比例风险模型的方法。但是,由于模型拟合所需的高维矩阵求逆,这些方法存在严重的计算问题,例如收敛缓慢甚至失败。我们建议通过梯度套索算法使用套索罚分实现惩罚式Cox回归,该算法比其他算法能更快地收敛到全局最优值。此外,梯度套索算法可确保在适度规则性条件下收敛至最佳值。因此,我们的梯度套索算法可以作为开发基于包含基因表达数据的高维协变量的预测模型的有用工具。结果:仿真研究的结果表明,梯度套索的预测模型可以恢复预后基因。弥散性大型B细胞淋巴瘤数据集和挪威/斯坦福乳腺癌数据集的结果也表明,与Park和Hastie和Goeman的现有流行方法相比,我们的方法在计算时间,预测和选择性方面具有很大的竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号