首页> 外文期刊>Systems biomedicine. >Survival regression by data fusion
【24h】

Survival regression by data fusion

机译:通过数据融合进行生存回归

获取原文
获取原文并翻译 | 示例
       

摘要

Any knowledge discovery could in principal benefit from the fusion of directly or even indirectly related data sources. In this paper we explore whether data fusion by simultaneous matrix factorization could be adapted for survival regression. We propose a new method that jointly infers latent data factors from a number of heterogeneous data sets and estimates regression coefficients of a survival model. We have applied the method to CAMDA 2014 large-scale Cancer Genomes Challenge and modeled survival time as a function of gene, protein and miRNA expression data, and data on methylated and mutated regions. We find that both joint inference of data factors and regression coefficients and data fusion procedure are crucial for performance. Our approach is substantially more accurate than the baseline Aalen's additive model. Latent factors inferred by our approach could be mined further; for CAMDA challenge, we found that the most informative factors are related to known cancer processes.
机译:任何知识发现都可以从直接或什至间接相关的数据源的融合中受益。在本文中,我们探讨了通过同时矩阵分解进行的数据融合是否可以适用于生存回归。我们提出了一种新方法,该方法可以从多个异构数据集中共同推断潜在数据因子,并估算生存模型的回归系数。我们已将该方法应用于CAMDA 2014大规模癌症基因组挑战赛,并根据基因,蛋白质和miRNA表达数据以及甲基化和突变区域的数据对存活时间进行了建模。我们发现,数据因素和回归系数以及数据融合过程的联合推断对于性能至关重要。我们的方法比基线Aalen的加性模型要准确得多。我们的方法推断出的潜在因素可以进一步挖掘;对于CAMDA挑战,我们发现信息最多的因素与已知的癌症过程有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号