...
首页> 外文期刊>Scientific reports. >Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
【24h】

Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery

机译:将基因相互作用信息整合到加权随机生存森林方法中,以进行准确的生存预测和生存生物标记物发现

获取原文
           

摘要

Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival prediction problems with high-dimensional variables, but also is capable of identifying important variables according to variable importance automatically calculated within the algorithm. However, RSF still suffers from some problems such as limited predictive accuracy on independent datasets and limited biological interpretation of survival biomarkers. In this study, we integrated gene interaction information into a Reweighted RSF model (RRSF) to improve predictive accuracy and identify biologically meaningful survival markers. We applied RRSF to the prediction of patients with glioblastoma multiforme (GBM) and esophageal squamous cell carcinoma (ESCC). With a reconstructed global pathway network and an mRNA-lncRNA co-expression network as the prior gene interaction information, RRSF showed better overall predictive performance than RSF on three GBM and two ESCC datasets. In addition, RRSF identified a two-gene and three-lncRNA signature, which showed robust prognostic values and had high biological relevance to the development of GBM and ESCC, respectively.
机译:准确预测患者风险和识别生存生物标志物是生存分析中的两个重要任务。对于新兴的高通量基因表达数据,随机生存森林(RSF)越来越受到关注,因为它不仅在具有高维变量的生存预测问题上表现出出色的性能,而且能够根据变量识别重要变量在算法中自动计算的重要性。但是,RSF仍然存在一些问题,例如对独立数据集的预测准确性有限以及对生存生物标记物的生物学解释有限。在这项研究中,我们将基因相互作用信息整合到Reweighted RSF模型(RRSF)中,以提高预测准确性并识别具有生物学意义的生存标记。我们将RRSF应用于多形性胶质母细胞瘤(GBM)和食管鳞状细胞癌(ESCC)患者的预测。通过重建的全局通路网络和mRNA-lncRNA共表达网络作为先前的基因相互作用信息,RRSF在三个GBM和两个ESCC数据集上显示出比RSF更好的总体预测性能。此外,RRSF还鉴定了一个具有两个基因和三个IncRNA的信号,分别显示了强大的预后价值和与GBM和ESCC的生物学相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号