首页> 外文会议>IEEE International Conference on Big Data >Classification Models and Survival Analysis for Prostate Cancer Using RNA Sequencing and Clinical Data
【24h】

Classification Models and Survival Analysis for Prostate Cancer Using RNA Sequencing and Clinical Data

机译:RNA测序和临床数据对前列腺癌的分类模型和生存分析

获取原文

摘要

Early detection of cancer can significantly increase the chance of successful treatment. This research performs a study on early cancer detection for prostate cancer patients from whom cancer tissue was analyzed with Illumina Hi-Seq ribonucleic acid (RNA) Sequencing (RNA-Seq). Cancer relevant genes with the most significant correlations with the clinical outcome of the sample type (cancer on-cancer) and the overall survival (OS) were assessed. Traditional cancer diagnosis primarily depends on physicians’ experience to identify morphological abnormalities. Gene expression level data can assist physicians in detecting cancer cases at a much earlier stage and thus can significantly improve the potential of patient treatment. In this research, for the classification task, we applied machine learning and data mining approaches to detect cancer versus non-cancer based on gene expression data. Our goal was to detect cancer at the earliest stage. Besides, for the regression task, survival outcomes in prostate cancer patients were performed. Regression trees were built using cancer-sensitive genes along with clinical attribute ‘Gleason score’ as predictors, and the clinical variable ‘overall survival’ as the target variable. Knowledge in the form of rules is one of the vital tasks in data mining as it provides concise statements of easily understandable and potentially valuable information. For the classification model, we derived rules from a decision tree and interpreted these rules for cancer and non-cancer patients. For the regression or survival model, we generated rules for predicting or estimating the survival time of cancer patients. In this study, cancer-relevant genes were analyzed as predictors, although various genes may interact with genes currently known to contribute to cancer. These findings have implications for assessing gene-gene interactions and gene-environment interactions of prostate cancer as well as for other types of cancer.
机译:早期发现癌症可以显着增加成功治疗的机会。这项研究进行了一项针对前列腺癌患者的早期癌症检测的研究,通过Illumina Hi-Seq核糖核酸(RNA)测序(RNA-Seq)对癌症组织进行了分析。评估与样本类型的临床结果(癌症/非癌症)和总生存期(OS)最相关的癌症相关基因。传统的癌症诊断主要取决于医师的经验来识别形态异常。基因表达水平数据可以帮助医生在更早的阶段发现癌症,从而可以显着提高患者治疗的潜力。在这项研究中,对于分类任务,我们应用了机器学习和数据挖掘方法,基于基因表达数据来检测癌症与非癌症。我们的目标是尽早发现癌症。此外,对于回归任务,还执行了前列腺癌患者的生存结局。回归树是使用对癌症敏感的基因以及临床属性“格里森评分”作为预测变量,并使用临床变量“总体存活率”作为目标变量来构建的。规则形式的知识是数据挖掘中的重要任务之一,因为它提供了易于理解且可能有价值的信息的简洁陈述。对于分类模型,我们从决策树中得出规则,并为癌症和非癌症患者解释这些规则。对于回归或生存模型,我们生成了用于预测或估计癌症患者生存时间的规则。在这项研究中,分析了与癌症相关的基因作为预测因子,尽管各种基因可能与目前已知会导致癌症的基因相互作用。这些发现对于评估前列腺癌以及其他类型的癌症的基因-基因相互作用和基因-环境相互作用具有意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号