首页> 外文期刊>Cancer Informatics >Supervised Classification by Filter Methods and Recursive Feature Elimination Predicts Risk of Radiotherapy-Related Fatigue in Patients with Prostate Cancer
【24h】

Supervised Classification by Filter Methods and Recursive Feature Elimination Predicts Risk of Radiotherapy-Related Fatigue in Patients with Prostate Cancer

机译:通过过滤方法和递归特征消除进行监督分类可预测前列腺癌患者放射治疗相关的疲劳风险

获取原文
           

摘要

Background: Fatigue is a common side effect of cancer (CA) treatment. We used a novel analytical method to identify and validate a specific gene cluster that is predictive of fatigue risk in prostate cancer patients (PCP) treated with radiotherapy (RT).Methods: A total of 44 PCP were categorized into high-fatigue (HF) and low-fatigue (LF) cohorts based on fatigue score change from baseline to RT completion. Fold-change differential and Fisher's linear discriminant analyses (LDA) from 27 subjects with gene expression data at baseline and RT completion generated a reduced base of most discriminatory genes (learning phase). A nearest-neighbor risk (k-NN) prediction model was developed based on small-scale prognostic signatures. The predictive model validity was tested in another 17 subjects using baseline gene expression data (validation phase).Result: The model generated in the learning phase predicted HF classification at RT completion in the validation phase with 76.5% accuracy.Conclusion: The results suggest that a novel analytical algorithm that incorporates fold-change differential analysis, LDA, and a k-NN may have applicability in predicting regimen-related toxicity in cancer patients with high reliability, if we take into account these results and the limited amount of data that we had at disposal. It is expected that the accuracy will be improved by increasing data sampling in the learning phase.
机译:背景:疲劳是癌症(CA)治疗的常见副作用。我们使用一种新颖的分析方法来鉴定和验证一个特定的基因簇,该基因簇可以预测接受放射治疗(RT)的前列腺癌患者(PCP)的疲劳风险。方法:总共44个PCP被归类为高疲劳(HF)和低疲劳(LF)队列基于疲劳评分从基线到RT完成的变化。来自基线和RT完成时的基因表达数据的27位受试者的倍数变化差异分析和Fisher线性判别分析(LDA)导致大多数歧视性基因(学习阶段)的碱基减少。基于小规模的预后特征开发了最近邻风险(k-NN)预测模型。使用基线基因表达数据(验证阶段)在另外17名受试者中测试了预测模型的有效性。结果:在学习阶段生成的模型预测了验证阶段RT完成时的HF分类,准确度为76.5%。结论:结果表明:如果我们考虑到这些结果和有限的数据量,那么一种结合了倍数变化差异分析,LDA和k-NN的新颖分析算法可能在预测癌症患者中与方案相关的毒性方面具有较高的适用性。随时可以使用。期望通过在学习阶段增加数据采样来提高准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号