首页> 外文会议>2nd International Conference on Intelligent Informatics and Biomedical Sciences >A hybrid of data mining and ensemble learning forecasting for recurrent ovarian cancer
【24h】

A hybrid of data mining and ensemble learning forecasting for recurrent ovarian cancer

机译:数据挖掘与整体学习预测相结合的复发性卵巢癌

获取原文
获取原文并翻译 | 示例

摘要

This study applied advanced machine learning techniques and combined with ensemble learning, widely considered as the most successful method to produce objective to an inferential problem of recurrent ovarian cancer. In this study, five machine learning approaches including SVM(support vector machine), C5.0, ELM(extreme learning machine), MARS(Multivariate Adaptive Regression Splines) and RF(Random Forests) were considered to find important risk factors and to predict the recurrence-proneness for ovarian cancer. We use ensemble learning to improve the defect of classification accuracy used normal machine learning: first, selecting important risk factors by ensemble learning, then, using the five machine learning approaches to analyze again. The medical records and pathology were accessible by the Chung Shan Medical University Hospital Tumor Registry. The existing literature on recurrent ovarian cancer reveals that factors include Age, Histology, Grade, Pathologic T, Pathologic N, Pathologic M, Pathologic Stage, The International Federation of Gynecology and Obstetrics (FIGO), Surgical Margins, Performance status, CA125, Operation Optimal Debulking, Chemotherapy Guideline. There are totally 987 patients in the data set. In our study, C5.0 is the superior approach in predicting recurrence of ovarian cancer. Moreover, the classification accuracy of C5.0, MARS, RF and SVM indeed increases after using ensemble learning. Particularly the classification accuracy of C5.0 obviously improves by using ensemble learning with hybrid scheme.
机译:这项研究应用了先进的机器学习技术,并与整体学习相结合,被广泛认为是针对复发性卵巢癌的推断问题产生目标的最成功方法。在这项研究中,考虑了五种机器学习方法,包括SVM(支持向量机),C5.0,ELM(极限学习机),MARS(多元自适应回归样条)和RF(随机森林),以发现重要的风险因素并进行预测卵巢癌的复发倾向。我们使用集合学习来改善使用常规机器学习的分类准确性的缺陷:首先,通过集合学习选择重要的风险因素,然后,使用五种机器学习方法再次进行分析。中山医科大学附属医院肿瘤登记处可提供病历和病理信息。现有的关于复发性卵巢癌的文献表明,因素包括年龄,组织学,等级,病理性T,病理性N,病理性M,病理分期,国际妇产科联合会(FIGO),手术余量,表现状态,CA125,最佳手术实体化,化疗指南。数据集中共有987位患者。在我们的研究中,C5.0是预测卵巢癌复发的更好方法。此外,使用集成学习后,C5.0,MARS,RF和SVM的分类准确率确实提高了。特别是通过结合使用集成学习和混合方案,C5.0的分类精度明显提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号