首页> 外文期刊>Artificial intelligence in medicine >Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
【24h】

Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence

机译:集成数据挖掘分类技术和集成学习以识别危险因素并诊断卵巢癌复发

获取原文
获取原文并翻译 | 示例
           

摘要

Ovarian cancer is the second leading cause of deaths among gynecologic cancers in the world. Approximately 90% of women with ovarian cancer reported having symptoms long before a diagnosis was made. Literature shows that recurrence should be predicted with regard to their personal risk factors and the clinical symptoms of this devastating cancer. In this study, ensemble learning and five data mining approaches, including support vector machine (SVM), C5.0, extreme learning machine (ELM), multivariate adaptive regression splines (MARS), and random forest (RF), were integrated to rank the importance of risk factors and diagnose the recurrence of ovarian cancer. The medical records and pathologic status were extracted from the Chung Shan Medical University Hospital Tumor Registry. Experimental results illustrated that the integrated C5.0 model is a superior approach in predicting the recurrence of ovarian cancer. Moreover, the classification accuracies of C5.0, ELM, MARS, RF, and SVM indeed increased after using the selected important risk factors as predictors. Our findings suggest that The International Federation of Gynecology and Obstetrics (FIGO), Pathologic M, Age, and Pathologic T were the four most critical risk factors for ovarian cancer recurrence. In summary, the above information can support the important influence of personality and clinical symptom representations on all phases of guide interventions, with the complexities of multiple symptoms associated with ovarian cancer in all phases of the recurrent trajectory. (C) 2017 Elsevier B.V. All rights reserved.
机译:卵巢癌是全球妇科癌症中第二大死亡原因。约90%的卵巢癌女性在诊断之前就已出现症状。文献表明,应根据其个人危险因素和这种毁灭性癌症的临床症状预测复发。在这项研究中,集成学习和五种数据挖掘方法,包括支持向量机(SVM),C5.0,极限学习机(ELM),多元自适应回归样条(MARS)和随机森林(RF),进行排名危险因素的重要性和诊断卵巢癌的复发。病历和病理状况摘自中山医科大学附属医院肿瘤登记处。实验结果表明,集成的C5.0模型是预测卵巢癌复发的一种较好方法。此外,在使用选定的重要风险因素作为预测指标后,C5.0,ELM,MARS,RF和SVM的分类准确性确实有所提高。我们的发现表明,国际妇产科联合会(FIGO),病理M,年龄和病理T是卵巢癌复发的四个最关键的危险因素。总之,以上信息可以支持人格和临床症状表征对指导干预措施所有阶段的重要影响,以及在复发轨迹的所有阶段中与卵巢癌相关的多种症状的复杂性。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号