首页> 外文会议>Mediterranean Conference on Medical and Biological Engineering and Computing >Overall Survival Prediction for Women Breast Cancer Using Ensemble Methods and Incomplete Clinical Data
【24h】

Overall Survival Prediction for Women Breast Cancer Using Ensemble Methods and Incomplete Clinical Data

机译:使用集合方法和不完全临床数据的女性乳腺癌的整体生存预测

获取原文

摘要

Breast Cancer is the most common type of cancer in women worldwide. In spite of this fact, there are insufficient studies that, using data mining techniques, are capable of helping medical doctors in their daily practice. This paper presents a comparative study of three ensemble methods (TreeBagger, LPBoost and Subspace) using a clinical dataset with 25% missing values to predict the overall survival of women with breast cancer. To complete the absent values, the k-nearest neighbor (k-NN) algorithm was used with four distinct neighbor values, trying to determine the best one for this particular scenario. Tests were performed for each of the three ensemble methods and each k-NN configuration, and their performance compared using a Friedman test. Despite the complexity of this challenge, the produced results are promising and the best algorithm configuration (TreeBagger using 3 neighbors) presents a prediction accuracy of 73%.
机译:乳腺癌是全球女性中最常见的癌症类型。尽管有了这一事实,但研究不足,使用数据采矿技术,能够帮助医生在日常练习中。本文介绍了三个集合方法(TreeBagger,LPBoost和子空间)的比较研究,使用具有25%缺失值的临床数据集来预测患有乳腺癌的妇女的整体存活率。为了完成不存在值,k-incelte邻居(k-nn)算法与四个不同的邻居值一起使用,尝试为此特定方案确定最佳的邻居值。对三种集合方法和每个K-NN配置中的每一个进行测试,以及使用弗里德曼测试比较的性能。尽管这一挑战的复杂性,所产生的结果是有前途的,并且最好的算法配置(使用3个邻居的TreeBagger)具有73%的预测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号