首页> 外文会议>International Conference on Intelligence Science and Big Data Engineering >Evaluating Diagnostic Performance of Machine Learning Algorithms on Breast Cancer
【24h】

Evaluating Diagnostic Performance of Machine Learning Algorithms on Breast Cancer

机译:评估机器学习算法对乳腺癌的诊断性能

获取原文
获取外文期刊封面目录资料

摘要

This paper focuses on comparing performance of six data mining methods namely: Bagging, SVM (SMO), Decorate, C4.5 (J48), Naive Bayes and IBK in analyzing Wisconsin Breast Cancer (WBC) datasets. The datasets were obtained from the UCI Machine Learning Repository and comprises of 699 instances and 11 attributes. A confusion matrix, based on a 10-fold cross validation technique was used in our experiment to provide the basis for measuring the accuracy of each algorithm. We introduce an idea of combining the algorithms at classification level to obtain the most ideal multi-classifier approach for the WBC data set. Waikato Environment Knowledge Explorer (WEKA), open source data mining software was used for the experimental analysis. The experimental results show that SMO offers the best accuracy (97 %) among the six algorithms, while merging SMO, Naive Bayes, J48 and IBK offers the best accuracy (97.3%) on the data set.
机译:本文侧重于比较六种数据采矿方法的性能即:袋装,SVM(SMO),装饰,C4.5(J48),天真贝叶斯和IBK在分析威斯康星州乳腺癌(WBC)数据集。数据集是从UCI机器学习存储库获得的,并包含699个实例和11个属性。在我们的实验中使用了一种基于10倍交叉验证技术的混淆矩阵,为测量每种算法的准确性提供基础。我们介绍了将算法结合在分类级别中的算法,以获得WBC数据集的最理想的多分类方法。 Waikato环境知识资源管理器(Weka),开源数据挖掘软件用于实验分析。实验结果表明,SMO在六种算法中提供了最佳准确性(97%),同时合并了SMO,Naive Bayes,J48和IBK在数据集上提供最佳准确性(97.3%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号