首页> 外文期刊>Measurement >CWV-BANN-SVM ensemble learning classifier for an accurate diagnosis of breast cancer
【24h】

CWV-BANN-SVM ensemble learning classifier for an accurate diagnosis of breast cancer

机译:CWV-BANN-SVM集合学习分类器,用于准确诊断乳腺癌

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a new data mining technique for an accurate prediction of breast cancer (BC), which is one of the major mortality causes among women around the globe. The main objective of our study is to expand an automatic expert system (ES) to provide an accurate diagnosis of BC. Both, Support Vector Machines (SVMs) and Artificial Neural Networks (ANNs) were applied to analyze BC data. The well-known Wisconsin Breast Cancer Dataset (WBCD), available in the UCI repository, was examined in our study. We first tested the SVM algorithm using various values of the C, epsilon and gamma parameters. As a result of the first experiment, we were able to observe that the adjustment of these regularization parameters can greatly improve the performance of the traditional SVM algorithm applied for BC detection. The highest obtained accuracy at the first step was 99.71%. Then, we performed a new BC detection approach based on two ensemble learning techniques: the confidence-weighted voting method and the boosting ensemble technique. Our model, called CWV-BANNSVM, combines boosting ANNs (BANN) and two SVMs, using optimal parameters selected during the first experiment. The performance of the applied methods was evaluated using several popular metrics, such as specificity, sensitivity, precision, FPR, FNR, F-1 score, AUC, Gini and accuracy. The proposed CWV-BANNSVM model was able to improve the performance of the traditional machine learning algorithms applied to BC detection, reaching the accuracy of 100%. To overcome the overfitting issue, we determined and used some appropriate parameter values of polynomial SVM. Our comparison with the existing studies dedicated to BC prediction suggests that the proposed CWV-BANN-SVM model provides one of the best prediction performances overall. (C) 2019 Elsevier Ltd. All rights reserved.
机译:本文提出了一种新的数据挖掘技术,用于准确预测乳腺癌(BC),这是全球妇女的主要死亡率之一。我们研究的主要目的是扩展自动专家系统,以提供对BC的准确诊断。两者都是支持向量机(SVM)和人工神经网络(ANNS)被应用于分析BC数据。在我们的研究中,在UCI存储库中提供了众所周知的威斯康星乳腺癌数据集(WBCD),在我们的研究中进行了审查。我们首先使用C,epsilon和伽马参数的各种值测试SVM算法。由于第一次实验,我们能够观察到这些正则化参数的调整可以大大提高传统SVM算法的性能,适用于BC检测。第一步中获得的最高精度为99.71%。然后,我们基于两种集合学习技术进行了新的BC检测方法:置信加权投票方法和升压集合技术。我们的模型,称为CWV-Bannsvm,使用在第一次实验期间选择的最佳参数来组合升压ANNS(禁区)和两个SVM。使用几种流行的指标评估所应用方法的性能,例如特异性,灵敏度,精度,FPR,FNR,F-1得分,AUC,GINI和准确性。所提出的CWV-Bannsvm模型能够提高应用于BC检测的传统机器学习算法的性能,达到100%的准确性。为了克服过度装备问题,我们确定并使用了多项式SVM的适当参数值。我们与专用于BC预测的现有研究的比较表明,所提出的CWV-BANNS-SVM模型总体提供了最佳预测性能之一。 (c)2019年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号