首页> 外文期刊>Computational Intelligence >A Multicriteria Weighted Vote-Based Classifier Ensemble for Heart Disease Prediction
【24h】

A Multicriteria Weighted Vote-Based Classifier Ensemble for Heart Disease Prediction

机译:用于心脏病预测的基于多准则加权投票的分类器集合。

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The availability of a large amount of medical data leads to the need of intelligent disease prediction and analysis tools to extract hidden information. A large number of data mining and statistical analysis tools are used for disease prediction. Single data-mining techniques show acceptable level of accuracy for heart disease diagnosis. This article focuses on prediction and analysis of heart disease using weighted vote-based classifier ensemble technique. The proposed ensemble model overcomes the limitations of conventional data-mining techniques by employing the ensemble of five heterogeneous classifiers: naive Bayes, decision tree based on Gini index, decision tree based on information gain, instance-based learner, and support vector machines. We have used five benchmark heart disease data sets taken from UCI repository. Each data set contains different set of feature space that ultimately leads to the prediction of heart disease. The effectiveness of proposed ensemble classifier is investigated by comparing the performance with different researchers' techniques. Tenfold cross-validation is used to handle the class imbalance problem. Moreover, confusion matrices and analysis of variance statistics are used to show the prediction results of all classifiers. The experimental results verify that the proposed ensemble classifier can deal with all types of attributes and it has achieved the high diagnosis accuracy of 87.37%, sensitivity of 93.75%, specificity of 92.86%, and F-measure of 82.17%. The F-ratio higher than the F-critical and p-value less than 0.01 for a 95% confidence interval indicate that the results are statistically significant for all the data sets.
机译:大量医疗数据的可用性导致需要智能的疾病预测和分析工具来提取隐藏信息。大量的数据挖掘和统计分析工具用于疾病预测。单一数据挖掘技术显示出可用于心脏病诊断的准确度。本文重点介绍基于加权投票的分类器集成技术对心脏病的预测和分析。所提出的集成模型通过采用五个异构分类器的集成克服了常规数据挖掘技术的局限性:朴素贝叶斯,基于Gini索引的决策树,基于信息增益的决策树,基于实例的学习器和支持向量机。我们已经使用了五个来自UCI资料库的基准心脏病数据集。每个数据集都包含不同的特征空间集,最终导致了心脏病的预测。通过将性能与不同研究人员的技术进行比较,研究了建议的集成分类器的有效性。十倍交叉验证用于处理类不平衡问题。此外,使用混淆矩阵和方差统计分析来显示所有分类器的预测结果。实验结果表明,提出的集成分类器能够处理所有类型的属性,具有较高的诊断准确率87.37%,灵敏度93.75%,特异性92.86%,F度量82.17%。对于95%的置信区间,F比率高于F临界且p值小于0.01,表明对于所有数据集,结果在统计上均具有显着意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号