首页> 外文期刊>BMC Bioinformatics >Prediction of heart disease and classifiers’ sensitivity analysis
【24h】

Prediction of heart disease and classifiers’ sensitivity analysis

机译:预测心脏病和分类器的敏感性分析

获取原文
           

摘要

Heart disease (HD) is one of the most common diseases nowadays, and an early diagnosis of such a disease is a crucial task for many health care providers to prevent their patients for such a disease and to save lives. In this paper, a comparative analysis of different classifiers was performed for the classification of the Heart Disease dataset in order to correctly classify and or predict HD cases with minimal attributes. The set contains 76 attributes including the class attribute, for 1025 patients collected from Cleveland, Hungary, Switzerland, and Long Beach, but in this paper, only a subset of 14 attributes are used, and each attribute has a given set value. The algorithms used K- Nearest Neighbor (K-NN), Naive Bayes, Decision tree J48, JRip, SVM, Adaboost, Stochastic Gradient Decent (SGD) and Decision Table (DT) classifiers to show the performance of the selected classifications algorithms to best classify, and or predict, the HD cases. It was shown that using different classification algorithms for the classification of the HD dataset gives very promising results in term of the classification accuracy for the K-NN (K?=?1), Decision tree J48 and JRip classifiers with accuracy of classification of 99.7073, 98.0488 and 97.2683% respectively. A feature extraction method was performed using Classifier Subset Evaluator on the HD dataset, and results show enhanced performance in term of the classification accuracy for K-NN (N?=?1) and Decision Table classifiers to 100 and 93.8537% respectively after using the selected features by only applying a combination of up to 4 attributes instead of 13 attributes for the predication of the HD cases. Different classifiers were used and compared to classify the HD dataset, and we concluded the benefit of having a reliable feature selection method for HD disease prediction with using minimal number of attributes instead of having to consider all available ones.
机译:心脏病(HD)是如今最常见的疾病之一,对这种疾病的早期诊断是许多医疗服务提供者预防其患者这种疾病并拯救生命的关键任务。在本文中,对核心疾病数据集的分类进行了对不同分类器的比较分析,以便在最小的属性中正确分类和或预测HD案例。该集合包含76个属性,包括类属性,对于从克利夫兰,匈牙利,瑞士和长滩收集的1025名患者,但在本文中,仅使用了14个属性的子集,每个属性都有一个给定的集合值。算法使用k-最近邻(k-nn),朴素贝叶斯,决策树J48,Jrip,SVM,Adaboost,随机渐变体面(SGD)和决策表(DT)分类器,以显示所选分类算法的性能分类和或预测HD案例。结果表明,使用不同的分类算法,用于HD数据集的分类,在K-NN(k?=Δ1),决策树J48和JRIP分类器的分类精度,决策树J48和JRIP分类器的术语具有非常有前途的结果,其准确性为99.7073 ,98.0488和97.2683%。使用CLASS子集评估器在HD DataSet上执行特征提取方法,结果显示了在使用后分别的K-NN(N?=?1)和决策表分类器的分类精度的增强性能。在使用后分别为100和93.8537%仅应用多达4个属性的组合而不是13个属性的所选功能,以预测HD案例。使用不同的分类器并进行比较以分类HD数据集,我们得出了利用使用最小数量的HD疾病预测具有可靠特征选择方法的益处,而不是必须考虑所有可用的属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号