首页> 外文会议>International Conference of the Chilean Computer Science Society >Predicting cardiovascular disease by combining optimal feature selection methods with machine learning
【24h】

Predicting cardiovascular disease by combining optimal feature selection methods with machine learning

机译:通过机器学习结合最优特征选择方法来预测心血管疾病

获取原文

摘要

Cardiovascular Disease (CVD) is one of the main causes of death in the world. Early detection could prevent deaths associated to cardiac problems. In this work, we propose a methodology based on data pre-processing and Machine Learning (ML) techniques for predicting cardiovascular disease, by using the Sleep Heart Health Study (SHHS) dataset. First, the principal component analysis and lowest p-value logistic regression are applied to select optimal features which could be related to the CVD. Then, the selected features are used for training four ML algorithms: Naïve Bayes (NB), Feed Forward Neural Networks (NN), Support Vector Machine (SVM) and Random Forest (RF). A binary feature was considered as output of the proposed models and the SMOTE sampling has been used for balancing the training set. Among the proposed methods, NN provided the best accuracy (0.81) and AUC (0.76) outperforming the results obtained in other studies.
机译:心血管疾病(CVD)是世界上死亡的主要原因之一。早期检测可以防止与心脏病问题相关的死亡。在这项工作中,我们通过使用睡眠心脏健康研究(SHHS)数据集来提出基于数据预处理和机器学习(ML)技术的方法来预测心血管疾病的方法。首先,应用主成分分析和最低p值逻辑回归来选择可能与CVD相关的最佳特征。然后,所选择的特征用于训练四毫升算法:Naïve贝叶斯(NB),馈送前神经网络(NN),支持向量机(SVM)和随机林(RF)。二进制特征被认为是所提出的模型的输出,并且Smote采样已被用于平衡训练集。在所提出的方法中,NN提供了最佳精度(0.81)和AUC(0.76),优于其他研究中获得的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号