首页> 外文会议>International conference on healthcare science and engineering >Cardiovascular Disease Risk Prediction Based on Random Forest
【24h】

Cardiovascular Disease Risk Prediction Based on Random Forest

机译:基于随机森林的心血管疾病风险预测

获取原文

摘要

Cardiovascular disease (CVD) has caused great harm to human life and health and is one of the most serious diseases in the word. Traditional CVD prediction model to use more rigorous mathematical model approach, the inclusion factor, and the data format has high requirements. However, some medical data have a large number of missing attribute values, and these methods can not be fully adapted. To solve this problem, a CVD prediction model using machine learning algorithm is proposed. By further analyzing the feature contribution of the sample data, NB, SVM, DT, LR, RBF, and RF models are, respectively, constructed to predict cardiovascular diseases. Experimental results show that with the increase in data set capacity, even if it contains a lot of missing data, the effectiveness and capabilities of the proposed RF algorithm on the 2-category data set is superior to the above other machine learning algorithms mentioned. It is sensitivity is 88.0%, specificity is 87.6%, precision is 88.0%, and AUC value is 94.7%, respectively.
机译:心血管疾病(CVD)对人类的生命和健康造成了巨大伤害,并且是世界上最严重的疾病之一。传统的CVD预测模型要使用更严格的数学模型方法,对包含因子和数据格式有很高的要求。但是,某些医学数据缺少大量的属性值,因此这些方法无法完全适应。为了解决这个问题,提出了一种使用机器学习算法的CVD预测模型。通过进一步分析样本数据的特征贡献,分别构建了NB,SVM,DT,LR,RBF和RF模型来预测心血管疾病。实验结果表明,随着数据集容量的增加,即使其中包含大量丢失的数据,在2类数据集上提出的RF算法的有效性和功能也优于上述其他机器学习算法。其灵敏度分别为88.0%,特异性为87.6%,精确度为88.0%和AUC值为94.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号