首页> 外文期刊>BMC Medical Informatics and Decision Making >The prediction of asymptomatic carotid atherosclerosis with electronic health records: a comparative study of six machine learning models
【24h】

The prediction of asymptomatic carotid atherosclerosis with electronic health records: a comparative study of six machine learning models

机译:电子健康记录的无症状颈动脉粥样硬化的预测:六种机器学习模型的比较研究

获取原文
       

摘要

Screening carotid B-mode ultrasonography is a frequently used method to detect subjects with carotid atherosclerosis (CAS). Due to the asymptomatic progression of most CAS patients, early identification is challenging for clinicians, and it may trigger ischemic stroke. Recently, machine learning has shown a strong ability to classify data and a potential for prediction in the medical field. The combined use of machine learning and the electronic health records of patients could provide clinicians with a more convenient and precise method to identify asymptomatic CAS. Retrospective cohort study using routine clinical data of medical check-up subjects from April 19, 2010 to November 15, 2019. Six machine learning models (logistic regression [LR], random forest [RF], decision tree [DT], eXtreme Gradient Boosting [XGB], Gaussian Na?ve Bayes [GNB], and K-Nearest Neighbour [KNN]) were used to predict asymptomatic CAS and compared their predictability in terms of the area under the receiver operating characteristic curve (AUCROC), accuracy (ACC), and F1 score (F1). Of the 18,441 subjects, 6553 were diagnosed with asymptomatic CAS. Compared to DT (AUCROC 0.628, ACC 65.4%, and F1 52.5%), the other five models improved prediction: KNN? ?7.6% (0.704, 68.8%, and 50.9%, respectively), GNB? ?12.5% (0.753, 67.0%, and 46.8%, respectively), XGB? ?16.0% (0.788, 73.4%, and 55.7%, respectively), RF? ?16.6% (0.794, 74.5%, and 56.8%, respectively) and LR? ?18.1% (0.809, 74.7%, and 59.9%, respectively). The highest achieving model, LR predicted 1045/1966 cases (sensitivity 53.2%) and 3088/3566 non-cases (specificity 86.6%). A tenfold cross-validation scheme further verified the predictive ability of the LR. Among machine learning models, LR showed optimal performance in predicting asymptomatic CAS. Our findings set the stage for an early automatic alarming system, allowing a more precise allocation of CAS prevention measures to individuals probably to benefit most.
机译:筛选颈动脉B模式超声检查是一种常用的方法,用于检测颈动脉粥样硬化(CAS)的受试者。由于大多数CAS患者的无症状进展,早期鉴定对临床医生有挑战性,并且可能引发缺血性脑卒中。最近,机器学习表明了对数据和医疗领域预测的潜力进行了强有力的能力。机器学习的联合使用和患者的电子健康记录可以为临床医生提供更方便和精确的方法来识别无症状的CA。回顾性队列研究使用2010年4月19日至2019年11月19日至11月15日的医学检查科目的常规临床研究。六台机器学习模型(Logistic回归[LR],随机森林[RF],决策树[DT],极端梯度提升[XGB],高斯Na'Ve贝贝贝雷斯[GNB]和K最近邻邻[KNN])用于预测无误的CA,并在接收器操作特征曲线(AUCROC)下的区域方面比较其可预测性,精度(ACC )和F1得分(F1)。在18,441个受试者中,用无症状CAS被诊断出6553。与DT(Aucroc 0.628,ACC 65.4%和F1 52.5%)相比,其他五种模型改善预测:KNN? ?7.6%(0.704,68.8%和50.9%),GNB? ?12.5%(0.753,67.0%,分别为46.8%),XGB? ?16.0%(0.788,73.4%,分别为55.7%),RF? ?16.6%(0.794,74.5%和56.8%)和LR? ?18.1%(分别为0.809,74.7%和59.9%)。最高达到的模型,LR预测1045/1966(敏感性53.2%)和3088/3566非案例(特异性86.6%)。十倍交叉验证方案进一步验证了LR的预测能力。在机器学习模型中,LR在预测无症状CA时显示出最佳性能。我们的调查结果为早期自动报警系统设置了舞台,允许更精确地将CAS预防措施分配给个体,可能会受益最多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号