首页> 外文期刊>Diabetes technology & therapeutics >Application of Machine Learning to Identify Clustering of Cardiometabolic Risk Factors in US Adults
【24h】

Application of Machine Learning to Identify Clustering of Cardiometabolic Risk Factors in US Adults

机译:机器学习在美国成年人中识别心脏素危险因素集群的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Aims: The aim of this study is to compare some machine learning methods with traditional statistical parametric analyses using logistic regression to investigate the relationship of risk factors for diabetes and cardiovascular (cardiometabolic risk) for U.S. adults using a cross-sectional data from participants in a wellness improvement program. Methods: Logistic regression was used to find the relationship between individual risk factors, predictor and cardiometabolic risk. Supervised machine learning methods were used to predict risk and produce a ranking of variables' importance. A clustering method was used to identify subpopulations of interest. Predictors were divided into those that are nonmodifiable and those that are modifiable. Results: The population comprised 217,254 adults of whom 8.1% had diabetes. Using logistic regression, six variables were identified to be negatively related and eleven were positively related to cardiometabolic risk. Three supervised machine learning classifiers (random forest, gradient boosting, and bagging) were applied with average AUC to be 0.806. Each classifier also produced a ranking of variables' importance. Four subgroups were identified with a k-medoid clustering algorithm, which were mainly distinguished by gender and diabetes status. Conclusions: The study illustrates that machine learning is an important addition to traditional logistic regression in terms of identifying important cardiometabolic risk factors and ranking their importance and the potential for interventions based on lifestyle and medications at an individual level.
机译:目的:本研究的目的是使用逻辑回归比较传统统计参数分析的一些机器学习方法,以研究美国成年人使用来自参与者的横截面数据的糖尿病和心血管风险的风险因素的关系健康改善计划。方法:使用逻辑回归来寻找个人风险因素,预测因子和心脏异常风险之间的关系。监督机器学习方法用于预测风险并产生变量重要性的排名。使用聚类方法来识别感兴趣的群体。预测因子分为那些不可替代的人和可修改的那些。结果:人口包含217,254名成人,其中8.1%有糖尿病。使用逻辑回归,识别六个变量与否定相关,11个与心脏异常风险正相关。三个监督机器学习分类器(随机森林,梯度提升和袋装)施用平均AUC至0.806。每个分类器还产生了变量的重要性。用K-yemoid聚类算法鉴定了四个亚组,其主要以性别和糖尿病状态分类。结论:该研究表明,根据识别重要的心细镜危险因素并根据生活方式和个人水平的生活方式排名和排序,对传统的逻辑回归是传统逻辑回归的重要补充。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号