...
首页> 外文期刊>SSM - Population Health >Machine learning approaches to the social determinants of health in the health and retirement study
【24h】

Machine learning approaches to the social determinants of health in the health and retirement study

机译:在健康和退休研究中机器学习方法对健康的社会决定因素的影响

获取原文

摘要

Background Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how machine learning may add to our understanding of social determinants of health using data from the Health and Retirement Study. Methods A linear regression of age and gender, and a parsimonious theory-based regression additionally incorporating income, wealth, and education, were used to predict systolic blood pressure, body mass index, waist circumference, and telomere length. Prediction, fit, and interpretability were compared across four machine learning methods: linear regression, penalized regressions, random forests, and neural networks. Results All models had poor out-of-sample prediction. Most machine learning models performed similarly to the simpler models. However, neural networks greatly outperformed the three other methods. Neural networks also had good fit to the data ( R 2 between 0.4–0.6, versus <0.3 for all others). Across machine learning models, nine variables were frequently selected or highly weighted as predictors: dental visits, current smoking, self-rated health, serial-seven subtractions, probability of receiving an inheritance, probability of leaving an inheritance of at least $10,000, number of children ever born, African-American race, and gender. Discussion Some of the machine learning methods do not improve prediction or fit beyond simpler models, however, neural networks performed well. The predictors identified across models suggest underlying social factors that are important predictors of biological indicators of chronic disease, and that the non-linear and interactive relationships between variables fundamental to the neural network approach may be important to consider. Highlights ? “Big data” methods may aid understanding of social determinants of health. ? Neural networks outperform other methods in prediction and variance explained. ? No individual machine learning method was readily interpretable. ? Variables in common among machine learning methods suggest core social determinants.
机译:背景技术社会和经济因素是健康的重要预测指标,并且对卫生系统具有公认的重要性。但是,在生物医学文献中其他地方使用的机器学习尚未广泛应用于研究社会与健康之间的关系。我们使用“健康与退休研究”中的数据来研究机器学习如何增加我们对健康的社会决定因素的理解。方法使用年龄和性别的线性回归以及基于简约理论的回归(另外结合收入,财富和教育)来预测收缩压,体重指数,腰围和端粒长度。在四种机器学习方法(线性回归,惩罚回归,随机森林和神经网络)中比较了预测,拟合和可解释性。结果所有模型的样本外预测均较差。大多数机器学习模型的性能与简单模型相似。但是,神经网络的性能大大优于其他三种方法。神经网络也很好地拟合了数据(R 2在0.4-0.6之间,而其他所有数据均<0.3)。在整个机器学习模型中,经常选择九个变量或对其进行加权,以作为预测变量:牙齿访视,当前吸烟,自我评估的健康状况,连续减七次,获得继承的可能性,留下至少10,000美元的继承可能性,子女,非裔美国人和性别。讨论某些机器学习方法无法改善预测或无法超越简单模型,但神经网络的性能很好。跨模型识别的预测因素表明潜在的社会因素是慢性疾病生物学指标的重要预测因素,而神经网络方法基础变量之间的非线性和交互关系可能是重要考虑因素。强调 ? “大数据”方法可能有助于理解健康的社会决定因素。 ?神经网络在预测和方差解释方面优于其他方法。 ?没有任何一种机器学习方法容易解释。 ?机器学习方法中共有的变量表明了核心的社会决定因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号