Development and validation of a risk score for diabetic kidney disease prediction in type 2 diabetes patients: a machine learning approach

Sarkhosh S. M. Hosseini; Hemmatabadi M.; Esteghamati A.

摘要

Purpose This study aims to develop and validate a risk score to predict the occurrence of DKD in individuals with type 2 diabetes using a machine learning (ML) approach. Methods By implementing Recursive Feature Elimination with Cross-Validation (RFECV) and RFE on the Diabetes Clinic of Imam Khomeini Hospital Complex (IKHC) dataset, the most critical features were identified. These features were used in the multivariate logistic regression (LR) analysis, and the discrimination and calibration of the model were evaluated. Finally, external validation of the model was assessed. Results The development dataset included 1907 type 2 diabetic patients, 763 of whom developed DKD over 5 years. The predictive model performed well in the development dataset by implementing RFECV with the RF algorithm and considering six features (AUC: 79). Using these features, the LR-based risk score indicated appropriate discrimination (AUC: 75.5, 95 CI 73-78) and acceptable calibration (chi(2)=7.44; p value = 0.49). This risk score was then used for 1543 diabetic patients in the validation dataset, including 633 patients with DKD over 5 years. The results showed sufficient discrimination (AUC: 75.8, 95 CI 73-78) of the risk score in the validation dataset. Conclusions We developed and validated a new risk score for predicting DKD via ML approach, which used common features in the periodic screening of type 2 diabetic patients that are readily available. In addition, a web-based online tool that is readily available to the public was developed to calculate the DKD risk score.

机译：目的本研究旨在开发和验证风险评分，以使用机器学习（ML）方法预测 2 型糖尿病患者 DKD 的发生率。方法通过对伊玛目霍梅尼医院综合医院（IKHC）数据集的糖尿病诊所实施交叉验证递归特征消除（RFECV）和RFE，识别出最关键的特征。将这些特征用于多元logistic回归（LR）分析，并评估模型的判别和校准。最后，对模型的外部验证进行了评估。结果开发数据集纳入1907例2型糖尿病患者，其中763例在5年内发生DKD。通过使用 RF 算法实现 RFECV 并考虑六个特征（AUC：79%），预测模型在开发数据集中表现良好。使用这些特征，基于 LR 的风险评分表明适当的区分（AUC： 75.5%， 95% CI 73-78%）和可接受的校准（chi（2）=7.44;p 值 = 0.49）。然后将该风险评分用于验证数据集中的 1543 名糖尿病患者，其中包括 633 名 5 年以上的 DKD 患者。结果显示，验证数据集中的风险评分具有足够的辨别力（AUC：75.8%，95% CI 73-78%）。结论我们开发并验证了一种新的风险评分，用于通过ML方法预测DKD，该评分使用了现成的2型糖尿病患者定期筛查的共同特征。此外，还开发了一种基于网络的在线工具，可供公众随时使用，以计算DKD风险评分。

Development and validation of a risk score for diabetic kidney disease prediction in type 2 diabetes patients: a machine learning approach

摘要

著录项

引文网络

相关主题

期刊订阅