首页> 外文期刊>BMC Bioinformatics >Predicting diabetic retinopathy and identifying interpretable biomedical features using machine learning algorithms
【24h】

Predicting diabetic retinopathy and identifying interpretable biomedical features using machine learning algorithms

机译:使用机器学习算法预测糖尿病性视网膜病变并识别可解释的生物医学特征

获取原文
           

摘要

The risk factors of diabetic retinopathy (DR) were investigated extensively in the past studies, but it remains unknown which risk factors were more associated with the DR than others. If we can detect the DR related risk factors more accurately, we can then exercise early prevention strategies for diabetic retinopathy in the most high-risk population. The purpose of this study is to build a prediction model for the DR in type 2 diabetes mellitus using data mining techniques including the support vector machines, decision trees, artificial neural networks, and logistic regressions. Experimental results demonstrated that prediction performance by support vector machines performed better than the other machine learning algorithms and achieved 79.5% and 0.839 in accuracy and area under the receiver operating characteristic curve using percentage split (i.e., data set divided into 80% as trainning and 20% as test), respectively. Evaluated by three-way data split scheme (i.e., data set divided into 60% as training, 20% as validation, and 20% as independent test), our method obtained slightly lower performance compared to percentage split, which suggested that three-way data split is a better way to evaluate the real performance and prevent overestimation. Moreover, we incorporated approaches proposed in previous studies to evaluate our data set and our prediction performance outperformed the other previous studies in most evaluation measures. This lends support to our assumption that appropriate machine learning algorithms combined with discriminative clinical features can effectively detect diabetic retinopathy. Our method identifies use of insulin and duration of diabetes as novel interpretable features to assist with clinical decisions in identifying the high-risk populations for diabetic retinopathy. If duration of DM increases by 1?year, the odds ratio to have DMR is increased by 9.3%. The odds ratio to have DR is increased by 3.561 times for patients who use insulin compared to patients who do not use insulin. Our results can be used to facilitate development of clinical decision support systems for clinical practice in the future.
机译:在过去的研究中,对糖尿病性视网膜病(DR)的危险因素进行了广泛的研究,但尚不清楚哪些危险因素与DR的关系更大。如果我们能够更准确地检测出与DR相关的危险因素,那么我们就可以针对最高风险人群实施糖尿病视网膜病变的早期预防策略。本研究的目的是使用数据挖掘技术(包括支持向量机,决策树,人工神经网络和逻辑回归)建立2型糖尿病DR的预测模型。实验结果表明,支持向量机的预测性能优于其他机器学习算法,并且使用百分比分割(即,将数据集分为80%作为训练和20%分为20%,在接收器工作特性曲线下,其准确度和面积分别达到了79.5%和0.839) %作为测试)。通过三向数据拆分方案(即,将数据集分为60%作为训练,20%作为验证和20%作为独立测试)进行评估,与百分比拆分相比,我们的方法获得的性能略低,这表明三向数据拆分方案数据拆分是评估实际性能并防止高估的更好方法。此外,我们采用了先前研究中提出的方法来评估我们的数据集,并且在大多数评估方法中,我们的预测性能均优于其他先前研究。这支持了我们的假设,即适当的机器学习算法与可区分的临床特征相结合可以有效地检测出糖尿病性视网膜病变。我们的方法将胰岛素的使用和糖尿病的持续时间确定为新颖的可解释特征,以协助临床决策确定糖尿病性视网膜病的高危人群。如果DM的持续时间增加1年,则拥有DMR的几率会增加9.3%。与不使用胰岛素的患者相比,使用胰岛素的患者发生DR的几率比提高了3.561倍。我们的结果可用于促进未来临床实践的临床决策支持系统的开发。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号