首页> 美国卫生研究院文献>NPJ Digital Medicine >Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
【2h】

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program

机译:深度学习与人类分级者在全国筛查计划中对糖尿病性视网膜病严重程度进行分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, p < 0.001), and a slightly lower specificity (0.96 vs. 0.98, p < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME (p < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively (p < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening.
机译:深度学习算法已用于以专家级别的准确性检测糖尿病性视网膜病变(DR)。本研究旨在在大规模临床人群中验证一种这样的算法,并将该算法的性能与人类评分者的性能进行比较。来自泰国社区为基础的全国性DR筛查计划中的总共25,326例糖尿病患者的可分级视网膜图像进行了DR严重度和可参考的糖尿病性黄斑水肿(DME)分析。由国际视网膜专家小组评审的等级作为参考标准。相对于人类评分者,用于检测可参考的DR(中度NPDR或更差),深度学习算法具有明显更高的灵敏度(0.97 vs. 0.74,p <0.001)和更低的特异性(0.96 vs. 0.98,p <0.001) 。对于严重或较差的NPDR,PDR和DME的每个类别,算法的灵敏度也更高(所有比较的p <0.001)。该算法和人类评分者用于确定DR严重程度水平的二次加权kappa分别为0.85和0.78(差异p <0.001)。在用于确定可参考疾病的不同严重程度的DR中,深度学习显着降低了假阴性率(降低了23%),但代价是假阳性率略高(2%)。深度学习算法可以用作DR筛选的有价值的工具。

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号