首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Deep Learning Applied to Chest X-Rays Exploiting and Preventing Shortcuts
【24h】

Deep Learning Applied to Chest X-Rays Exploiting and Preventing Shortcuts

机译:深入学习应用于胸部X射线利用和防止快捷方式

获取原文
           

摘要

While deep learning has shown promise in improving the automated diagnosis of disease based on chest X-rays, deep networks may exhibit undesirable behavior related to short-cuts. This paper studies the case of spurious class skew in which patients with a particular attribute are spuriously more likely to have the outcome of interest. For instance, clinical protocols might lead to a dataset in which patients with pacemakers are disproportionately likely to have congestive heart failure. This skew can lead to models that take shortcuts by heavily relying on the biased attribute. We explore this problem across a number of attributes in the context of diagnosing the cause of acute hypoxemic respiratory failure. Applied to chest X-rays, we show that i) deep nets can accurately identify many patient attributes including sex (AUROC = 0.96) and age (AUROC 0.90), ii) they tend to exploit correlations between such attributes and the outcome label when learning to predict a diagnosis, leading to poor performance when such correlations do not hold in the test population (e.g., everyone in the test set is male), and iii) a simple transfer learning approach is surprisingly effective at preventing the shortcut and promoting good generalization performance. On the task of diagnosing congestive heart failure based on a set of chest X-rays skewed towards older patients (age $geq$ 63), the proposed approach improves generalization over standard training from 0.66 (95% CI: 0.54-0.77) to 0.84 (95% CI: 0.73-0.92) AUROC. While simple, the proposed approach has the potential to improve the performance of models across populations by encouraging reliance on clinically relevant manifestations of disease, i.e., those that a clinician would use to make a diagnosis.
机译:虽然深入学习表明了在提高基于胸部X射线的疾病自动诊断方面,但深网络可能表现出与短切有关的不良行为。本文研究了虚假级偏斜的情况,其中特定属性的患者虚幻地更有可能具有兴趣的结果。例如,临床方案可能导致与起搏器患者不成比例地具有充血性心力衰竭的数据集。这种偏斜可以通过大量依赖偏置属性来导致采用快捷方式的模型。我们在诊断急性低氧呼吸衰竭原因的背景下探讨了许多属性的问题。应用于胸部X射线,我们表明i)深网络可以准确地识别许多患者属性,包括性别(Auroc = 0.96)和年龄(Auroc 0.90),II)它们倾向于利用这些属性与学习时的结果标签之间的相关性为了预测诊断,导致性能不佳,当这些相关性不持有测试人群(例如,测试集中的每个人是男性)和III),并且III)简单的转移学习方法在预防快捷方式和促进良好的普遍方面令人惊讶地有效表现。基于一组胸部X射线对老年患者的胸部X射线诊断诊断性心力衰竭的任务(年龄$ GEQ $ 63),该方法从0.66(95%CI:0.54-0.77)增加标准培训的普遍化。 0.84(95%CI:0.73-0.92)Auroc。虽然简单,所提出的方法通过促进依赖于临床相关的疾病表现,即临床医生将用于进行诊断,可以通过促进依赖群体来改善模型的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号