...
首页> 外文期刊>Journal of medical Internet research >Artificial Intelligence-Based Differential Diagnosis: Development and Validation of a Probabilistic Model to Address Lack of Large-Scale Clinical Datasets
【24h】

Artificial Intelligence-Based Differential Diagnosis: Development and Validation of a Probabilistic Model to Address Lack of Large-Scale Clinical Datasets

机译:基于人工智能的差异诊断:概率模型的开发和验证解决缺乏大规模临床数据集

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background Machine-learning or deep-learning algorithms for clinical diagnosis are inherently dependent on the availability of large-scale clinical datasets. Lack of such datasets and inherent problems such as overfitting often necessitate the development of innovative solutions. Probabilistic modeling closely mimics the rationale behind clinical diagnosis and represents a unique solution. Objective The aim of this study was to develop and validate a probabilistic model for differential diagnosis in different medical domains. Methods Numerical values of symptom-disease associations were utilized to mathematically represent medical domain knowledge. These values served as the core engine for the probabilistic model. For the given set of symptoms, the model was utilized to produce a ranked list of differential diagnoses, which was compared to the differential diagnosis constructed by a physician in a consult. Practicing medical specialists were integral in the development and validation of this model. Clinical vignettes (patient case studies) were utilized to compare the accuracy of doctors and the model against the assumed gold standard. The accuracy analysis was carried out over the following metrics: top 3 accuracy, precision, and recall. Results The model demonstrated a statistically significant improvement ( P =.002) in diagnostic accuracy (85%) as compared to the doctors’ performance (67%). This advantage was retained across all three categories of clinical vignettes: 100% vs 82% ( P &.001) for highly specific disease presentation, 83% vs 65% for moderately specific disease presentation ( P =.005), and 72% vs 49% ( P &.001) for nonspecific disease presentation. The model performed slightly better than the doctors’ average in precision (62% vs 60%, P =.43) but there was no improvement with respect to recall (53% vs 56%, P =.27). However, neither difference was statistically significant. Conclusions The present study demonstrates a drastic improvement over previously reported results that can be attributed to the development of a stable probabilistic framework utilizing symptom-disease associations to mathematically represent medical domain knowledge. The current iteration relies on static, manually curated values for calculating the degree of association. Shifting to real-world data–derived values represents the next step in model development.
机译:背景技术用于临床诊断的机器学习或深度学习算法本身取决于大规模临床数据集的可用性。缺乏这样的数据集和过度装备等固有问题往往需要开发创新解决方案。概率模型密切模仿临床诊断背后的理由,代表了独特的解决方案。目的本研究的目的是开发和验证不同医学领域的鉴别诊断的概率模型。方法使用症状疾病关联的数值用于数学上代表医疗领域知识。这些值用作概率模型的核心引擎。对于给定的症状集,利用该模型来产生排名的差异诊断列表,其与医师在咨询中构建的差异诊断进行了比较。练习医学专家在该模型的开发和验证方面是一体的。利用临床小叶(患者案例研究)来比较医生和模型对假设金标准的准确性。通过以下度量标准进行准确性分析:前3个精度,精度和召回。结果,与医生的性能相比,该模型在诊断准确率(85%)中展示了统计上显着的改进(P = .002)。这种优势在所有三类临床血征中保留:100%vs 82%(p& .001),用于高度特异性的疾病呈现,适度特异性疾病呈现的83%vs 65%(p = .005),72%对于非特异性疾病介绍,与49%(P& .001)。该模型比医生的平均水平略好于精度(62%Vs 60%,P = .43),但对召回没有改善(53%Vs 56%,p = .27)。然而,差异均有统计学意义。结论本研究表明,对先前报告的结果,可以归因于利用症状 - 疾病关联的稳定概率框架的发展,以便在数学上代表医疗领域知识。当前迭代依赖于计算关联程度的静态,手动静音值。转换到现实世界的数据派生值代表模型开发的下一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号