...
首页> 外文期刊>Korean journal of radiology : >Design Characteristics of Studies Reporting the Performance of Artificial Intelligence Algorithms for Diagnostic Analysis of Medical Images: Results from Recently Published Papers
【24h】

Design Characteristics of Studies Reporting the Performance of Artificial Intelligence Algorithms for Diagnostic Analysis of Medical Images: Results from Recently Published Papers

机译:报告医学图像诊断分析人工智能算法性能的研究设计特征:最新发表论文的结果

获取原文
           

摘要

Objective To evaluate the design characteristics of studies that evaluated the performance of artificial intelligence (AI) algorithms for the diagnostic analysis of medical images. Materials and Methods PubMed MEDLINE and Embase databases were searched to identify original research articles published between January 1, 2018 and August 17, 2018 that investigated the performance of AI algorithms that analyze medical images to provide diagnostic decisions. Eligible articles were evaluated to determine 1) whether the study used external validation rather than internal validation, and in case of external validation, whether the data for validation were collected, 2) with diagnostic cohort design instead of diagnostic case-control design, 3) from multiple institutions, and 4) in a prospective manner. These are fundamental methodologic features recommended for clinical validation of AI performance in real-world practice. The studies that fulfilled the above criteria were identified. We classified the publishing journals into medical vs. non-medical journal groups. Then, the results were compared between medical and non-medical journals. Results Of 516 eligible published studies, only 6% (31 studies) performed external validation. None of the 31 studies adopted all three design features: diagnostic cohort design, the inclusion of multiple institutions, and prospective data collection for external validation. No significant difference was found between medical and non-medical journals. Conclusion Nearly all of the studies published in the study period that evaluated the performance of AI algorithms for diagnostic analysis of medical images were designed as proof-of-concept technical feasibility studies and did not have the design features that are recommended for robust validation of the real-world clinical performance of AI algorithms.
机译:目的评估评估人工智能(AI)算法对医学图像诊断分析性能的研究的设计特征。材料和方法搜索PubMed MEDLINE和Embase数据库以识别2018年1月1日至2018年8月17日之间发表的原创研究文章,这些研究文章研究了分析医学图像以提供诊断决策的AI算法的性能。对符合条件的文章进行评估,以确定1)该研究是否使用外部验证而不是内部验证;在外部验证的情况下,是否收集了验证数据; 2)使用诊断队列设计而不是诊断病例对照设计; 3)来自多个机构,并且4)以预期的方式。这些是推荐用于实际实践中AI性能临床验证的基本方法学功能。确定满足上述标准的研究。我们将出版期刊分为医学期刊和非医学期刊。然后,比较医学期刊和非医学期刊的结果。结果在516项合格的已发表研究中,只有6%(31个研究)进行了外部验证。 31项研究中没有一项采用所有三个设计特征:诊断队列设计,多个机构的纳入以及用于外部验证的前瞻性数据收集。医学和非医学期刊之间没有发现显着差异。结论在研究期间发表的几乎所有评估AI算法在医学图像诊断分析中的性能的研究均被设计为概念验证技术可行性研究,并且不具有推荐用于可靠验证的设计特征。 AI算法的实际临床表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号