首页> 外文期刊>Frontiers in Neurology >Automated Generation of Radiologic Descriptions on Brain Volume Changes From T1-Weighted MR Images: Initial Assessment of Feasibility
【24h】

Automated Generation of Radiologic Descriptions on Brain Volume Changes From T1-Weighted MR Images: Initial Assessment of Feasibility

机译:从T1加权MR图像自动生成脑体积变化的放射学描述:可行性初步评估

获取原文
           

摘要

Purpose: To examine the feasibility and potential difficulties of automatically generating radiologic reports (RRs) to articulate the clinically important features of brain magnetic resonance (MR) images. Materials and Methods: We focused on examining brain atrophy by using magnetization-prepared rapid gradient-echo (MPRAGE) images. The technology was based on multi-atlas whole-brain segmentation that identified 283 structures, from which larger superstructures were created to represent the anatomic units most frequently used in RRs. Through two layers of data-reduction filters, based on anatomic and clinical knowledge, raw images (~10 MB) were converted to a few kilobytes of human-readable sentences. The tool was applied to images from 92 patients with memory problems, and the results were compared to RRs independently produced by three experienced radiologists. The mechanisms of disagreement were investigated to understand where machine–human interface succeeded or failed. Results: The automatically generated sentences had low sensitivity (mean: 24.5%) and precision (mean: 24.9%) values; these were significantly lower than the inter-rater sensitivity (mean: 32.7%) and precision (mean: 32.2%) of the radiologists. The causes of disagreement were divided into six error categories: mismatch of anatomic definitions (7.2 ± 9.3%), data-reduction errors (11.4 ± 3.9%), translator errors (3.1 ± 3.1%), difference in the spatial extent of used anatomic terms (8.3 ± 6.7%), segmentation quality (9.8 ± 2.0%), and threshold for sentence-triggering (60.2 ± 16.3%). Conclusion: These error mechanisms raise interesting questions about the potential of automated report generation and the quality of image reading by humans. The most significant discrepancy between the human and automatically generated RRs was caused by the sentence-triggering threshold (the degree of abnormality), which was fixed to z-score &2.0 for the automated generation, while the thresholds by radiologists varied among different anatomical structures.
机译:目的:研究自动生成放射学报告(RR)以阐明脑磁共振(MR)图像的临床重要特征的可行性和潜在困难。材料和方法:我们致力于通过磁化准备的快速梯度回波(MPRAGE)图像检查大脑萎缩。该技术基于多图集全脑分割,该分割识别出283个结构,从中创建了较大的上层结构,以表示RR中最常用的解剖单位。通过基于解剖和临床知识的两层数据缩减过滤器,原始图像(约10 MB)被转换为几千字节的人类可读句子。该工具被用于92位有记忆问题的患者的图像,并将结果与​​三位经验丰富的放射科医生独立产生的RR进行了比较。对分歧机制进行了研究,以了解人机界面在哪里成功或失败。结果:自动生成的句子的敏感度(平均:24.5%)和精确度(平均:24.9%)低;这些显着低于放射科医生的评定者间敏感性(平均:32.7%)和精确度(平均:32.2%)。引起分歧的原因分为六个错误类别:解剖结构定义不匹配(7.2±9.3%),数据减少错误(11.4±3.9%),翻译错误(3.1±3.1%),使用的解剖空间范围差异字词(8.3±6.7%),细分质量(9.8±2.0%)和句子触发阈值(60.2±16.3%)。结论:这些错误机制引发了有关自动生成报告的潜力和人类阅读图像的质量的有趣问题。人为和自动生成的RR之间最显着的差异是由触发句子的阈值(异常程度)引起的,该阈值对于自动生成固定为z评分> 2.0,而放射线医师的阈值因解剖结构而异结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号