【24h】

Reinforced Transformer for Medical Image Captioning

机译:用于医学图像字幕的增强型变压器

获取原文

摘要

Computerized medical image report generation is of great significance in automating the workflow of medical diagnosis and treatment for reducing health disparities. However, this task presents several challenges, where the generated medical image report should be precise, coherent and contain heterogeneous information. Current deep learning based medical image captioning models rely on recurrent neural networks and only extract top-down visual features, which make them slow and prone to generate incoherent and hard to comprehend reports. To tackle this challenging problem, this paper proposes a hierarchical Irans-former based medical imaging report generation model. Our proposed model consists of two parts: (1) An Image Encoder extracts heuristic visual features by a bottom-up attention mechanism; (2) a non-recurrent Captioning Decoder improves the computational efficiency by parallel computation. The former identifies regions of interest via a bottom-up attention module and extracts top-down visual features. Then the Transformer based captioning decoder generates a coherent paragraph of medical imaging report. The proposed model is trained by using a self-critical reinforcement learning method. We evaluate the proposed model on publicly available datasets of IU X-ray. The experiment results show that our proposed model has improved the performance in BLEU-1 by more than 50% compared with other state-of-the-art image captioning methods.
机译:计算机化的医学图像报告生成对于自动化医学诊断和治疗流程以减少健康差异具有重要意义。但是,此任务提出了一些挑战,其中生成的医学图像报告应准确,连贯并包含异构信息。当前基于深度学习的医学图像字幕模型依赖于递归神经网络,并且仅提取自上而下的视觉特征,这使其缓慢且易于生成不连贯且难以理解的报告。为了解决这个具有挑战性的问题,本文提出了一个基于分层的伊朗前人的医学影像报告生成模型。我们提出的模型包括两个部分:(1)图像编码器通过自下而上的注意力机制提取启发式视觉特征; (2)非循环字幕解码器通过并行计算提高了计算效率。前者通过自下而上的注意力模块识别兴趣区域,并提取自上而下的视觉特征。然后,基于Transformer的字幕解码器会生成医学成像报告的相关段落。通过使用自关键强化学习方法来训练提出的模型。我们在IU X射线的公开数据集上评估提出的模型。实验结果表明,与其他最新的图像字幕方法相比,我们提出的模型将BLEU-1的性能提高了50%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号