首页> 外文会议>International Symposium on Multispectral Image Processing and Pattern Recognition >Image caption generation method based on adaptive attention mechanism
【24h】

Image caption generation method based on adaptive attention mechanism

机译:基于自适应注意机制的图像字幕生成方法

获取原文

摘要

An image caption generation model with adaptive attention mechanism is proposed for dealing with the weakness of theimage description model by the local image features. Under the framework of encoder and decoder architecture, the localand global features of images are extracted by using inception V3 and VGG19 network models at the encoder. Since theadaptive attention mechanism proposed in this paper can automatically identify and acquire the importance of local andglobal image information, the decoder can generate sentences describing the image more intuitively and accurately. Theproposed model is trained and tested on Microsoft COCO dataset. The experimental results show that the proposedmethod can extract more abundant and complete information from the image and generate more accurate sentences,compared with the image caption model based on local features.
机译:提出了一种具有自适应注意机制的图像标题生成模型,用于处理弱点图像描述模型由本地图像功能。在编码器和解码器架构的框架下,本地通过在编码器处使用Incepion V3和VGG19网络模型来提取图像的全局特征。自从此以来本文提出的自适应注意机制可以自动识别和获取当地的重要性和全局图像信息,解码器可以生成更直观且准确地描述图像的句子。这在Microsoft Coco DataSet上培训并测试了提出的模型。实验结果表明提出的方法可以从图像中提取更丰富和完整的信息并生成更准确的句子,与基于本地特征的图像字幕模型相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号