An image caption generation model with an adaptive attention mechanism is proposed to address the weakness of image description models that rely only on local image features. Under an encoder-decoder framework, local and global image features are extracted at the encoder using the Inception V3 and VGG19 network models, respectively. Because the proposed adaptive attention mechanism automatically identifies and weighs the importance of local and global image information, the decoder can generate sentences that describe the image more intuitively and accurately. The proposed model is trained and tested on the Microsoft COCO dataset. Experimental results show that, compared with an image caption model based on local features alone, the proposed method extracts richer and more complete information from the image and generates more accurate sentences.
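The abstract does not give the exact fusion equations, but a common form of adaptive attention computes spatial attention over local region features and then uses a learned scalar gate to blend the attended local context with the global feature. The sketch below is a minimal NumPy illustration under those assumptions; all weight names (`Wl`, `Wh`, `w_a`, `w_g`) and dimensions are hypothetical, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def adaptive_attention(local_feats, global_feat, hidden, params):
    """Fuse local region features and a global feature with a learned gate.

    local_feats: (R, D) region features (e.g. from Inception V3)
    global_feat: (D,)   global feature (e.g. from VGG19, projected to D)
    hidden:      (H,)   decoder hidden state at the current time step
    """
    Wl, Wh = params["Wl"], params["Wh"]
    w_a, w_g = params["w_a"], params["w_g"]
    # spatial attention over local regions, conditioned on the hidden state
    scores = np.tanh(local_feats @ Wl + hidden @ Wh) @ w_a   # (R,)
    alpha = softmax(scores)                                  # attention weights
    local_ctx = alpha @ local_feats                          # (D,) attended local context
    # scalar gate: how much to rely on global vs. local information this step
    beta = sigmoid(hidden @ w_g)
    context = beta * global_feat + (1.0 - beta) * local_ctx  # (D,) fused context
    return context, alpha, beta

# toy dimensions: 49 regions (7x7 grid), 256-d features, 512-d hidden state
R, D, H = 49, 256, 512
params = {
    "Wl": rng.normal(size=(D, 64)),
    "Wh": rng.normal(size=(H, 64)),
    "w_a": rng.normal(size=64),
    "w_g": rng.normal(size=H),
}
ctx, alpha, beta = adaptive_attention(
    rng.normal(size=(R, D)), rng.normal(size=D), rng.normal(size=H), params
)
```

The fused `context` vector would then feed the decoder when predicting the next word; the gate `beta` is what lets the model decide per time step whether global or local evidence matters more.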