International Conference on Graphic and Image Processing

Generating Description with Multi-feature Fusion and Saliency Maps of Image

Abstract

Generating a description for an image can be regarded as visual understanding, a task that spans artificial intelligence, machine learning, natural language processing, and many other areas. In this paper, we present a model that generates descriptions for images using an RNN (recurrent neural network) with object attention and multiple image features. Deep recurrent neural networks have shown excellent performance in machine translation, so we use them to generate natural-sentence descriptions for images. The common approach uses a single CNN (convolutional neural network) trained on ImageNet to extract image features, but we argue that this cannot adequately capture the content of an image, since it may focus only on object regions. We therefore add scene information to the image features using a CNN trained on Places205. Experiments show that a model with multiple features extracted by two CNNs performs better than one with a single feature. In addition, we apply saliency weights to images to emphasize the salient objects. We evaluate our model on MSCOCO using public metrics, and the results show that it outperforms several state-of-the-art methods.
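The two ideas the abstract combines, fusing object features (ImageNet CNN) with scene features (Places205 CNN) and weighting spatial activations by a saliency map, can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation; the function names, feature shapes, and the normalized weighted-pooling scheme are assumptions for the sake of the example.

```python
import numpy as np

def saliency_weighted_pool(conv_maps, saliency):
    """Pool spatial CNN activations into one feature vector,
    weighting each location by a saliency map (hypothetical scheme).

    conv_maps: (C, H, W) activations from a convolutional layer.
    saliency:  (H, W) non-negative saliency weights.
    Returns a (C,) vector emphasizing salient regions.
    """
    # Normalize saliency so the weights sum to 1 (epsilon avoids /0).
    w = saliency / (saliency.sum() + 1e-8)
    return (conv_maps * w[None, :, :]).sum(axis=(1, 2))

def fuse_features(object_feat, scene_feat):
    """Multi-feature fusion by concatenating the object-centric
    (ImageNet-trained) and scene (Places205-trained) CNN features."""
    return np.concatenate([object_feat, scene_feat])
```

The fused vector would then condition the RNN decoder at each step of sentence generation; concatenation is the simplest fusion choice, and other schemes (e.g. a learned projection of both features) are possible.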
