What Topics Do Images Say: A Neural Image Captioning Model with Topic Representation

机译：图像说什么主题：具有主题表示的神经图像字幕模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Image captioning aims to generate descriptions of images with natural language sentences automatically. Most methods tackle this problem in an end-to-end fashion in recent years, which generates captions directly from image level features but ignores high-level semantic information. The method that introduced attribute concept into the CNN-RNN framework made a considerable improvement while the performance depended on the manually selected attributes heavily. In this paper, we propose a topic-guided neural image captioning model which incorporates a topic model into the CNN-RNN framework. Our model represents each image as a set of topics and each topic as various words with relevant distributions. We conduct experiments on Microsoft COCO dataset. The results show that our model outperforms the baselines and achieves promising performance. It verifies that the topic features are effective to represent high-level semantic information of images.

机译：图像字幕的目的是自动生成带有自然语言句子的图像描述。近年来，大多数方法都以端到端的方式解决了这个问题，该方法直接从图像级功能生成字幕，但忽略了高级语义信息。将属性概念引入到CNN-RNN框架中的方法进行了相当大的改进，而性能则严重依赖于手动选择的属性。在本文中，我们提出了一个主题指导的神经图像字幕模型，该模型将主题模型合并到了CNN-RNN框架中。我们的模型将每个图像表示为一组主题，将每个主题表示为具有相关分布的各个单词。我们对Microsoft COCO数据集进行实验。结果表明，我们的模型优于基线，并实现了令人满意的性能。它验证了主题特征可以有效地表示图像的高级语义信息。

著录项

来源
《IEEE International Conference on Multimedia Expo Workshops》|2019年|447-452|共6页
会议地点
作者
Feng Chen; Songxian Xie; Xinyi Li; Shasha Li; Jintao Tang; Ting Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
computer vision; convolutional neural nets; image representation; learning (artificial intelligence); natural language processing; recurrent neural nets; text analysis;

机译：计算机视觉;卷积神经网络;图像表示;学习（人工智能）;自然语言处理;递归神经网络;文本分析;

相似文献

外文文献
中文文献
专利

1. Topic modeling and improvement of image representation for large-scale image retrieval [J] . Nguyen Anh Tu, Dong-Luong Dinh, Rasel Mostofa Kamal, Information Sciences: An International Journal . 2016,第Null期

机译：用于大规模图像检索的主题建模和图像表示的改进
2. A neural image captioning model with caption-to-images semantic constructor [J] . Su Jinsong, Tang Jialong, Lu Ziyao, Neurocomputing . 2019,第Nova20期

机译：具有字幕到图像语义构造函数的神经图像字幕模型
3. Visual Topic Network: Building better image representations for images in social media [J] . Zhenxing Niu, Gang Hua, Qi Tian, Computer vision and image understanding . 2015,第jula期

机译：视觉主题网络：为社交媒体中的图像构建更好的图像表示
4. What Topics Do Images Say: A Neural Image Captioning Model with Topic Representation [C] . Feng Chen, Songxian Xie, Xinyi Li, IEEE International Conference on Multimedia amp;amp;amp;amp;amp;amp; Expo Workshops . 2019

机译：图像有哪些主题说：具有主题表示的神经图像标题模型
5. Topic Uncovering and Image Annotation via Scalable Probit Normal Correlated Topic Models [D] . Yu, Xingchen. 2015

机译：通过可扩展的Probit正常相关主题模型进行主题发现和图像注释
6. Structured Correspondence Topic Models for Mining Captioned Figures in Biological Literature [O] . Amr Ahmed, Eric P. Xing, William W. Cohen, -1

机译：生物文献中带字幕人物的结构化对应主题模型
7. Probabilistic models for topic learning from images and captions in online biomedical literatures [O] . Xin Chen, Caimei Lu, Yuan An, 2009

机译：从在线生物医学文献中的图像和标题中学习主题的概率模型

What Topics Do Images Say: A Neural Image Captioning Model with Topic Representation

摘要

著录项

相似文献

相关主题

期刊订阅