首页> 外文会议>IEEE International Conference on Multimedia amp;amp;amp;amp;amp;amp; Expo Workshops >What Topics Do Images Say: A Neural Image Captioning Model with Topic Representation
【24h】

What Topics Do Images Say: A Neural Image Captioning Model with Topic Representation

机译:图像有哪些主题说:具有主题表示的神经图像标题模型

获取原文

摘要

Image captioning aims to generate descriptions of images with natural language sentences automatically. Most methods tackle this problem in an end-to-end fashion in recent years, which generates captions directly from image level features but ignores high-level semantic information. The method that introduced attribute concept into the CNN-RNN framework made a considerable improvement while the performance depended on the manually selected attributes heavily. In this paper, we propose a topic-guided neural image captioning model which incorporates a topic model into the CNN-RNN framework. Our model represents each image as a set of topics and each topic as various words with relevant distributions. We conduct experiments on Microsoft COCO dataset. The results show that our model outperforms the baselines and achieves promising performance. It verifies that the topic features are effective to represent high-level semantic information of images.
机译:图像标题旨在自动生成具有自然语言句子的图像的描述。大多数方法近年来以端到端的方式解决这个问题,直接从图像级别功能生成标题,但忽略了高级语义信息。将属性概念引入CNN-RNN框架的方法具有相当大的改进,而性能依赖于手动所选属性。在本文中,我们提出了一种主题引导的神经图像标题模型,其将主题模型结合到CNN-RNN框架中。我们的模型代表每个图像作为一组主题和每个主题作为具有相关分布的各种单词。我们对Microsoft Coco DataSet进行实验。结果表明,我们的模型优于基线,实现了有希望的性能。它验证主题功能是否有效地代表图像的高级语义信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号