首页> 外文会议>IEEE International Conference on Tools with Artificial Intelligence >To be an Artist: Automatic Generation on Food Image Aesthetic Captioning
【24h】

To be an Artist: Automatic Generation on Food Image Aesthetic Captioning

机译:成为艺术家:在食物图像上的自动一代审美标题

获取原文

摘要

Image aesthetic captioning is a multi-modal task that is to generate aesthetic critiques for images. In contrast to common image captioning tasks, where different captions aimed at providing factual descriptions of a same image are always similar, captions with respect to different aesthetic attributes of the same image can be totally different in an aesthetic captioning task. Such inter-aspect differences are always overlooked, which leads to the lack of diversity and coherence of the captions generated by most of the existing image aesthetic captioning systems. In this paper, we propose a novel model to generate aesthetic captions for food images. Our model redefines food image aesthetic captioning as a compositional task that consists of two separated modules, i.e., a single-aspect captioning and an unsupervised text compression. The first module is guaranteed to generate the captions and learn feature representations of each aesthetic attribute. Then, the second module is supposed to study the associations among all feature representations and automatically aggregate captions of all aesthetic attributes to a final sentence. We also collect a dataset which contains pair-wise image-comment data related to six aesthetic attributes. Two new evaluation criteria are introduced to comprehensively assess the quality of the generated captions. Experiments on the dataset demonstrate the effectiveness of the proposed model.
机译:图像美学字幕是一种多模态任务,即为图像产生美学批评。与常见的图像标题任务相反,其中旨在提供相同图像的事实描述的不同标题始终相似,相对于同一图像的不同审美属性的标题可以在美学标题任务中完全不同。这种间隔差异总是被忽略,这导致大多数现有图像美学标题系统产生的标题缺乏多样性和相干性。在本文中,我们提出了一种新型模型来产生食物图像的审美标题。我们的模型将食物图像审美标题重新定义为组成任务,由两个分隔的模块组成,即单个方面标题和无监督的文本压缩。保证第一个模块生成每个审美属性的标题并学习特征表示。然后,第二个模块应该研究所有特征表示中的关联,并自动将所有美学属性的标题聚合到最终句子。我们还收集一个数据集,其中包含与六个美学属性相关的配对图像注释数据。引入了两个新的评估标准以全面评估所生成的标题的质量。数据集上的实验证明了所提出的模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号