To be an Artist: Automatic Generation on Food Image Aesthetic Captioning

Abstract

Image aesthetic captioning is a multi-modal task that aims to generate aesthetic critiques for images. In common image captioning tasks, different captions that give factual descriptions of the same image are always similar; in aesthetic captioning, by contrast, captions addressing different aesthetic attributes of the same image can be totally different. Such inter-aspect differences are always overlooked, which leads to a lack of diversity and coherence in the captions generated by most existing image aesthetic captioning systems. In this paper, we propose a novel model to generate aesthetic captions for food images. Our model redefines food image aesthetic captioning as a compositional task consisting of two separate modules: single-aspect captioning and unsupervised text compression. The first module generates captions and learns feature representations for each aesthetic attribute. The second module then learns the associations among all feature representations and automatically aggregates the captions of all aesthetic attributes into a final sentence. We also collect a dataset of paired image-comment data covering six aesthetic attributes. Two new evaluation criteria are introduced to comprehensively assess the quality of the generated captions. Experiments on the dataset demonstrate the effectiveness of the proposed model.
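
The two-module composition described in the abstract can be pictured with a minimal Python sketch. This is an illustration under stated assumptions, not the authors' implementation: the names `AspectOutput`, `caption_all_aspects`, `aggregate`, and `make_stub_captioner` are hypothetical, the aspect names are placeholders (the abstract does not list the six attributes), and the aggregation step is a simple rank-and-join stand-in rather than the paper's unsupervised text compression.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class AspectOutput:
    """Output of one single-aspect captioner (hypothetical structure)."""
    aspect: str
    caption: str
    features: List[float]


# Stage 1 interface (assumed): an image goes in, a caption plus a
# feature representation for one aesthetic attribute comes out.
AspectCaptioner = Callable[[object], AspectOutput]


def caption_all_aspects(image: object,
                        captioners: Dict[str, AspectCaptioner]) -> List[AspectOutput]:
    """Run every single-aspect captioner on the same image."""
    return [captioner(image) for captioner in captioners.values()]


def aggregate(outputs: Sequence[AspectOutput]) -> str:
    """Stage 2 stand-in: merge per-aspect captions into one sentence.

    NOT the paper's unsupervised text compression. As a placeholder,
    captions are ordered by the L1 magnitude of their feature vectors
    and joined with semicolons.
    """
    ranked = sorted(outputs,
                    key=lambda o: sum(abs(x) for x in o.features),
                    reverse=True)
    parts = [o.caption.strip().rstrip(".") for o in ranked if o.caption.strip()]
    if not parts:
        return ""
    sentence = "; ".join(parts) + "."
    return sentence[0].upper() + sentence[1:]


def make_stub_captioner(aspect: str, caption: str,
                        features: Sequence[float]) -> AspectCaptioner:
    """Build a fixed-output captioner so the sketch runs without a model."""
    def captioner(image: object) -> AspectOutput:
        return AspectOutput(aspect, caption, list(features))
    return captioner


# Placeholder aspects and captions, chosen only for illustration.
captioners = {
    "color": make_stub_captioner("color", "the colors are vivid.", [0.9, 0.1]),
    "composition": make_stub_captioner(
        "composition", "the framing feels cramped.", [0.2, 0.3]),
}
final = aggregate(caption_all_aspects("dish.jpg", captioners))
print(final)
```

Keeping the per-aspect captioners behind one shared interface mirrors the abstract's point that each aesthetic attribute gets its own caption and feature representation before a separate step combines them.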

Bibliographic details

Source: IEEE International Conference on Tools with Artificial Intelligence, 2020, pp. 779-786 (8 pages)
Authors: Xiaohan Zou; Cheng Lin; Yinjia Zhang; Qinpei Zhao
Original format: PDF
Keywords: Measurement; Image coding; Conferences; Coherence; Tools; Task analysis; Artificial intelligence

