首页> 外文会议>Conference on Multimedia Information Processing and Retrieval >Image Captioning with Clause-Focused Metrics in a Multi-modal Setting for Marketing
【24h】

Image Captioning with Clause-Focused Metrics in a Multi-modal Setting for Marketing

机译:在营销的多模式设置中以条款为中心的指标对图像进行字幕

获取原文

摘要

Automatically generating descriptive captions for images is a well-researched area in computer vision. However, existing evaluation approaches focus on measuring the similarity between two sentences disregarding fine-grained semantics of the captions. In our setting of images depicting persons interacting with branded products, the subject, predicate, object and the name of the branded product are important evaluation criteria of the generated captions. Generating image captions with these constraints is a new challenge, which we tackle in this work. By simultaneously predicting integer-valued ratings that describe attributes of the human-product interaction, we optimize a deep neural network architecture in a multi-task learning setting, which considerably improves the caption quality. Furthermore, we introduce a novel metric that allows us to assess whether the generated captions meet our requirements (i.e., subject, predicate, object, and product name) and describe a series of experiments on caption quality and how to address annotator disagreements for the image ratings with an approach called soft targets. We also show that our novel clause-focused metrics are also applicable to other image captioning datasets, such as the popular MSCOCO dataset.
机译:自动生成图像的描述性标题是计算机视觉中一个经过深入研究的领域。但是,现有的评估方法侧重于测量两个句子之间的相似性,而无视字幕的细粒度语义。在我们描述人物与品牌产品互动的图像的设置中,品牌产品的主题,谓词,宾语和名称是所生成字幕的重要评估标准。在这些约束条件下生成图像字幕是一项新的挑战,我们将在这项工作中加以解决。通过同时预测描述人与产品交互属性的整数值评分,我们在多任务学习设置中优化了深度神经网络体系结构,从而极大地提高了字幕质量。此外,我们引入了一种新颖的度量标准,该标准使我们能够评估所生成的字幕是否满足我们的要求(即主题,谓词,宾语和产品名称),并描述了一系列有关字幕质量以及如何解决图像注释者分歧的实验使用称为软目标的方法进行评分。我们还表明,我们新颖的以条款为中心的指标也适用于其他图像字幕数据集,例如流行的MSCOCO数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号