首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Language-Based Image Editing with Recurrent Attentive Models
【24h】

Language-Based Image Editing with Recurrent Attentive Models

机译:基于语言的图像编辑与经常性细心模型

获取原文

摘要

We investigate the problem of Language-Based Image Editing (LBIE). Given a source image and a natural language description, we want to generate a target image by editing the source image based on the description. We propose a generic modeling framework for two subtasks of LBIE: language-based image segmentation and image colorization. The framework uses recurrent attentive models to fuse image and language features. Instead of using a fixed step size, we introduce for each region of the image a termination gate to dynamically determine after each inference step whether to continue extrapolating additional information from the textual description. The effectiveness of the framework is validated on three datasets. First, we introduce a synthetic dataset, called CoSaL, to evaluate the end-to-end performance of our LBIE system. Second, we show that the framework leads to state-of-the-art performance on image segmentation on the ReferIt dataset. Third, we present the first language-based colorization result on the Oxford-102 Flowers dataset.
机译:我们调查基于语言的图像编辑(LBIE)的问题。给定源图像和自然语言描述,我们想通过基于描述来编辑源图像来生成目标图像。我们为LBIE的两个子任务提出了一个通用的建模框架:基于语言的图像分割和图像着色。该框架使用经常性的细心模型来熔断图像和语言功能。不使用固定的步长,我们向图像的每个区域介绍终端门以动态地确定每个推理步骤后是否继续从文本描述中继续推断附加信息。框架的有效性在三个数据集中验证。首先,我们介绍了一个称为檐口的合成数据集,以评估我们的LBIE系统的端到端性能。其次,我们表明该框架在引用数据集上的图像分段上导致最先进的性能。第三,我们在牛津-102花数据集上介绍了基于语言的彩色结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号