Language-Based Image Editing with Recurrent Attentive Models

机译：基于语言的图像编辑与经常性细心模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate the problem of Language-Based Image Editing (LBIE). Given a source image and a natural language description, we want to generate a target image by editing the source image based on the description. We propose a generic modeling framework for two subtasks of LBIE: language-based image segmentation and image colorization. The framework uses recurrent attentive models to fuse image and language features. Instead of using a fixed step size, we introduce for each region of the image a termination gate to dynamically determine after each inference step whether to continue extrapolating additional information from the textual description. The effectiveness of the framework is validated on three datasets. First, we introduce a synthetic dataset, called CoSaL, to evaluate the end-to-end performance of our LBIE system. Second, we show that the framework leads to state-of-the-art performance on image segmentation on the ReferIt dataset. Third, we present the first language-based colorization result on the Oxford-102 Flowers dataset.

机译：我们调查基于语言的图像编辑（LBIE）的问题。给定源图像和自然语言描述，我们想通过基于描述来编辑源图像来生成目标图像。我们为LBIE的两个子任务提出了一个通用的建模框架：基于语言的图像分割和图像着色。该框架使用经常性的细心模型来熔断图像和语言功能。不使用固定的步长，我们向图像的每个区域介绍终端门以动态地确定每个推理步骤后是否继续从文本描述中继续推断附加信息。框架的有效性在三个数据集中验证。首先，我们介绍了一个称为檐口的合成数据集，以评估我们的LBIE系统的端到端性能。其次，我们表明该框架在引用数据集上的图像分段上导致最先进的性能。第三，我们在牛津-102花数据集上介绍了基于语言的彩色结果。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2018年|731p|共9页
会议地点
作者
Jianbo Chen; Yelong Shen; Jianfeng Gao; Jingjing Liu; Xiaodong Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Image segmentation; Task analysis; Natural languages; Logic gates; Image color analysis; Feature extraction; Gallium nitride;

机译：图像分割;任务分析;自然语言;逻辑门;图像颜色分析;特征提取;氮化镓;

相似文献

外文文献
中文文献
专利

1. Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images [J] . Gao Zishu, Li En, Wang Zhe, Neural processing letters . 2021,第1期

机译：基于单个和多个图像的细心复发网络的对象重建
2. A multi-scale attentive recurrent network for image dehazing [J] . Wang Yibin, Yin Shibai, Basu Anup Multimedia Tools and Applications . 2021,第21a23期

机译：用于图像脱水的多尺度殷勤经常性网络
3. Attentive U-recurrent encoder-decoder network for image dehazing [J] . Yin Shibai, Wang Yibin, Yang Yee-Hong Neurocomputing . 2021,第MAYa21期

机译：用于图像脱水的细心U形循环编码器 - 解码器网络
4. Language-Based Image Editing with Recurrent Attentive Models [C] . Jianbo Chen, Yelong Shen, Jianfeng Gao, IEEE/CVF Conference on Computer Vision and Pattern Recognition . 2018

机译：基于循环注意力模型的基于语言的图像编辑
5. Image-based remodeling: a framework for creating, visualizing, and editing image-based models. [D] . Colburn, R. Alex. 2014

机译：基于图像的重塑：用于创建，可视化和编辑基于图像的模型的框架。
6. Analyzing Neuroimaging Data Through Recurrent Deep Learning Models [O] . Armin W. Thomas, Hauke R. Heekeren, Klaus-Robert Müller, 2019

机译：通过反复性深度学习模型分析神经影像数据
7. Language-Based Image Editing with Recurrent Attentive Models [O] . Jianbo Chen, Yelong Shen, Jianfeng Gao, 2018

机译：基于语言的图像编辑与经常性细心模型

Language-Based Image Editing with Recurrent Attentive Models

摘要

著录项

相似文献

相关主题

期刊订阅