Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions

机译：开放式编辑：使用开放词汇表的开放式映像操作

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a novel algorithm, named Open-Edit, which is the first attempt on open-domain image manipulation with open-vocabulary instructions. It is a challenging task considering the large variation of image domains and the lack of training supervision. Our approach takes advantage of the unified visual-semantic embedding space pretrained on a general image-caption dataset, and manipulates the embedded visual features by applying text-guided vector arithmetic on the image feature maps. A structure-preserving image decoder then generates the manipulated images from the manipulated feature maps. We further propose an on-the-fly sample-specific optimization approach with cycle-consistency constraints to regularize the manipulated images and force them to preserve details of the source images. Our approach shows promising results in manipulating open-vocabulary color, texture, and high-level attributes for various scenarios of open-domain images (Code is released at https://github.com/xh-liu/Open-Edit).

机译：我们提出了一种名为Open-Edit的新颖算法，这是使用开放词汇表的开放式映像操作的第一次尝试。考虑到图像域的大变异以及缺乏培训监督，这是一项有挑战性的任务。我们的方法利用了在一般图像标题数据集上覆盖的统一视觉语义嵌入空间，并通过在图像特征映射上应用文本引导的矢量算法来操纵嵌入式视觉功能。然后，结构保存的图像解码器从操纵的特征映射生成操纵图像。我们进一步提出了一种具有循环一致性约束的循环一致性的优化方法，以规范操纵图像并强迫它们以保留源图像的细节。我们的方法显示有希望的导致操作开放词汇的颜色，纹理和高级属性，用于各种场景的开放式域图像（代码在https://github.com/xh-liuu/open -edit发布）。

著录项

来源
《European Conference on Computer Vision》|2020年|817p|共18页
会议地点
作者
Xihui Liu; Zhe Lin; Jianming Zhang; Handong Zhao; Quan Tran; Xiaogang Wang; Hongsheng Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic 3D reconstruction of SEM images based on Nano-robotic manipulation and epipolar plane images [J] . Ding Weili, Zhang Yanxin, Lu Haojian, Ultramicroscopy . 2019,第期

机译：基于纳米机器机械操纵和截图图像的SEM图像自动三维重建
2. Location Instruction-Based Motion Generation for Sequential Robotic Manipulation [J] . Shao Quanquan, Hu Jie, Wang Weiming, Quality Control, Transactions . 2020,第期

机译：基于位置指令的顺序机器人操纵运动生成
3. The Effects of Instruction Manipulation on Motor Performance Following Action Observation [J] . Silvi Frenkel-Toledo, Moshe Einat, Zvi Kozol Frontiers in Human Neuroscience . 2020,第4期

机译：教学操纵对动作观察后电机性能的影响
4. Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions [C] . Xihui Liu, Zhe Lin, Jianming Zhang, European Conference on Computer Vision . 2020

机译：开放式编辑：使用开放词汇表的开放式映像操作
5. Digital image manipulation detection on Facebook images. [D] . Marrion, Charina G. 2016

机译：Facebook图像上的数字图像操纵检测。
6. The Effects of Instruction Manipulation on Motor Performance Following Action Observation [O] . Silvi Frenkel-Toledo, Moshe Einat, Zvi Kozol 2020

机译：动作观察后指令操作对运动成绩的影响
7. Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions [O] . Xihui Liu, Zhe Lin, Jianming Zhang, 2020

机译：开放式编辑：使用开放词汇表的开放式映像操作
8. Development of a Systematic and Quantitative Method for Evaluating Instructional Material. Instructional Material Adequacy Guide and Evaluation Standard (IMAGES). [R] . Perkins, J. C. 1978

机译：制定教学材料评价的系统定量方法。教学材料充分性指南和评估标准（ImaGEs）。

Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions

摘要

著录项

相似文献

相关主题

期刊订阅