Conference: European Conference on Computer Vision
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions

Abstract

We propose a novel algorithm, named Open-Edit, which is the first attempt at open-domain image manipulation with open-vocabulary instructions. The task is challenging given the large variation across image domains and the lack of training supervision. Our approach takes advantage of a unified visual-semantic embedding space pretrained on a general image-caption dataset, and manipulates the embedded visual features by applying text-guided vector arithmetic to the image feature maps. A structure-preserving image decoder then generates the manipulated images from the manipulated feature maps. We further propose an on-the-fly, sample-specific optimization approach with cycle-consistency constraints to regularize the manipulated images and force them to preserve details of the source images. Our approach shows promising results in manipulating open-vocabulary color, texture, and high-level attributes across a variety of open-domain images. Code is released at https://github.com/xh-liu/Open-Edit.
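The abstract does not spell out the arithmetic itself, so the following is only a minimal sketch of one plausible form of text-guided vector arithmetic on a feature map: at each spatial location, the component of the visual feature along a source-text embedding is removed and the same amount is added along a target-text embedding. The function names, the `alpha` strength parameter, and the toy embeddings are all hypothetical and are not taken from the released code.

```python
import math


def normalize(vec):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def edit_feature_map(feature_map, source_text, target_text, alpha=1.0):
    """Shift every spatial feature from the source concept toward the target.

    feature_map: H x W grid (nested lists) of C-dimensional feature vectors.
    source_text, target_text: C-dimensional text embeddings, assumed to live
        in the same joint visual-semantic space as the image features.
    alpha: hypothetical knob controlling edit strength (an assumption).
    """
    t_src = normalize(source_text)
    t_tgt = normalize(target_text)
    edited = []
    for row in feature_map:
        new_row = []
        for v in row:
            # How strongly this location expresses the source concept.
            weight = alpha * dot(v, t_src)
            # Remove the source direction, add the target direction.
            new_row.append(
                [x - weight * s + weight * t for x, s, t in zip(v, t_src, t_tgt)]
            )
        edited.append(new_row)
    return edited


# Toy 1 x 2 feature map with 3 channels; only the first location
# has any component along the source direction.
toy_map = [[[1.0, 0.0, 0.5], [0.0, 0.0, 0.5]]]
edited = edit_feature_map(toy_map, source_text=[2.0, 0.0, 0.0],
                          target_text=[0.0, 3.0, 0.0])
print(edited)
```

In this sketch, locations that do not respond to the source concept are left unchanged, which is one simple way an edit could stay localized. The structure-preserving decoder and the cycle-consistency optimization described in the abstract are not modeled here.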
