Knowledge-Based Systems

IIT-GAT: Instance-level image transformation via unsupervised generative attention networks with disentangled representations



Abstract

Image-to-image translation is an important research field in computer vision, widely associated with Generative Adversarial Networks (GANs) and dual learning. However, existing methods mainly translate the global image of the source domain into the target domain, which fails to achieve instance-level image-to-image translation, and the translated results in the target domain cannot be controlled. In this paper, an instance-level image-to-image translation network (IIT-GAT) is proposed, which includes an attention module and a feature-encoder module. The attention module guides the model to focus on the instances of interest and generate instance masks, which help separate the instance from the background of an image. The feature-encoder module embeds the images into two different spaces: a domain-invariant content space and a domain-specific attribute space. The content features and attribute features of different images are fed into the generator simultaneously to improve the controllability of image-to-image translation. To this end, we introduce a local self-reconstruction loss that encourages the network to learn the style features of target instances. Overall, our method not only improves the quality of instance-level image-to-image translation but also increases its controllability. Extensive experiments on multiple datasets validate the effectiveness of the proposed framework, and the results show that our method outperforms previous methods. (C) 2021 Elsevier B.V. All rights reserved.
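The two mechanisms the abstract describes, mask-based separation of instance and background and a reconstruction loss restricted to the instance region, can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation; all function names, variable names, and shapes are assumptions.

```python
import numpy as np

def composite_with_mask(generated, background, mask):
    """Blend a translated instance into the source background.

    The attention module in IIT-GAT produces an instance mask; here a
    soft mask (values in [0, 1]) keeps the generated pixels inside the
    instance region and the original pixels elsewhere.
    """
    return mask * generated + (1.0 - mask) * background

def local_self_reconstruction_loss(reconstructed, original, mask):
    """L1 reconstruction error restricted to the masked instance region,
    analogous in spirit to the paper's local self-reconstruction loss."""
    masked_diff = np.abs(reconstructed - original) * mask
    return masked_diff.sum() / max(mask.sum(), 1e-8)

# Toy 4x4 single-channel example: the instance occupies the central 2x2 patch.
background = np.zeros((4, 4))
generated = np.ones((4, 4))
mask = np.zeros((4, 4))
mask[1:3, 1:3] = 1.0

out = composite_with_mask(generated, background, mask)
loss = local_self_reconstruction_loss(out, generated, mask)
```

In this toy case the composite equals the generated image inside the mask and the background outside it, so the masked L1 loss against the generated image is zero; during training the loss would instead penalize style drift of the target instance.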


